Data Lake

The Future of Hadoop: A Podcast with Splice Machine’s Monte Zweben

In this episode of the Designing Enterprise Platforms podcast from Early Adopter Research (EAR), EAR’s Dan Woods speaks with Monte Zweben, the CEO and cofounder of Splice Machine. Like Woods, Zweben has been an observer of the Hadoop ecosystem for quite some time. Splice Machine is a distributed SQL database, but it’s also a database […]

The Power of Storage Orchestration: A Podcast with Alluxio’s Steven Mih

On this edition of the Designing Enterprise Platforms podcast from Early Adopter Research (EAR), Dan Woods, the founder and principal analyst at EAR, speaks with Steven Mih, the CEO of a new company called Alluxio. Alluxio comes out of the AMPLab at UC Berkeley, which brought the world Spark […]

Why Analysts Need Hands-On Access to Big Data: The Case for the Data-Native Approach

We’ve now gone down the road with big data, data lakes, and the rise of AI- and machine-learning-powered techniques long enough that it’s appropriate to take a step back and ask a few basic questions about what we’re trying to do and how we’re trying to do it. What’s interesting to me, […]

Harnessing Big Data: A Podcast with Arcadia Data’s Sushil Thomas

In this edition of the Early Adopter Research Podcast, Dan Woods spoke with Sushil Thomas, co-founder and CEO of Arcadia Data, while both were at Strata NYC. Woods has written about how Arcadia Data’s platform is attempting to save the data lake by making it more actionable and useful for companies — and not just a […]

Podcast on Data Fabric with MapR CEO John Schroeder

John Schroeder is the founder and CEO of MapR. He has extensive experience in enterprise software, having worked at Calista Technologies, Rainfinity, BRYO, and Compuware. Our conversation focused on MapR’s unique data fabric technology, which is an evolution of the data warehouse and data lake. The data fabric expands upon these earlier repositories to push […]

Data Lake Failure Modes – Insights from Arcadia Data

To support the research mission Saving Your Data Lake, we’re speaking with various companies to capture their perspectives on why and how data lakes fail. Arcadia Data has experience using data inside the data lake in a new way that allows analysts to interact directly with data in what they call a data-native mode […]

Data Lake Failure Modes – Insights from Podium Data

To support the research mission Saving Your Data Lake, we’re speaking with various companies to capture their perspectives on why and how data lakes fail. During our conversations with Podium Data, we discussed three main failure modes of data lakes:

Polluted data lakes

A polluted data lake occurs when many pilot projects with many tools […]
