With less than 24 hours to go until the start of Hadoop Summit San Jose, the Pepperdata team is getting ready for three days filled with announcements, technical sessions, and meetings with customers and partners. A quick look at the conference agenda online makes it evident that the topics... read more →
Jun
08
Apr
16
Enabling customers to rely on Hadoop today, while building toward our long-term vision
We founded Pepperdata in 2012 with the vision of enabling every company to benefit from large-scale distributed computing on huge volumes of rapidly changing data. Sean and I had already built, operated, and witnessed the power of such... read more →
Mar
02
If you came by our booth at Strata, you probably noticed the big jar of peppers in the front. We were encouraging people to guess the number of peppers in the jar for a chance to win a PS4. This was surprisingly fun, as many of our contestants spent a... read more →
Dec
15
One question that we often get is “How is the visibility functionality of Pepperdata different from tools like Ganglia, Cloudera Manager, and Ambari?” We wanted to take some time to address this because, while we’re fans of those tools, visibility in Pepperdata has some important differences in technology and use... read more →
Oct
01
As co-founder of a Hadoop software company in Silicon Valley, I have the privilege of spending time on a daily basis with companies that are on the cutting edge of big data analytics. It’s exciting to be part of an industry that is advancing so quickly, and fascinating to learn... read more →
Sep
19
Sean and I are often asked why we started Pepperdata. We founded the company in May 2012, but our journey with Hadoop began much earlier; both of us started working with Hadoop during its very early days at Yahoo. Sean managed Yahoo’s web search engineering team, which in 2006 was... read more →
Sep
04
Since its introduction less than a decade ago, Hadoop has ushered in a data revolution. Even century-old companies are rapidly transforming themselves into data-driven businesses, driving new revenue streams through data-based products and services that were unimaginable before Hadoop: A travel booking service crunches billions of flight-price records to predict... read more →
Aug
04
This month marks the two-year anniversary of the Apache Hadoop community’s decision to decouple YARN from MapReduce and promote it as a separate sub-project of Apache Hadoop. YARN effectively opens up the Hadoop platform to new applications and modes of processing beyond MapReduce. Hadoop 1 featured HDFS as the data... read more →
Jul
10
When Chad and I started building Pepperdata's product, we knew we needed a Hadoop cluster to test the software on. But we realized that since the whole point of the product is to provide more clarity, control, and capacity for Hadoop operators in the face of limited hardware resources, we... read more →
Jun
23
My team was one of the first to use Hadoop in production – even before it was called Hadoop. It was a powerful platform that we used to support Yahoo's search engine, and we certainly had our share of technical challenges. The cluster inexplicably melted down on more than one... read more →