Live Online WebinarsProduct VideosTechnical Videos (YouTube)Register for Online Customer Training
Intro to Pepperdata: Built-in Chargeback Reports
Quickly Find Rogue Users and Jobs on Hadoop Clusters
Pepperdata – HDFS on Kubernetes: Lessons Learned – Presented by Kimoon Kim
Making OpenTSBD Perform at Massive Scale Meetup
Creatively Visualizing Spark Data
Final Thoughts from Spark Webinar Series–Best Practices for Spark in Production
Question Fourteen–Do cluster mode failures (driver and executors) result in duplicate processing?
Question Twelve–Have you seen data stores for Spark other than HDFS in use?
Question Thirteen–Does cluster mode recommendation apply when running Spark on AWS EMR?
Question Eleven–What's most common? Yarn client mode? Yarn cluster mode? Stand-alone mode?
Question Ten–Are people running more than one version of Spark in the cluster
Question Nine–What are the most popular Spark ecosystems tools are being used?
Question Eight–What common mistakes do Spark users make?
Question Seven–How do I migrate my MapReduce production cluster to use Spark?
Question Six–What lessons have you learned from Spark (problems and gotchas to avoid)?
Question Five–In what use cases have you seen advantages to using MapReduce over Spark?
Question Four–What are people using for streaming and why?