OPTIMIZE PERFORMANCE FOR YOUR ENTIRE BIG DATA STACK

PLATFORM SPOTLIGHT

APPLICATION SPOTLIGHT

CAPACITY OPTIMIZER

The 451 Take on Cloud-Native: Truly Transformative for Enterprise IT

Helping to shape the modern software development and IT operations paradigms, cloud-native represents a significant shift in enterprise IT. In this report, we define cloud-native and offer some perspective on why it matters and what it means for the industry.

Elements of Big Data APM Success

Pepperdata delivers proven big data APM products, operational experience, and deep expertise.

PLATFORM SPOTLIGHT
PLACEHOLDER

Request a trial to see firsthand how Pepperdata big data solutions can help you achieve big data performance success. Pepperdata’s proven APM solutions provide a 360° degree view of both your platform and applications, with realtime tuning, recommendations, and alerting. See and understand how Pepperdata big data performance solutions helps you to quickly pinpoint and resolve big data performance bottlenecks. See for yourself why Pepperdata’s big data APM solutions are used to manage performance on over 30K Hadoop production clusters.

Request Trial

Resources

Cloudwick Collaborates with Pepperdata to Ensure SLAs and Performance are Maintained for AWS Migration Service

Pepperdata Provides Pre- and Post-Migration Workload Analysis, Application Performance Assessment and SLA Validation for Cloudwick AWS Migration Customers

San Francisco — Strata Data Conference (Booth 926)  — March 27, 2019 — Pepperdata, the leader in big data Application Performance Management (APM), and Cloudwick, leading provider of digital business services and solutions to the Global 1000, today announced a collaborative offering for enterprises migrating their big data to Amazon Web Services (AWS). Pepperdata provides Cloudwick with a baseline of on-premises performance, maps workloads to optimal static and on-demand instances, diagnoses any issues that arise during migration and assesses performance after the move to ensure the same or better performance and SLAs.

“The biggest challenge for enterprises migrating big data to the cloud is ensuring SLAs are maintained without having to devote resources to entirely re-engineer applications,” said Ash Munshi, Pepperdata CEO. “Cloudwick and Pepperdata ensure workloads are migrated successfully by analyzing and establishing a metrics-based performance baseline.”

“Migrating to the cloud without looking at the performance data first is risky for organizations and if a migration is not done right, the complaints from lines of business are unavoidable,” said Mark Schreiber, General Manager for Cloudwick. “Without Pepperdata’s metrics and analysis before and after the migration, there is no way to prove performance levels are maintained in the cloud.”

For Cloudwick’s AWS Migration Services, Pepperdata is installed on customers’ existing, on-premises clusters — it takes under 30 minutes — and automatically collects over 350 real-time operational metrics from applications and infrastructure resources, including CPU, RAM, disk I/O, and network usage metrics on every job, task, user, host, workflow, and queue. These metrics are used to analyze performance and SLAs, accurately map workloads to appropriate AWS instances, and provide cost projections. Once the AWS migration is complete, the same operational metrics from the cloud are collected and analyzed to assess performance results and validate migration success.

To learn more, stop by the Pepperdata booth (926) at Strata Data Conference March 25-28 at Moscone West in San Francisco.

More Info

About Pepperdata
Pepperdata (https://pepperdata.com) is the leader in big data Application Performance Management (APM) solutions and services, solving application and infrastructure issues throughout the stack for developers and operations managers. The company partners with its customers to provide proven products, operational experience, and deep expertise to deliver predictable performance, empowered users, managed costs and managed growth for their big data investments, both on-premise and in the cloud. Leading companies like Comcast, Philips Wellcentive and NBC Universal depend on Pepperdata to deliver big data success.

 Founded in 2012 and headquartered in Cupertino, California, Pepperdata has attracted executive and engineering talent from Yahoo, Google, Microsoft and Netflix. Pepperdata investors include Citi Ventures, Costanoa Ventures, Signia Venture Partners, Silicon Valley Data Capital and Wing Venture Capital, along with leading high-profile individual investors. For more information, visit www.pepperdata.com.

About Cloudwick

Cloudwick is the leading provider of digital business services and solutions to the Global 1000. Its solutions include data migration, business intelligence modernization, data science, cybersecurity, IoT and mobile application development and more, enabling data-driven enterprises to gain competitive advantage from big data, cloud computing and advanced analytics. Learn more at www.cloudwick.com.

###

Contact:
Samantha Leggat
samantha@pepperdata.com

Pepperdata and the Pepperdata logo are registered trademarks of Pepperdata, Inc. Other names may be trademarks of their respective owners.

March 27, 2019

Pepperdata Announces Free Big Data Cloud Migration Cost Assessment to Automatically Select Optimal Instance Types and Provide Accurate Cost Projections

Pepperdata Eliminates Guesswork and Complexity Associated with Identifying Best Candidate Workloads Down to Queue, Job and User Level, for Moving to AWS, Azure, Google Cloud or IBM Cloud

CUPERTINO, Calif. — March 6, 2019 — Pepperdata, the leader in big data Application Performance Management (APM), today announced its new Big Data Cloud Migration Cost Assessment for enterprises looking to migrate their big data workloads to AWS, Azure, Google Cloud or IBM Cloud. By analyzing current workloads and service level agreements, the detailed, metrics-based Assessment enables enterprises to make informed decisions, helping minimize risk while ensuring SLAs are maintained after cloud migration.

The Pepperdata Big Data Cloud Migration Cost Assessment provides organizations with an accurate understanding of their network, compute and storage needs to run their big data applications in the hybrid cloud. Analyzing memory, CPU and IO every five seconds for every task, Pepperdata maps the on-premises workloads to optimal static and on-demand instances on AWS, Azure, Google Cloud, and IBM Cloud. Pepperdata also identifies how many of each instance type will be needed and calculates cloud CPU and memory costs to achieve the same performance and SLAs of the existing on-prem infrastructure.

“When enterprises consider a hybrid cloud strategy, they estimate the cost of moving entire clusters, but that’s not the best approach,” said Ash Munshi, Pepperdata CEO. “It’s far better to identify specific workloads that can be moved to take full advantage of the pricing and elasticity of the cloud. Pepperdata collects and analyzes detailed, granular resource metrics to accurately identify optimal workloads for cloud migration while maintaining SLAs.”

The Big Data Cloud Migration Cost Assessment enables enterprises to:

  • Automatically analyze every workload in your cluster to accurately determine their projected cloud costs
  • Get cost projections and instance recommendations for workloads, queues, jobs, and users
  • Map big data workloads to various instance types including static and on-demand
  • Compare AWS, Azure, Google Cloud, and IBM Cloud

Availability

Pepperdata Big Data Cloud Migration Cost Assessment is available free at pepperdata.com/free-big-data-cloud-migration-cost-assessment. Pepperdata customers should email support@pepperdata.com for their free assessment.

Learn more:

About Pepperdata
Pepperdata (https://www.pepperdata.com) is the leader in big data Application Performance Management (APM) solutions and services, solving application and infrastructure issues throughout the stack for developers and operations managers. The company partners with its customers to provide proven products, operational experience, and deep expertise to deliver predictable performance, empowered users, managed costs and managed growth for their big data investments, both on-premise and in the cloud. Leading companies like Comcast, Philips Wellcentive and NBC Universal depend on Pepperdata to deliver big data success.

 Founded in 2012 and headquartered in Cupertino, California, Pepperdata has attracted executive and engineering talent from Yahoo, Google, Microsoft and Netflix. Pepperdata investors include Citi Ventures, Costanoa Ventures, Signia Venture Partners, Silicon Valley Data Capital and Wing Venture Capital, along with leading high-profile individual investors. For more information, visit www.pepperdata.com.

###

Contact:
Samantha Leggat

925-447-5300
samantha@pepperdata.com

Pepperdata and the Pepperdata logo are registered trademarks of Pepperdata, Inc. Other names may be trademarks of their respective owners.

March 5, 2019

Pepperdata Unveils 360° Reports, Enabling Enterprises to Make More Informed Operational Decisions to Maximize Capacity and Improve Application Performance

360° Reports Empower Executives to Better Understand Financial Impacts of Operational Decisions

CUPERTINO, Calif. — February 19, 2019 — Pepperdata, the leader in big data Application Performance Management (APM), today announced the availability of 360° Reports for Platform Spotlight. Pepperdata 360° Reports leverage the vast amount of proprietary data collected and correlated by Pepperdata to give executives capacity utilization insights so they better understand the financial impacts of operational decisions.

“Pepperdata 360° Reports demonstrate the power of data and the valuable insights Pepperdata provides, enabling enterprises to make more informed and effective operational decisions,” said Ash Munshi, Pepperdata CEO. “Operators get a better understanding of what and where they’re spending, where waste can be reclaimed, and where policy and resource adjustments can be made to save money, maximize capacity and improve application performance.”

360° Reports for Pepperdata Platform Spotlight include:

  • Capacity Optimizer Report: This gives operators insight into memory and money saved by leveraging Pepperdata Capacity Optimizer to dynamically recapture wasted capacity.
  • Application Waste Report: This report compares memory requested with actual memory utilization so operators can optimize resources by changing resource reservation parameters.
  • Application Type Report: This gives operators insight on the technologies used across the cluster and the percentage of each (percentage of Spark jobs, etc.). This provides executives with insights into technology trends to make more data-driven investment decisions.
  • Default Container Size Report: This report identifies jobs using default container size and where any waste occurred so operators can make default container size adjustments to save money.
  • Pepperdata Usage Report: This presents Pepperdata dashboard usage data, highlighting top users, days used, and more to give operators insights to maximize their investment. With this data, operators can identify activities to grow the user base, such as promoting features, scheduling onboarding sessions, and training on custom alarms.

Availability

Pepperdata 360° Reports are available immediately for Pepperdata Platform Spotlight customers. For a free trial of Pepperdata, visit https://www.pepperdata.com/trial.

About Pepperdata
Pepperdata (https://pepperdata.com) is the leader in big data Application Performance Management (APM) solutions and services, solving application and infrastructure issues throughout the stack for developers and operations managers. The company partners with its customers to provide proven products, operational experience, and deep expertise to deliver predictable performance, empowered users, managed costs and managed growth for their big data investments, both on-premise and in the cloud. Leading companies like Comcast, Philips Wellcentive and NBC Universal depend on Pepperdata to deliver big data success.

 Founded in 2012 and headquartered in Cupertino, California, Pepperdata has attracted executive and engineering talent from Yahoo, Google, Microsoft and Netflix. Pepperdata investors include Citi Ventures, Costanoa Ventures, Signia Venture Partners, Silicon Valley Data Capital and Wing Venture Capital, along with leading high-profile individual investors. For more information, visit www.pepperdata.com.

###

Contact:
Samantha Leggat
samantha@pepperdata.com

Pepperdata and the Pepperdata logo are registered trademarks of Pepperdata, Inc. Other names may be trademarks of their respective owners.

Sample report attached.

Sample Capacity Optimizer Report – memory and money saved with Capacity Optimizer

February 19, 2019

Key APM Functions and Metrics – A Primer

For those of us who are APM industry “insiders”, there’s not a lot of mystery when it comes to understanding the functionality and value of application performance management. We tend to take for granted that everyone else shares the same knowledge, but that’s not a wise assumption to make. So we thought it would be worthwhile to provide a brief APM tutorial for those who may not be fully familiar with the subject.

Application performance management (APM) is the monitoring and management of performance and availability of software applications. The primary goal of APM solutions is to maintain an expected level of service and availability by understanding why transactions in your application/workload are slow or failing. For example, a development or operations team can instantly tell from their APM solution if an application is causing some performance spikes. They can then leverage their APM solution to determine the root cause, identify which queries were affected and make appropriate adjustments to resolve the issue.

Here are some examples of common application problems that APM solutions can help to quickly identify and resolve.

  • Track application usage to understand spikes in traffic
  • Find bottlenecks or latency problems with application dependencies including SQL, queues, caching, etc.
  • Identify slow SQL queries
  • Find the highest volume and slowest web pages or transactions
  • Find the root cause(s) for application problems
  • Identify ways to optimize application performance

APM solutions perform a plethora of functions in the quest to optimize application performance, including:

  • Anomaly detection, which enables a rapid response to application behaviors that begin to fall outside the normal operating range, and identify events that do not conform to expected behavioral patterns
  • Application debugging, the process of finding the causes of undesirable effects on application behaviors. This requires identifying defects and errors in program code
  • Distributed profiling of the many potential sources of application performance degradation. This involves identifying and localizing the sources of performance degradation across an ecosystem consisting of applications, services and machines.
  • IT service monitoring shows whether response time and uptime commitments are being met, including adhering to service level agreements and performance thresholds
  • Root cause analysis attempts to determine the probable cause of a problem, then constructing a causality chain correlating case and effect.
  • End-user experience monitoring captures user-based performance data to gauge how well the application is performing and identify potential performance problems.
  • Application topology discovery and visualization is the visual expression of the application in a flow-map to establish all the different components of the application and how they interact with each other.
  • User-defined transaction profiling examines specific interactions to recreate conditions that lead to performance problems for testing purposes.
  • IT operations analytics is the discovery of usage patterns, identification of performance problems, and anticipation of potential problems before they happen.

APM solutions also gather an incredible set of metrics as they continuously monitor the cluster and application environment. Here are just a few of the key metrics that are captured and analyzed.

Average Response Time is the amount of time an application takes to return a query or request to a user. To measure average response time, an application is tested under different circumstances (i.e. number of concurrent users, number of transactions requested) Typically, this metric is measured from the start of the request to the time the last byte is sent. Other factors, like geographic location of the user and the complexity of the information being requested, can affect the average response time for users. These should all be considered in the overall evaluation of application performance.

Error Rates – The last thing you want your users to see are errors. Monitoring error rates is a critical application performance metric.
There are potentially three different ways to track application errors:

  • HTTP Error % – Number of web requests that ended in an error
  • Logged Exceptions – Number of unhandled and logged errors from your application
  • Thrown Exceptions – Number of all exceptions that have been thrown

It is common to see thousands of exceptions being thrown and ignored within an application. Hidden application exceptions can also cause a lot of performance problems.

Request Rate – Understanding how much traffic your application receives will impact the success of your application. Potentially all other application performance metrics are affected by increases or decreases in traffic. Request rates can be useful to correlate to other application performance metrics to understand the dynamics of how your application scales. Monitoring the request rate can also be good to watch for spikes or sudden inactivity. If you have a busy API that suddenly gets no traffic at all, that could signal something bad. A similar but slightly different metric to track is the number of concurrent users.

Application and Server CPU and Memory – If the CPU usage on your machines is extremely high, or you have limited memory resources, you can expect to eventually suffer from application performance problems. Monitoring the CPU and memory usage of your servers, cluster and applications is a basic and critical metric.

Application Availability – Monitoring and measuring if your application is online and available is a key metric you should be tracking. Most companies use this as a way to measure uptime for service level agreements (SLA).

All metrics should be evaluated over time, and one that is critical should be fed into a rules engine that raises alerts when a set threshold is exceeded. Ultimately all metrics can and should be used to understand what is normal/typical for your application so that abnormal/atypical behavior can be detected, analyzed and resolved.

Other Things You Can Do

Interested in learning more about APM and how it can be applied to optimize your application and infrastructure environment?

May 14, 2019

Three Keys to Understanding Application Performance in the Cloud

The key cloud advantages of self-service, automatic provisioning, and rapid elasticity come at the cost of increased complexity at the application level. Each newly provisioned instance can have a hidden impact on the performance of already-running applications, an impact that may be visible only when we look at the underlying shared infrastructure.

Cloud features such as automatic provisioning and workload management allow us to ignore the relationship between VMs and assigned underlying hardware, but our applications must still run on the actual hardware. And when things go wrong we need to be able to track down the problem quickly and accurately.

As technology evolves, new challenges will arise. One thing remains the same: We need to understand the impact of resource utilization and hardware latencies on the performance of our application and workload and maintain the correlation between application and hardware at any given point to resolve issues. Here are three ways you can achieve better insight into the performance of your applications in the cloud.

Make sure your APM/IPM solution offers fine-grained visibility

An effective application performance management (APM)/infrastructure performance management (IPM) solution for the cloud optimizes your time by automatically providing you with time-series performance data that spans your big data hardware, software, and applications, sampling every five seconds and allowing you to solve the hard problems. Real-time visibility into how an application is using resources beyond CPU and memory is required to find contention and resource hogs wherever they lurk. The ROI benefit is measured in hundreds of man-hours that can be spent on other tasks, as well as the money you don’t have to spend running inefficient apps that are unnecessarily fighting for resources. Pepperdata provides application tuning recommendations, auto-generated reports, and insight tables that give you actionable observations and recommendations, not just raw data.

Deploy an APM/IPM solution that provides AI-driven optimization

The primary driver for moving to the cloud is cost-savings. On-premises, over-provisioning for workloads is a given, and resource costs are a secondary concern. But in the cloud, you pay for every minute of compute and storage resources that you use, so over-provisioning is a major issue.  A standard YARN deployment allows for CPU and memory resources to be reserved and not used by applications as a normal practice. This inefficiency is wasting resources and driving up your cloud costs. By auto-tuning the platform, these inefficiencies are reduced by up to 50%. Pepperdata leverages AI to automatically optimize your big data platform in response to inefficient CPU and memory allocation. This programmatic approach to tuning is done at scale affecting thousands of applications simultaneously and eliminates the time-consuming hassle of trying to manually tune every job.  

Choose an APM/IPM solution that includes a world-class support team

Pepperdata leverages the richest set of performance data available to tackle unknown challenges that stem from managing a complex, ever-changing combination of big data hardware and software components, and delivers that information continuously to an industry-leading UI.  But new application and infrastructure demands are constantly emerging. Pepperdata Support and Data Science teams are ready to partner with you to solve the questions that can’t be easily answered in a static UI. With the combined expertise of your team and ours, you will be well-prepared to tackle any new performance problems that arise.

Getting a true picture of application performance in the cloud can be challenging if you don’t have the right tools. You can rely on Pepperdata to answer the currently unknown questions with performance metrics gathered every five seconds from each layer of the big data hardware, software, and application stack, and retained for long-term analysis.  We can help you solve the most challenging performance problems by leveraging our AI-based automation and insightful visualizations. No matter where you deploy Pepperdata, you can be assured that solutions and features are fully cross-compatible and support hybrid cloud architectures with zero loss in functionality.

With an eclectic combination of on-premises and cloud deployments represented across hundreds of clusters, Pepperdata optimizes application and infrastructure performance at scale for over thirty thousand production nodes.  The world’s largest enterprises rely on Pepperdata to help them meet the SLAs they need for their business-critical applications.

Other Things You Should Do While You’re Here

May 9, 2019

It’s All About the Cloud: Insights from the 2019 Spark + AI Summit in San Francisco

Apache Spark is playing a critical role in the adoption and evolution of Big Data technologies because it provides sophisticated ways for enterprises to leverage Big Data compared to Hadoop. The increasing amounts of data being analyzed and processed through the framework is massive and continues to push the boundaries of the engine.

Built on the premise that Apache Spark is the only unified analytics engine that combines large-scale data processing with state-of-the-art machine learning and AI algorithms, the 2019 Spark + AI Summit rolled into San Francisco last week.  The event was billed as the largest data and machine learning conference in the world. The Pepperdata team was there to exhibit and meet customers and prospects, giving us an opportunity to learn more about customer wants and needs in the rapidly evolving Big Data application and infrastructure performance (APM / IPM) environment.

This was not a large-scale event, but it was very well attended.  It’s clear that Big Data is playing an increasingly important role in business operations that rely on massive amounts of data to support revenue-generating online applications.  Think always-on streaming services, on-demand applications, and business-critical transactional processing for retail, travel, banking, insurance, and healthcare services.

We saw a constant stream of visitors to the Pepperdata exhibit booth with many large enterprises being represented. Here are some of our impressions and takeaways from this event…

Although focused on Spark and AI, the common thread that ran through the summit was the adoption of and migration to the cloud. The big cloud service providers were in evidence, including the Big Four: AWS, Azure, Google Cloud and IBM Cloud.  And of course, Databrick runs its unified analytics platform in the cloud. Almost every exhibitor had booth messaging that referenced the cloud, and many sessions at the event had a cloud theme. Up, up and away!

We spoke with a lot of people who are considering managed platforms, ephemeral clusters, or no cluster at all because they are frustrated with wrestling with large Hadoop platforms. Data science in the cloud is happening at scale, but cloud can be hard to manage. So, many enterprises are choosing to off-load cloud management to vendors like Databricks and Snowflake.  We also had many visitors to our booth who were adopting a hybrid cloud strategy involving two or more cloud service providers; one as primary and the other as failover.

Attendees who stopped by the Pepperdata booth were interested to learn about our Big Data Cloud Migration Cost Assessment, which takes the stress and guesswork out of cloud migration by automating the task of profiling on-premises workloads and identifying cost- and performance-optimized cloud instances on any and all of The Big Four cloud service providers.

And if you’re a data scientist or Spark programmer, the many postings cluttering the job board indicated the high demand for your services.  Three of the “exhibiting” vendors were at the event for the stated purpose of recruiting.

In summary, the 2019 edition of the Spark + AI summit in San Francisco proved to be a rich learning experience for attendees, including the Pepperdata team.  We’re grateful to the many customers, prospects and friends who paid us a visit at the show! In spite of all the modern video and voice conferencing tools at our disposal, you simply can’t beat face-to-face, human interaction. That’s something that AI will never replace.

Other Things You Can Do

May 1, 2019