Autodesk is a global leader in design and manufacturing software with primary markets in engineering, architecture, construction, manufacturing, media, and entertainment industries.
Autodesk found that scaling Amazon EMR resources to handle its data workloads resulted in runaway costs. Its goal was to reduce costs by 50 percent to accommodate for the continual increase in the company’s data processing needs.
Autodesk used Pepperdata Capacity Optimizer for real-time, automated resource optimization to improve the utilization of CPU and memory of its data workloads for reduced costs.
Autodesk significantly increased resource utilization for its Amazon EMR workloads, reduced Amazon EC2 costs by over 50 percent, and freed its platform data team from manual tuning to focus on revenue-generating innovation.
Autodesk uses Apache Spark on Amazon EMR to process and analyze large sets of big data and turn them into insights. While this approach originally proved effective, workload performance for the company became a significant issue due to resource overprovisioning.
“Spark is notoriously hard to tune correctly. People don’t have time to go into every job. As a result, our entire platform just wasn’t as efficient as it could have been.”
—Mark Kidwell, Chief Data Architect, Autodesk Data Platforms and Services
Autodesk incorporated optimization methods like instance rightsizing and autoscaling to improve the price/performance of its workloads, but the data team could not keep up with the constant manual config tuning required for optimization.
As the company accumulated more and more data, the increased compute consumption continued to quickly eat through the budget. Each Amazon EMR cluster was consuming two or three times the planned capacity. In 2020, Autodesk saw its data processing needs increase 10x over the previous year.
The company was concerned that if this trend of doubling capacity and overprovisioning resources continued, it would be overwhelmed by runaway costs, low latencies, and increased downtime.
Autodesk turned to Pepperdata Capacity Optimizer real-time, automated resource optimization to automatically increase the utilization of CPU and memory for the company’s data workloads, minimize overprovisioning waste, and reduce costs.
The company set a goal to reduce application costs on Amazon EMR by 50 percent—and with the immediate utilization improvement, Autodesk reduced overprovisioning to help it achieve its cost reduction target.
Pepperdata Capacity Optimizer provided the Amazon EMR system scheduler with real-time visibility into actual CPU and memory utilization levels inside Autodesk’s data applications to enable more intelligent resource allocation decisions—automatically, continuously, and without the need for application code changes or manual config tuning. As a result, Autodesk realized a 50 percent reduction of Amazon EC2 instance hours.
“After adopting Pepperdata, we were able to see more efficient resource utilization. On average, we were saving 50% on our costs because of Pepperdata, because of the decreased waste and more efficient allocation of resources to applications.”
—Mark Kidwell, Chief Data Architect, Autodesk Data Platforms and Services
Pepperdata Capacity Optimizer enhanced the efficiency of the cloud autoscaler by ensuring new instances were only provisioned when existing instances became fully utilized. The company could run more applications without adding additional hardware and personnel to tune them—saving both money and time for its engineers.
Pepperdata Capacity Optimizer’s observability dashboards also provided Autodesk with thousands of application-level metrics in one aggregated view so the data team could quickly and accurately diagnose, troubleshoot, and resolve both cluster-wide and low-level application issues without having to cross-reference multiple dashboards.
After implementing Pepperdata, Autodesk significantly increased resource utilization for its Amazon EMR workloads, reduced Amazon EC2 costs by over 50 percent, and freed its platform data team from manual tuning with a real-time, automated solution that pays for itself.
“Pepperdata allowed us to significantly increase capacity for our Amazon EMR workloads and reduce our EC2 costs by over 50%. We can focus on our business, while they optimize for costs and performance.”
—Mark Kidwell, Chief Data Architect, Autodesk Data Platforms and Services
Looking for a safe, proven method to reduce waste and cost by 30% or more and maximize value for your cloud environment? Sign up now for a free cost optimization demo to learn how Pepperdata Capacity Optimizer can help you start saving immediately.