SUPPORTED PLATFORMS

  • The Apache Software Foundation is an American non-profit corporation to support Apache software projects, including the Apache HTTP Server.
    Apache Foundation
  • Amazon Web Services is a subsidiary of Amazon.com that provides on-demand cloud computing platforms to individuals, companies and governments, on a paid subscription basis.
    Amazon AWS
  • Cloudera provides a software platform for data engineering, data warehousing, machine learning and analytics that runs in the cloud or on premises.
    Cloudera
  • Hortonworks develops, supports, and provides expertise on a set of open source software designed to manage data and processing for IOT, single view of X, and advanced analytics and machine learning
    Hortonworks
  • IBM manufactures and markets computer hardware, middleware and software, and provides hosting and consulting services in areas ranging from mainframe computers to nanotechnology.
    IBM
  • MapR provides access to a variety of data sources from a single computer cluster, including big data workloads.
    MapR
  • Microsoft Azure is a cloud computing service created by Microsoft for building, testing, deploying, and managing applications and services through a global network of Microsoft-managed data centers
    Microsoft Azure

SUPPORTED TECHNOLOGY

  • The Hadoop Distributed File System (HDFS) is designed to store very large data sets reliably, and to stream those data sets at high bandwidth to user applications. In a large cluster, thousands of servers both host directly attached storage and execute user application tasks.
    HDFS
  • Part of the core Hadoop project, YARN is the architectural center of Hadoop that allows multiple data processing engines such as interactive SQL, real-time streaming, data science and batch processing to handle data stored in a single platform, unlocking an entirely new approach to analytics.
    YARN
  • HBase is an open-source, non-relational, distributed database modeled after Google's Bigtable and written in Java.
    HBASE
  • MapReduce is a programming model and an associated implementation for processing and generating big data sets with a parallel, distributed algorithm on a cluster.
    MapReduce
  • Apache Spark is an open-source distributed general-purpose cluster-computing framework.
    Spark
  • Apache Hive is a data warehouse software project built on top of Apache Hadoop for providing data query and analysis. Hive gives an SQL-like interface to query data stored in various databases and file systems that integrate with Hadoop.
    HIVE
  • Tez was a mobile payments service by Google, targeted at users in India. It was rebranded to Google Pay on August 28, 2018. It operates atop the Unified Payments Interface, developed by the National Payments Corporation of India.
    TEZ
  • Apache Impala is an open source massively parallel processing SQL query engine for data stored in a computer cluster running Apache Hadoop.
    Impala
  • Kubernetes, pronounced "Kū-bər-NəT-ēz" is an open-source container-orchestration system for automating deployment, scaling and management of containerized applications.
    Kubernetes
  • IBM Db2 Big SQL is an enterprise grade, hybrid ANSI-compliant SQL on Hadoop engine, delivering massively parallel processing (MPP) and advanced data query. Big SQL offers a single database connection or query for disparate sources such as HDFS, RDMS, NoSQL databases, object stores and WebHDFS.
    BIG SQL
  • Apache Kafka is an open-source stream-processing software platform developed by the Apache Software Foundation, written in Scala and Java.
    Kafka

Request a trial to see firsthand how Pepperdata big data solutions can help you achieve big data performance success. Pepperdata’s proven APM solutions provide a 360° degree view of both your platform and applications, with realtime tuning, recommendations, and alerting. See and understand how Pepperdata big data performance solutions helps you to quickly pinpoint and resolve big data performance bottlenecks. See for yourself why Pepperdata’s big data APM solutions are used to manage performance on over 30K Hadoop production clusters.

Request Trial