Supported Technology

SUPPORTED PLATFORMS

  • Apache Foundation
    The Apache Software Foundation is an American non-profit corporation to support Apache software projects, including the Apache HTTP Server.
  • Amazon AWS
    Amazon Web Services is a subsidiary of Amazon.com that provides on-demand cloud computing platforms to individuals, companies and governments, on a paid subscription basis.
  • Cloudera
    Cloudera provides a software platform for data engineering, data warehousing, machine learning and analytics that runs in the cloud or on premises.
  • Hortonworks
    Hortonworks develops, supports, and provides expertise on a set of open source software designed to manage data and processing for IOT, single view of X, and advanced analytics and machine learning
  • IBM
    IBM manufactures and markets computer hardware, middleware and software, and provides hosting and consulting services in areas ranging from mainframe computers to nanotechnology.
  • MapR
    MapR provides access to a variety of data sources from a single computer cluster, including big data workloads.
  • Microsoft Azure
    Microsoft Azure is a cloud computing service created by Microsoft for building, testing, deploying, and managing applications and services through a global network of Microsoft-managed data centers

SUPPORTED TECHNOLOGY

  • HDFS
    The Hadoop Distributed File System (HDFS) is designed to store very large data sets reliably, and to stream those data sets at high bandwidth to user applications. In a large cluster, thousands of servers both host directly attached storage and execute user application tasks.
  • YARN
    Part of the core Hadoop project, YARN is the architectural center of Hadoop that allows multiple data processing engines such as interactive SQL, real-time streaming, data science and batch processing to handle data stored in a single platform, unlocking an entirely new approach to analytics.
  • HBASE
    HBase is an open-source, non-relational, distributed database modeled after Google's Bigtable and written in Java.
  • MapReduce
    MapReduce is a programming model and an associated implementation for processing and generating big data sets with a parallel, distributed algorithm on a cluster.
  • Spark
    Apache Spark is an open-source distributed general-purpose cluster-computing framework.
  • HIVE
    Apache Hive is a data warehouse software project built on top of Apache Hadoop for providing data query and analysis. Hive gives an SQL-like interface to query data stored in various databases and file systems that integrate with Hadoop.
  • TEZ
    Tez was a mobile payments service by Google, targeted at users in India. It was rebranded to Google Pay on August 28, 2018. It operates atop the Unified Payments Interface, developed by the National Payments Corporation of India.
  • Impala
    Apache Impala is an open source massively parallel processing SQL query engine for data stored in a computer cluster running Apache Hadoop.
  • Kubernetes
    Kubernetes, pronounced "Kū-bər-NəT-ēz" is an open-source container-orchestration system for automating deployment, scaling and management of containerized applications.
  • BIG SQL
    IBM Db2 Big SQL is an enterprise grade, hybrid ANSI-compliant SQL on Hadoop engine, delivering massively parallel processing (MPP) and advanced data query. Big SQL offers a single database connection or query for disparate sources such as HDFS, RDMS, NoSQL databases, object stores and WebHDFS.
  • Kafka
    Apache Kafka is an open-source stream-processing software platform developed by the Apache Software Foundation, written in Scala and Java.