Hadoop
Open-source software for reliable, scalable, distributed computing.
Top features and benefits of Hadoop include scalability, cost-effectiveness, fault tolerance, flexibility, parallel processing, and community support.
Hadoop Alternatives & Competitors
The best Hadoop alternatives based on verified products, community votes, reviews and other factors.
-
Apache Spark is an engine for big data processing, with built-in modules for streaming, SQL, machine learning and graph processing.
Key Apache Spark features:
Speed, Ease of Use, Advanced Analytics, Scalability
-
PostgreSQL is a powerful, open source object-relational database system.
Key PostgreSQL features:
Open Source, ACID Compliance, Extensible, Advanced SQL Features
-
Draxlr turns SQL data into decisions. Build professional dashboards and data visualizations without technical expertise. Embed analytics anywhere, receive automated alerts, and discover AI-powered insights, all through a straightforward interface.
Key Draxlr features:
Dashboards and Visualizations, Slack Notifications, Email Notifications, Query Builder
-
Apache Storm is a free and open source distributed realtime computation system.
Key Apache Storm features:
Real-Time Processing, Scalability, Fault Tolerance, Broad Language Support
-
MongoDB (from "humongous") is a scalable, high-performance NoSQL database.
Key MongoDB features:
Scalability, Flexible Schema, High Performance, Rich Query Language
-
The Apache Cassandra database is the right choice when you need scalability and high availability without compromising performance.
Key Apache Cassandra features:
Scalability, High Availability, Performance, Flexible Data Model
-
Apache Hive data warehouse software facilitates querying and managing large datasets residing in distributed storage.
Key Apache Hive features:
Scalability, SQL-like Interface, Integration with Hadoop Ecosystem, Schema on Read
-
Amazon Elastic MapReduce (EMR) is a web service that makes it easy to quickly process vast amounts of data.
Key Amazon EMR features:
Scalability, Cost-effectiveness, Ease of Use, Managed Service
-
Apache Kafka is an open-source message broker developed by the Apache Software Foundation and written in Scala and Java.
Key Apache Kafka features:
High Throughput, Scalability, Fault Tolerance, Durability
-
Apache Flink is a streaming dataflow engine that provides data distribution, communication, and fault tolerance for distributed computations.
Key Apache Flink features:
Real-time Stream Processing, Event Time Processing, State Management, Fault Tolerance
-
MySQL is billed as the world's most popular open source database.
Key MySQL features:
Reliability, Performance, Open Source, Scalability
-
ClickHouse is an open-source column-oriented database management system that allows generating analytical data reports in real time.
Key ClickHouse features:
High Performance, Scalability, Real-time Data Ingestion, Cost Efficiency
-
Greenplum Database is an open source parallel data warehousing platform.
Key Greenplum Database features:
Scalability, Open Source, Advanced Analytics, Integration with Hadoop
-
Amazon Kinesis services make it easy to work with real-time streaming data in the AWS cloud.
Key Amazon Kinesis features:
Real-time data processing, Scalability, Fully managed service, Integration with AWS ecosystem