Project Library

Discover and explore quality open source projects

Pachyderm - Data Version Control, Data Pipelines and Data Lineage Tools

6222
Go
Apache License 2.0
177 贡献者

Open Messaging - OpenMessaging, aims to establish industry guidelines and provide a common framework for messaging, streaming standards, financial, e-commerce, IoT and big data fields. The design principles are cloud-oriented, simple, flexible and language-independent in distributed heterogeneous environments. Compliance with these specifications will enable the development of heterogeneous messaging applications on all major platforms and operating systems.

725
Java
Apache License 2.0
16 贡献者

CloudEvents - CloudEvents is a new open specification that provides a consistent description standard for event data. The open specification was proposed by the Serverless Working Group under CNCF, and CNCF has established partnerships with multiple cloud service and cloud providers.

5320
Python
Apache License 2.0
133 贡献者

Beam - Apache Beam is a unified model for defining batch and streaming data parallel processing pipelines, as well as a set of language-specific SDKs for building pipelines and Runners to execute them on distributed processing backends, including Apache Apex, Apache Flink, Apache Spark and Google Cloud Dataflow.

8080
Java
Apache License 2.0
1265 贡献者

Storm - Apache Storm is a distributed real-time computing system. Similar to Hadoop providing a set of general primitives for batch processing, Storm processes real-time computing.

6617
Java
Apache License 2.0
386 贡献者

Spark - Apache Spark is a fast and general-purpose cluster computing system for big data. It provides high-level APIs in Scala, Java, Python and R, as well as an optimized engine for generic computation graphs that support data analysis.

41.0k
Scala
Apache License 2.0
2180 贡献者

NiFi - Apache NiFi is an easy-to-use, powerful and reliable system for processing and distributing data

5266
Java
Apache License 2.0
521 贡献者

Heron - Apache Heron (incubating) is a real-time, distributed, fault-tolerant stream processing engine from Twitter

3625
Java
Apache License 2.0
133 贡献者

High-Performance server for NATS.io, the cloud and edge native messaging system.

16.9k
Go
Apache License 2.0
162 贡献者

ActiveMQ - a high-performance message queue under Apache

2349
Java
Apache License 2.0
164 贡献者

RocketMQ - a distributed message and stream platform under Apache, with low latency, high performance, high reliability, trillions of capacity, and dynamic scalability.

21.7k
Java
Apache License 2.0
554 贡献者

Kafka - a distributed stream platform belonging to Apache

29.9k
Java
Apache License 2.0
1262 贡献者

Ehcache - the most widely used Java cache

2043
Java
Apache License 2.0
55 贡献者

A Multithreaded Fork of Redis

11.9k
C++
BSD 3-Clause "New" or "Revised" License
503 贡献者

Redis - can be used as a memory data structure storage tool for databases, caches, and message queues

3
C
Other
1 贡献者

© 2025 GitHub Fun. All rights reserved.