Project Library

Discover and explore quality open source projects

GPT Researcher is an autonomous agent designed to conduct comprehensive online research on a variety of tasks. The agent generates detailed, factual, and unbiased research reports, with customization options for focusing on relevant resources, outlines, and curricula. Inspired by AutoGPT and recent Plan-and-Solve papers, GPT Researcher addresses issues of speed and determinism by running its agents in parallel rather than synchronously, which yields more stable performance and faster results.

21.0k
Python
Apache License 2.0
146 contributors
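The contrast between parallel and synchronous agent operations described above can be illustrated with a minimal sketch. This is not GPT Researcher's actual code or API: `research_subtopic`, `run_sequential`, `run_parallel`, and `timed` are hypothetical names, and the "research" step is simulated with a short sleep. It only shows why launching independent sub-tasks concurrently tends to finish sooner than running them one after another.

```python
import asyncio
import time


async def research_subtopic(name: str, delay: float = 0.1) -> str:
    # Hypothetical stand-in for one agent's work (e.g. a web search);
    # the sleep simulates waiting on network I/O.
    await asyncio.sleep(delay)
    return f"notes on {name}"


async def run_sequential(topics):
    # Each sub-task waits for the previous one to finish.
    return [await research_subtopic(t) for t in topics]


async def run_parallel(topics):
    # All sub-tasks are started together; gather preserves input order.
    return list(await asyncio.gather(*(research_subtopic(t) for t in topics)))


def timed(coro_fn, topics):
    # Run one strategy and return (results, elapsed seconds).
    start = time.perf_counter()
    results = asyncio.run(coro_fn(topics))
    return results, time.perf_counter() - start


if __name__ == "__main__":
    topics = ["background", "methods", "criticism", "outlook"]
    seq_results, seq_time = timed(run_sequential, topics)
    par_results, par_time = timed(run_parallel, topics)
    print(f"sequential: {seq_time:.2f}s, parallel: {par_time:.2f}s")
```

Both strategies return the same notes in the same order; only the total waiting time differs, since the parallel version overlaps the simulated I/O waits.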

🚀 A simple way to train and use PyTorch models with multi-GPU, TPU, and mixed precision. By supporting parallel training across multiple GPUs and TPUs as well as mixed-precision computation, the project makes model training and inference faster and more efficient.

8628
Python
Apache License 2.0
463 contributors

A project for fast segmentation of any object. It provides an efficient algorithm for precise object segmentation in images and videos, giving researchers and developers in computer vision and image processing a powerful tool.

7829
Python
GNU Affero General Public License v3.0
26 contributors

An autonomous LLM agent designed for complex task solving, capable of emulating human cognition and independently executing intricate tasks. The advent of large language models (LLMs) has ushered in a new era of autonomous agents. XAgent marks a notable technological advance in this field, aimed at tackling complex tasks and mimicking human intelligence.

8297
Python
Apache License 2.0
45 contributors

An open-source implementation of Microsoft's VALL-E X zero-shot TTS (text-to-speech) model. The project offers a robust speech synthesis tool that converts text into natural, fluent speech. Because VALL-E X is open source, it is suitable for research, development, and applications, giving developers access to high-quality speech synthesis technology.

7849
Python
MIT License
9 contributors

Lets you distribute and run LLMs with a single file. It aims to make open-source large language models more accessible by collapsing the complexity of an LLM into a single executable file, known as a "llamafile," which runs locally on most computers without installation.

22.2k
C++
Other
61 contributors

TypeChat is a library that makes it easy to build natural language interfaces. With it, developers can more simply create applications and systems that support natural language interaction. TypeChat offers a convenient way to integrate natural language processing features, helping users interact with systems with less effort.

8426
TypeScript
MIT License
33 contributors

A cross-sensory artificial intelligence model that "connects" diverse forms of information by recognizing relationships among images, video, audio, text, depth, thermal, and inertial measurement unit (IMU) data. This breakthrough helps AI advance toward machines that can better analyze multiple forms of information together.

8606
Python
Other
78 contributors

A method that uses large language models (LLMs) to automatically generate a wide range of open-domain instructions of varying difficulty and skill scope, replacing manual effort. The project aims to improve LLM performance and explore LLM applications in broader fields. With WizardLM, users can create a diverse array of instructions, expanding the range of tasks LLMs can handle.

9369
Python
6 contributors

A project for real-time 3D radiance field rendering. It uses Gaussian splatting to achieve high-quality radiance field rendering, suitable for graphics rendering and visualization. The project provides valuable tools and resources for developers engaged in real-time rendering research and applications.

16.5k
Python
Other
15 contributors

QLoRA is a tool for efficient fine-tuning of quantized large language models (LLMs). It offers methods and techniques for fine-tuning LLMs so that the models adapt to and perform better on specific tasks. QLoRA is a powerful tool for both research and practice with quantized LLMs.

10.4k
Jupyter Notebook
MIT License
17 contributors

A fast, memory-efficient attention algorithm. It accelerates attention and reduces memory usage without any approximations. Numerous institutions and research labs have adopted FlashAttention to speed up training and inference. For AI researchers and engineers in need of high-performance attention algorithms, this project offers a valuable solution.

17.0k
Python
BSD 3-Clause "New" or "Revised" License
118 contributors

A robust large language model (LLM) that generates appropriate API calls, trained on datasets from several large machine learning hubs. It performs strongly, especially in zero-shot settings. For developers and researchers who need powerful natural language processing capabilities, Gorilla is a valuable model.

12.0k
Python
Apache License 2.0
121 contributors

A project providing state-of-the-art foundation models for speech and text translation. It covers technologies and algorithms designed to make communication across speech and text more seamless.

11.5k
Jupyter Notebook
Other
99 contributors

A Notion-style WYSIWYG (What You See Is What You Get) editor with AI-powered autocompletion. It offers users an intuitive and creative way to edit text and content. Novel's autocompletion feature uses AI to provide intelligent suggestions and completions, helping users write documents and articles more efficiently.

14.4k
TypeScript
Apache License 2.0
55 contributors

© 2025 GitHub Fun. All rights reserved.