Multimedia Processing
Audio, video and image processing technologies
A tool that connects to a computer via USB (or TCP/IP) for high-definition display and smooth control of Android devices, suitable for GNU/Linux, Windows, and Mac OS, without requiring root privileges.
Spotube is an open-source, cross-platform Spotify client that leverages Spotify's data API and YouTube, Piped.video, or JioSaavn as audio sources, eliminating the need for Spotify Premium. It offers ad-free music streaming, track downloads, and cross-platform compatibility (Windows, macOS, Android, iOS, Linux). Key features include anonymous login, time-synced lyrics, local playback control, and no telemetry or data collection. Spotube is lightweight, privacy-focused, and built for native performance. It supports multiple installation methods, including direct downloads, package managers, and nightly builds. While it excludes Spotify podcasts and shows, it emphasizes supporting creators through alternative engagement. Licensed under BSD-4-Clause, Spotube is a community-driven project with a focus on simplicity and user freedom.
A tool that can quickly package various web pages into MacApp. Compared with Electron, Pake is more compact and easy to use, faster, and has achieved universal shortcut transmission, immersive window, drag-and-drop, and packaging style compatibility features. For developers who need to package web applications into desktop applications, Pake provides a convenient and efficient open-source tool.
A lightweight open source free third-party client for YouTube, which does not require Google service framework and YouTube account login, supports 4K video playback, picture-in-picture mode, search for videos/audio/channels/playlist, download videos/audio/subtitles, etc.
Cobalt is a user-friendly, ad-free media downloader designed for simplicity and efficiency. It allows users to download publicly accessible content by pasting a link, ensuring a seamless experience without ads, trackers, or paywalls. The project is structured as a monorepo, containing source code for the API, frontend, and related packages, along with comprehensive documentation for running and protecting instances. Cobalt operates as a proxy, taking zero liability for user actions and never caching content. It emphasizes ethical use, supporting only free and publicly available content, and is not intended for piracy. Contributions are welcome, guided by clear guidelines, and the project is supported by sponsors like RoyaleHosting. Licensed under AGPL-3.0, Cobalt prioritizes transparency and community engagement.
An efficient lossless audio and video editing tool that can complete lossless compression of audio and video in a few seconds, especially suitable for high-definition video files. It also supports fast extraction of video highlights, addition of music subtitle tracks, and other functions.
A third-party NetEase Cloud Music player with high aesthetics, developed based on Vue.js. It has all the basic functions of a music player, supports MV playback, dark mode, custom shortcuts and other functions.
A BT download client that can directly play videos online, which can download resources through seed files and magnet links, supporting Windows/macOS/Linux platforms
A cross-platform video download command line tool written in Go, supporting almost all video platforms such as TikTok, Bilibili, YouTube, etc., and can control the format, clarity and subtitles of the downloaded videos.
Jitsi Meet is an open-source video conferencing platform offering high-quality HD audio and video, accessible via browsers and mobile apps. Key features include content sharing, chat with private conversations, polls, virtual backgrounds, and interactive tools like raise hand and reactions. It supports web and native SDKs for seamless integration and is scalable for both personal and enterprise use. Users can start meetings instantly with a Google, Facebook, or GitHub account or deploy their own instance using Debian packages or Docker. For enterprise needs, Jitsi as a Service (JaaS) provides a managed solution with branding capabilities. The platform emphasizes security with end-to-end encryption and comprehensive documentation for developers and contributors.
🎥 A video production tool based on React, which provides a way to program videos using React. With Remotion, developers can use React components to create and edit videos, add animations, effects, and interactions, making it easy to produce stunning video content.
A feature-rich Android video player that supports various advanced functions such as subtitles, filters, watermarks, screenshots, and simultaneous playback and caching. It also supports synchronized gravity rotation and manual rotation, making the video playback experience smoother and more personalized. This player is designed for developers and users who pursue high performance and rich features.
You can share the computer screen to any device with a browser, making it the second screen of the computer.
A free YouTube app that focuses on protecting user privacy. It allows users to browse, watch and subscribe to YouTube videos without tracking their activities. FreeTube aims to provide a privacy-friendly way to access YouTube without being interfered by ad tracking or data collection. This is an application dedicated to maintaining user privacy rights.
A Mac audio tool that can automatically pause all background music when playing other audio, and resume the background music after playback ends, and supports controlling the volume of individual applications
An open-source front-end alternative to YouTube. It provides an independent front-end interface that allows users to browse and watch YouTube video content without visiting the official YouTube website. Invidious's open-source nature enables users to freely customize and control their access and experience of YouTube content.
collected many open source software developed based on Python
An open-source command-line GIF generation tool that can automatically generate command-line GIF animations and apply them to tutorial, document, tool demonstration, testing, and other scenarios. The tool supports custom configuration of animation font, spacing, line height, delay, typing speed, theme, deletion, and pointer movement effects, and has very complete functions.
A cross-terminal music player on GitHub: Navidrome. Built based on Material UI design style, it supports all mainstream music formats and has functions such as playback, collection, and album management. This player is compatible with mainstream desktop systems and mobile devices such as macOS, Linux, and Windows, and has a built-in multi-language module that can be translated into other languages at any time.
An open source cross-platform video resource player
A practical Chrome screen recording and annotation plugin, which supports recording and annotation information at any position on the screen, has a click highlight effect, and can export recorded videos in MP4, gif, webm file formats, etc.
A powerful multi-track audio editing and recording tool that can import and export various audio formats, and quickly analyze audio signals through a visual interface.
An open-source video and image management web application written in Go, supporting self-deployment, categorization of videos and images by tags, information extraction, statistics, and other functions. The project supports installation and deployment through Windows, macOS, Linux, Docker, etc., and has multiple CSS styles available for customization.
An open-source virtual browser tool that supports multiple people accessing and controlling the same browser interface at the same time, achieving synchronized picture and sound, with built-in chat system, file transfer and other functions, which can be used for remote collaboration, online movie watching and other scenarios.
If you want to ensure the privacy of data when live streaming with friends, you can build your own real-time streaming server using the Owncast project on GitHub.
A completely free and open source TikTok/TikTok collection tool.
A high-quality, simple and elegant command-line version of NetEase Cloud Music written in Python
A complete voice toolkit that provides functions such as voice recognition (supporting Mandarin), voice enhancement, voice processing, multi-microphone signal processing, and modular customization.
Ultimate camera streaming application with support for RTSP, RTMP, HTTP-FLV, WebRTC, MSE, HLS, MP4, MJPEG, HomeKit, FFmpeg, etc.
An open-source screen sharing tool that allows you to choose from three sharing modes: full-screen, window, and browser tabs. It offers lower latency and higher quality screen sharing experience.
An open source IP toolbox that can check IP addresses, geographical locations, DNS leaks, internet speed tests, Ping tests and website availability, etc., providing 258 security checklists with detailed explanations.
An interesting open source project that can turn the screen of a mobile phone or tablet into a computer's graphic input board and touch screen. It allows you to draw on your mobile phone or tablet and display it on the computer screen, as if you have connected a touch drawing board to the computer.
An open-source application: Gyroflow, which can make the effect of sports videos more stable by adjusting data such as gyroscope and accelerometer. In addition, it also supports real-time preview, parameter adjustment and various complex calculations, GPU processing and rendering functions.
A convenient and practical image and video background removal tool that can help users easily remove the background from images and videos to achieve better visual effects.
An open-source voice chat software with low latency, high communication quality, strong security and privacy, suitable for phone calls or game exchanges.
A cross-platform file transfer tool that is ready to use out of the box, no registration or installation required, just a browser to freely transfer files between any devices on the same local area network, also supports creating temporary public rooms for public network transmission, and uses peer-to-peer transmission to ensure data security and privacy.
An open source and free video online extraction tool, which supports streaming media download, video download, m3u8 file download and Bilibili video download. It has the characteristics of no need to capture packets, mobile playback, batch download, etc., and provides Windows and macOS desktop clients.
🚀🎬ShortGPT - An experimental AI framework for automatic short/video content creation. It enables creators to quickly produce, manage, and deliver content using artificial intelligence and automation.
A tool that converts videos into GIFs, supports any format of video and allows you to select a certain period for conversion.
Microsoft recently open-sourced an AI music project on GitHub, which is based on deep learning and can automatically complete music creation. The creation process mainly consists of two steps: one is music understanding (symbol classification, sound recognition), and the other is music synthesis (song lyrics creation, music melody generation).
An open-source cross-platform IPTV player with a simple and powerful interface, supporting playlist import, automatic updates, channel search, TV archives, grouped channel lists, multiple themes, external players, HTML video players, and multi-language support.
A fast, easy-to-use and highly configurable terminal Spotify player with a minimalist interface, personalized configuration, remote control, lyrics display, cross-platform support, image rendering, desktop notifications, and more, providing rich CLI commands.
A stylish and simple open-source music player on GitHub, designed based on the Material Design concept, with powerful music classification management functions, supporting Windows and Linux systems.
An open source and free tool that allows Xiao Ai speaker to play local music, supports multiple audio formats, voice control playback, automatic search and download of matching songs, supports online playlists, radio stations, Docker deployment and other functions.
An open source animation editing tool that can be opened and used directly in the browser. It supports video cutting, image search filtering, text animation library, layer mask and other functions.
A beauty camera that provides real-time beautification, filters, photography, static picture beautification, special effects, cropping and rotation, night scene enhancement, and image masking and blurring, etc.
Music player, can play local music, Baidu Music, QQ Music, Xiami Music, NetEase Cloud Music
A cross-platform music player that can search NetEase Cloud Music, QQ Music, Xiami Music, and supports QQ login, cloud playlists, etc.
An open source video conferencing system, designed for interactive online tutoring. The system mainly supports real-time video conferencing, screen, whiteboard, file sharing, text message chat, mathematical graphics drawing and other functions.
A set of JavaScript-based bullet controller, providing support from basic skeleton to advanced bullets
An interesting dynamic imaging application that supports the processing of music into visualized dynamic effects and exports it as a video.
AudioKit, an open-source cool audio synthesizer
If you are used to using ffmpeg to process videos, why not try using vdx, a command-line tool, to optimize the process
Convert any music library into a music production sample-library with ML