sound-processing

Here are 90 public repositories matching this topic...

iver56 / audiomentations

A Python library for audio data augmentation. Useful for making audio ML models work well in the real world, not just in the lab.

audio python music machine-learning deep-learning dsp sound sound-processing data-augmentation augmentation audio-effects audio-data-augmentation

Updated Apr 25, 2025
Python

asteroid-team / torch-audiomentations

Star

Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.

audio python music machine-learning deep-learning dsp waveform sound pytorch sound-processing data-augmentation augmentation audio-effects differentiable-data-augmentation audio-data-augmentation

Updated Jan 15, 2025
Python

birdnet-team / BirdNET-V1

Star

Soundscape analysis with BirdNET.

lasagne theano birds sound-processing soundscape bioacoustics

Updated Mar 10, 2025
Python

EtienneAb3d / WhisperHallu

Star

Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts

text-to-speech sound-processing vad whisper audio-processing asr noise-removal vocals

Updated Nov 12, 2024
Python

LeviBorodenko / spectrographic

Star

Turn an image into sound whose spectrogram looks like the image.

python audio-visualizer image-processing sound sound-processing spectrogram frequencies audio-processing sound-synthesis image-to-sound

Updated Dec 8, 2022
Python

Yujia-Yan / Transkun

Star

A simple yet effective Audio-to-Midi Automatic Piano Transcription system

audio music midi crf pytorch sound-processing piano transcription piano-transcription audio-to-midi music-transcription automatic-transcription

Updated Nov 22, 2024
Python

KentoNishi / torch-pitch-shift

Star

Pitch-shift audio clips quickly with PyTorch (CUDA supported)! Additional utilities for searching efficient transformations are included.

torch pytorch sound-processing augmentation pitch-shift gpu-support torchaudio audio-augmentation

Updated Sep 25, 2024
Python

verlab / Learning2Dance_CAG_2020

Star

PyTorch implementation of our graph convolutional network (GCN) for human motion generation from music. Also with paired dance-music data for training!

computer-vision sound-processing graph-convolutional-networks gcn multimodal-learning motion-analysis motion-animation motion-synthesis human-motion human-motion-analysis graph-adversarial-learning computer-and-graphics

Updated Jan 28, 2024
Python

SuperKogito / pydiogment

Sponsor

Star

📣 Python library for audio augmentation

audio python machine-learning deep-learning sound sound-processing audio-processing augmentation audio-effects

Updated Jul 6, 2023
Python

davidpraise45 / Audio-Signal-Processing

Star

Removing background noise in a sound file

sound sound-synthesis-processes sound-processing filters background-sound

Updated Jul 1, 2019
Python

Fireboltz / Psychic-CCTV

Star

A video analysis tool built completely in python.

resolution video python3 pytorch sound-processing yolo super-resolution hacktoberfest pyqt5-desktop-application pysimplegui cctv-feed psychich-cctv

Updated Jun 30, 2021
Python

datarootsio / fresh-coffee-listener

Star

Using a raspberry pi, we listen to the coffee machine and count the number of coffee consumption

python counter systemd raspberrypi sound-processing raspbian coffee-machine mfcc-features

Updated Sep 23, 2021
Python

KentoNishi / torch-time-stretch

Star

Time-stretch audio clips quickly with PyTorch (CUDA supported)! Additional utilities for searching efficient transformations are included.

torch pytorch sound-processing augmentation gpu-support torchaudio time-stretch audio-augmentation

Updated Sep 5, 2022
Python

crlandsc / torch-log-wmse

Star

logWMSE, an audio quality metric & loss function with support for digital silence target. Useful for training and evaluating audio source separation systems.

Updated Apr 24, 2025
Python

lockepatton / sonipy

Star

Sonification tool for turning scatter plots into perceptually uniform sound files for science and science access.

astronomy sound data-visualization sound-processing sonification sound-synthesis

Updated Oct 17, 2023
Python

Soundstorm is a cutting-edge AI-powered audio manipulation application designed to provide a rich yet simplified experience for sound designers, algorithmic composers, and experimental audio enthusiasts. From sample pack creation and algorithmic composition to AI text-to-audio and onscreen ChatGPT, Soundstorm is a sonic powerhouse.

midi chatbot sound sound-processing gpt algorithmic-music algorithmic-composition sounds audio-processing random-music audio-tools sound-design text-to-audio audio-toolbox ai-audio gpt-4 chatgpt chat-gpt ai-audio-generation

Updated May 4, 2024
Python

zilliz-bootcamp / audio_search

Star

This project use PANNs for audio tagging and sound event detection, and finally get audio embeddings. Then Milvus is used to search the similarity audio items.

embeddings sound-processing similarity-search sound-detection audio-search vector-search milvus