Skip to content
View soham97's full-sized avatar

Highlights

  • Pro

Block or report soham97

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Explaining audio differences using language

Python 9 Updated Feb 11, 2025

Unified automatic quality assessment for speech, music, and sound.

Python 381 22 Updated Feb 13, 2025

Code for the paper: MACE: Leveraging Audio for Evaluating Audio Captioning Systems

Python 11 1 Updated Jan 16, 2025

Audio Entailment: Deductive Reasoning for Audio Understanding

10 1 Updated Dec 10, 2024

Awesome speech/audio LLMs, representation learning, and codec models

907 58 Updated Feb 28, 2025

A simple library for Fréchet Audio Distance (FAD) calculation

Python 179 23 Updated Feb 11, 2025

PAM is a no-reference audio quality metric for audio generation tasks

Python 58 5 Updated Jul 19, 2024

Repository for "Training Audio Captioning Models without Audio"

9 2 Updated Sep 26, 2023

An Audio Language model for Audio Tasks

Python 302 15 Updated Apr 19, 2024

Tracking states of the arts and recent results (bibliography) on sound tasks.

32 1 Updated Jan 10, 2023

Web-crawl for "Audio Retrieval with WavText5K and CLAP Training"

Python 49 Updated Nov 10, 2022

Learning audio concepts from natural language supervision

Python 528 39 Updated Sep 18, 2024

Code repo for "Multi-Task Learning for Interpretable Weakly Labelled Sound Event Detection"

Python 16 4 Updated Nov 9, 2022

speech enhancement\speech seperation\sound source localization

1,093 223 Updated Nov 14, 2023

Self-Supervised Speech Pre-training and Representation Learning Toolkit

Python 2,338 492 Updated Feb 12, 2025

Reading list for research topics in Sound AI

179 10 Updated Aug 8, 2024