An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.
-
Updated
Oct 10, 2023 - Python
An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.
This code provides word level language identification tool for identifying language for individual words in Code-Mixed text. e.g. The text that includes words from two languages such as Hindi written in roman script, mixed with English.
Implementation of meta-transfer-learning for ASR and LM (ACL 2020)
Multilingual Meta-Embeddings for Named Entity Recognition (RepL4NLP & EMNLP 2019)
Pytorch implementation of CS-Tacotron, a code-switching speech synthesis end-to-end generative TTS model.
Code-switching patterns can be an effective route to improve performance of downstream NLP applications: A case study of humour, sarcasm and hate speech detection
Code-Switching Language Modeling using Syntax-Aware Multi-Task Learning (CALCS 2018, ACL)
💬 MaskLID: Code-Switching Language Identification through Iterative Masking -- ACL 2024
A sequence tagging model with active learning
POSIT aims to segment and tag mixed-text that contains English and C-like code, such that the user both knows what a token is, and within the language it's used in, what role, such as an AST tag or PoS tag, it serves.
Repository containing Abusive Tweet Detection, Location Detection and Gender Detection codes
Implementation of a deep learning model (BiLSTM) to detect code-switching
Code for "CoVoSwitch: Machine Translation of Synthetic Code-Switched Text Based on Intonation Units" (Accepted at ACL-SRW 2024) 🇹🇭
a socket script to obtain chinese phones-sequence for any english word
Official repository for the paper titled "From Machine Translation to Code-Switching: Generating High-Quality Code-Switched Text" accepted at ACL 2021
Chrome extension for translating highlighted English text into Chinglish (a chinese + english hybrid)
Code-Switching Sentence Generation by Generative Adversarial Networks and its Application to Data Augmentation. (Interspeech 2019)
A word level Language Identification (LID) tool for Tagalog-English (Taglish) text.
Code-switching detection for Pacific languages
The official code for the "True Bilingual NMT" paper
Add a description, image, and links to the code-switching topic page so that developers can more easily learn about it.
To associate your repository with the code-switching topic, visit your repo's landing page and select "manage topics."