Long Qin
Long Qin
Office
GHC 6225
Carnegie Mellon University
5000 Forbes Avenue
Pittsburgh, PA 15213
Email
lqin (at) cs (dot) cmu (dot) edu
long.qin (at) mmodal (dot) com
I’m currently a Software Engineer at Duolingo working on speech tasks in the Duolingo learning app and test center. Before that, I worked at M*Modal as a Research Scientist on improving speech recognition for medical transcription. I received my PhD and MS degrees from the Language Technologies Institute of Carnegie Mellon University under the supervision of Prof. Alex Rudnicky. I also received a MS and a BS degree from the University of Science and Technology of China.
CV [pdf]
Research
•Deep learning (DNN) in speech recognition
•Automatic Speech Assessment
•Voice Activity Detection (VAD)
•Out-of-vocabulary (OOV) word learning
•Discriminative acoustic modeling
•Speaker adaptive training (SAT)
•Unsupervised / semi-supervised lexicon learning
•Statistical parametric speech synthesis
Selected PublicationS
•PhD Dissertation: Learning out-of-vocabulary words in automatic speech recognition, Carnegie Mellon University. [document] [presentation]
•Building a vocabulary self-learning speech recognition system, Interspeech-2014. [pdf]
•Learning better lexical properties for recurrent OOV words, ASRU-2013. [pdf]
•Using web text to improve keyword spotting in speech, ASRU-2013. [pdf]
•Finding recurrent OOV words, Interspeech-2013. [pdf]
•OOV word detection using hybrid models with mixed types of fragments, Interspeech-2012. [pdf]
•System combination for out-of-vocabulary word detection, ICASSP-2012. [pdf]
•OOV detection and recovery using hybrid models with different fragments, Interspeech-2011. [pdf]
•The effect of lattice pruning on MMIE training, ICASSP-2010. [pdf]
•Implementing and improving MMIE training in SphinxTrain, CMU Sphinx Workshop 2010. [pdf]
Courses
•10-701 Machine Learning
•11-711 Algorithm for NLP
•11-721 Grammars and Lexicons
•11-733 Multilingual Speech to Speech Translation
•11-741 Information Retrieval
•11-751 Speech Recognition and Understanding
•11-752 Speech II
•11-754 Dialog System
•11-756 Design and Implementation of ASR Systems
•11-761 Language and Statistics
•11-791 Software Engineering
Interests
Football, Soccer, Movie, Ski, Skate