Ian Lane - Research Assistant Professor, Carnegie Mellon University

Recent publications:

Rapid Training of Acoustic Models using GPUs

Innovative Parallel Computing (InPar), 2012

HYDRA: A Hybrid CPU/GPU-based Speech Recognition Engine for Real-Time LVCSR

InterSpeech, 2012

AIDAS: Immersive Interaction within Vehicles

SLT, 2012

Other publications:

Peer-reviewed journal articles

Y. Tam, Ian Lane and T. Schultz, “Bilingual-LSA based Adaptation for Statistical Machine Translation,” Machine Translation, Springer Netherlands, Vol. 21, No. 4, pp. 931-938, December 2008.

Ian Lane, T. Kawahara, T. Matsui, and S. Nakamura, “Out-Of-Domain Utterance Detection using Classification Confidences of Multiple Topics,” IEEE Trans. Audio, Speech & Language Process., Vol.15, No.1, pp. 150-161, 2007.

Ian Lane and T. Kawahara, “Verification of Speech Recognition Results Incorporating In-Domain Confidence and Discourse Coherence Measures,” IEICE Trans., Vol. E89-D, No. 3, pp. 931-938, 2006.

Ian Lane, T. Kawahara, T. Matsui, and S. Nakamura, “Dialogue Speech Recognition by Combining Hierarchical Topic Classification and Language Model Switching,” IEICE Trans., Vol. E88-D, No.3, pp. 446-454, 2005.

Peer-reviewed books, book chapters, books edited

Y. Tam, Ian Lane and T. Schultz, “Rapid Unsupervised Topic Adaptation - A Latent Semantic Approach,” Handbook of Natural Language Processing and Machine Translation, Chapter 3.4.5, pp. 468-486, Springer, ISBN 978-1-4419-7712-0, 2011.

S. Chu, H. Kuo, L. Mangu, Q. Shi, S. Zhang, Y. Qin, Q. Jin, Ian Lane, and Y. Tam, “Toward the State of the Art in Automatic Mandarin Broadcast Speech Transcription,” Handbook of Natural Language Processing and Machine Translation, Chapter 3.5.2, pp. 487-495, Springer, ISBN 978-1-4419-7712-0, 2011.

R. Hsiao, M. Fuhs, Y. Tam, Q. Jin, Ian Lane and T. Schultz, “CMU/InterACT Mandarin Speech Recognition System for GALE,” Handbook of Natural Language Processing and Machine Translation, Chapter 3.5.3, pp. 496-504, Springer, ISBN 978-1-4419-7712-0, 2011.

U. Nallasamy, Ian Lane, M. Fuhs, M. Noamany, Y. Tam, Q. Jin and T. Schultz, “CMU/InterACT Arabic Speech Recognition System for GALE,” Handbook of Natural Language Processing and Machine Translation, Chapter 3.6.4, pp. 535-540, Springer, ISBN 978-1-4419-7712-0, 2011.

M. Paulik, Ian Lane, T. Schultz, “Improving Machine Translation of Spoken Language,” Handbook of Natural Language Processing and Machine Translation, Chapter 3.7.2, pp. 570-580, Springer, ISBN 978-1-4419-7712-0, 2011.

Refereed conference proceedings

J. Kim, J. Chong, Ian Lane, “HYDRA - A Hybrid CPU/GPU Speech Recognition Engine for real-time LVCSR”, Demonstration Poster, SLT 2012

Ian Lane, Y. Ma, and A. Raux, “AIDAS - Immersive Interaction within Vehicles”, Demonstration Poster, SLT 2012

J. Kim, J. Chong, Ian Lane, “Efficient On-The-Fly Hypothesis Rescoring in a Hybrid GPU/CPU-based Large Vocabulary Continuous Speech Recognition Engine,” Interspeech 2012

Ian Lane, V. Prasad, G. Sinha, A. Umuhoza, S. Luo, A. Chandrashekaran and A. Raux, “HRItk: The Human-Robot Interaction ToolKit - Rapid Development of Speech-Centric Interactive Systems in ROS,” NAACL-HLT, SDCTD 2012

D. Cohen, Ian Lane, “A Simulation-based Framework for Spoken Language Understanding and Action Selection in Situated Interaction,” NAACL-HLT, SDCTD 2012

P. Maergner, A. Waibel, Ian Lane, “Unsupervised Vocabulary Selection for Domain-Independent Simultaneous Lecture Translation,” ICASSP, 2012

A. Saluja, Y. Zhang, Ian Lane, “Context-aware Language Modelling for Conversational Speech Translation”, MTSummit, 2011

S. Buthpitiya, J. Chong, Ian Lane. Rapid Training of Acoustic Models using GPUs, Interspeech, 2011

M. Cossalter, P. Sundararajan, and Ian Lane, “Ad-Hoc Meeting Transcription on Clusters of Mobile Devices,” Interspeech, 2011

Z. Sun, Aveek Purohit, Dan Siewiorek, Asim Smailagic, Ian Lane, and Pei Zhang, “PANDAA: Physical Arrangement Detection of Networked Devices through Ambient-Sound Awareness,” Ubicomp 2011

Z. Sun, Aveek Purohit, Dan Siewiorek, Asim Smailagic, Ian Lane, and Pei Zhang, “CoughLoc: Location-Aware Indoor Acoustic Sensing for Non-Intrusive Cough Detection,” Mobisense, 2011

M. Eck, Y. Zhang, Ian Lane and A. Waibel. Jibbigo: Speech-to-Speech Translation on Mobile Devices, SLT, 2010

D. Lim, Ian Lane and A. Waibel. Real-Time Spoken Language Identification and Recognition For Speech-to-Speech Translation, IWSLT, pp. 307-312, 2010

Ian Lane and A. Waibel. Data-Driven Morphological Decomposition and Named-Entity Projection for Field Maintainable Speech-to-Speech Translation, Interspeech, pp. 2882-2885, 2010.

Z. Sun, A. Purohit, K. Yang, N. Pattan, D. Siewiorek, A. Smailagic, Ian Lane and P. Zhang. VMA: An Inexpensive Indoor Acoustic Sensing Platform for In-home Patient Monitoring. MobiSys, 2010

Ian Lane, M. Eck, K. Rottmann and A. Waibel, “Tools for Collecting Speech Corpora via Mechanical-Turk,” In Proc. Creating Speech and Language Data with Amazon's Mechanical Turk, 2010

H. Al-Haj, R. Hsiao, Ian Lane and A. Waibel, “Pronunciation Modelling for Dialectal Arabic Speech Recognition,” In Proc. ASRU, pp. 525-528, 2009.

D. Lim and Ian Lane, “Language Identification for Speech-to-Speech Translation,” In Proc. Interspeech, pp. 2362-2365, 2009.

N. Bach, R. Hsiao, M. Eck, P. Charoenpornsawat, S. Vogel, T. Schultz, Ian Lane, A. Waibel and A. Black, “Incremental Adaptation of Speech-to-Speech Translation,” In Proc. HLT-NAACL, pp. 149-152, 2009.

Ian Lane and A. Waibel, “Class-Based Statistical Machine Translation for Field Maintainable Speech-to-Speech Translation,” In Proc. Interspeech, pp. 2362-2365, 2008.

M. Paulik, S. Rao, Ian Lane, S. Vogel and T. Schultz. Sentence Segmentation and Punctuation Recovery for Spoken Language Translation. In Proc. IEEE-ICASSP, pp. 2362-2365, 2008.

Ian Lane, A. Zollmann, L. Nguyen, N. Bach, A. Venugopal, S. Vogel, K. Rottmann, Y. Zhang and A. Waibel, “The CMU-UKA Statistical Machine Translation Systems for IWSLT 2007,” In Proc. IWSLT, pp. 130-137, 2007.

S. Rao, Ian Lane, and T. Schultz, “Improving Spoken Language Translation by Automatic Disfluency Removal: Evidence from Conversational Speech Transcripts,” In Proc. MT Summit XI, pp. 385-389, 2007.

S. Rao, Ian Lane, and T. Schultz. Optimizing Sentence Segmentation for Spoken Language Translation. In Proc. Interspeech, pp. 2845-2848, 2007.

N. Bach, M. Noamany, Ian Lane, and T. Schultz, “Handling OOV Words In Arabic ASR Via Flexible Morphological Constraints,” In Proc. Interspeech, pp. 2373-2376, 2007.

Y. Tam, Ian Lane, and T. Schultz. Bilingual-LSA Based LM Adaptation for Spoken Language Translation. In Proc. ACL, pp. 520-527, 2007.

B. Zhao, N. Bach, Ian Lane, and S. Vogel. A Log-Linear Block Transliteration Model based on Bi-Stream HMMs. In Proc. HLT, pp. 364-371, 2007.

M. Eck, Ian Lane, N. Bach, S. Hewavitharana, M. Kolss, B. Zhao, A. Hildebrand, S. Vogel, and A. Waibel, “The UKA/CMU Statistical Machine Translation System for IWSLT 2006,” In Proc. IWSLT, pp. 130-137, 2006.

Ian Lane and T. Kawahara, “Utterance Verification Incorporating In-Domain Confidence and Discourse Coherence Measures,” In Proc. Interspeech, pp. 421-424, 2005.

Ian Lane and T. Kawahara, “Incorporating Dialogue Context and Topic Clustering in Out-Of-Domain Detection,” In Proc. IEEE-ICASSP, Vol. 1, pp. 1045-1048, 2005.

Ian Lane, T. Kawahara, T. Matsui, and S. Nakamura. Topic Classification and Verification Modelling for Out-Of-Domain Utterance Detection. In Proc. ICSLP, pp. 2197-2200, 2004.

S. Ueno, Ian Lane and T. Kawahara. Example-based Training of Dialogue Planning Incorporating User and Situation Models. In Proc. ICSLP, pp. 2837-2840, 2004.

Ian Lane, S. Ueno and T. Kawahara. Cooperative Dialogue Planning with User and Situation Models via Example-based Training. In Proc. MMSS, pp. 93-102, 2004.

Ian Lane, T. Kawahara, T. Matsui and S. Nakamura. Out-Of-Domain Detection based on Confidence Measures from Multiple Topic Classification. In Proc. IEEE-ICASSP, Vol. 1, pp. 757-760, 2004.

Ian Lane, T. Matsui, S. Nakamura, and T. Kawahara. Hierarchical Topic Classification for Dialog Speech Recognition based on Language Model Switching. In Proc. EUROSPEECH, pp. 429-432, 2003.

Ian Lane, T. Kawahara, and T. Matsui. Language Model Switching based on Topic Detection for Dialog Speech Recognition. In Proc. IEEE-ICASSP, Vol. 1, pp. 616-619, 2003.

Patents

Ian Lane, Alex Waibel, Methods for Enhancing Speech-to-Speech Translation Systems, US Patent. Feb., 2008.

Ian Lane, Alex Waibel, System and Methods for Maintaining Speech-to-Speech Translation in the Field, International Patent. Feb., 2008.

Ian Lane and Tatsuya Kawahara, Method to Assess Recognition Confidence Incorporating Measures of In-Domain Confidence and Discourse Coherence, Japanese Patent. March, 2005.

Ian Lane, Tatsuya Kawahara, Tomoko Matsui, and Satoshi Nakamura, Training Device for Domain Verifier, Domain Verifying Device for Input Data, and Computer Program, Japanese Patent JP2005164836. Jan., 2003.

Ian Lane, Tatsuya Kawahara, Tomoko Matsui, and Satoshi Nakamura, A Speech Recognition Framework Combining Hierarchical Topic Detection and Topic-Dependent Language Modelling, Japanese Patent JP2004198597. Dec., 2002.
