45

Betty Yee Man Cheng

Ph. D. Student

Language Technologies Institute

School of Computer Science

Carnegie Mellon University

 

Office: Gates-Hillman Complex 5511

Address:

 

 

Fax:

E-mail:

CMU / LTI (GHC 5407)

5000 Forbes Avenue

Pittsburgh, PA  15213

412-268-6298

ymcheng@cs.cmu.edu

 

Education

 

Carnegie Mellon University, Pittsburgh, PA, USA

Ph.D. in Language Technologies, School of Computer Science

2004-present

Relevant Courses: Intro to Immunology, Biochemistry I, Intermediate Statistics

Carnegie Mellon University, Pittsburgh, PA, USA

M.S. in Language Technologies, School of Computer Science

2002-2004

Thesis: “Prediction of Coupling Specificity of G-Protein Coupled Receptors”
Relevant Courses: Machine Learning (Ph.D. Level), Bioinformatics, Language & Statistics, Information Retrieval, Machine Translation, NLP, Grammar Formalisms

Simon Fraser University, Burnaby, BC, Canada

B.S. Honours in Computer Science

B.S. Major in Mathematics

1998-2002

Graduated with First Class Honours
Relevant Courses: AI, Adv. Algorithms, Databases, Information System Design, Software Engineering, Computers in Biomedicine, Formal Languages & Automata, Programming Languages, Combinatorics, Discrete & Continuous Optimization, Graph Theory, Number Theory, Abstract Algebra

AWARDS

 

·     Best Presentation Award in LTI Student Research Symposium, Language Technologies Institute, Carnegie Mellon University

2003

·     Best Scientific Session Award in Bioinformatics & Data Mining session of Advancing Practice Instruction & Innovation through Informatics conference

2003

·     Best New Research Idea Award in the Biological Language conference, Pittsburgh, Pennsylvania

2003

·     Dean’s Scholarship from Simon Fraser University, Canada

1998-2002

·     Rene Descartes Scholarship from University of Waterloo, Canada

1998

·     First place in Canada in Fermat Mathematics Competition

1997 & 1998

·     First place in British Columbia Colleges High School Math Contest, Senior Div.

1998

·     National Biology Scholar award in National Biology Competition

1998

·     Finalist in Canadian Youth Health Awareness Award essay competition

1995

Journal Publications

 

·     Cheng BY, Carbonell JG, Klein-Seetharaman J (2005).  Protein Classification based on Text Document Classification Techniques.  Proteins: Structure, Function and Bioinformatics, 58(4): 955-970.

·     Cai Y, Snel I, Cheng B, Bharathi BS, Klein C, Klein-Seetharaman J (2004).  BioSim – A Biomedical Character-Based Problem Solving Environment.  International Journal Future Generation Computer Systems - Interaction and Visualisation Techniques for Problem-Solving Environments, 21(7): 1145-1156.

Full-length Conference Papers

 

·     Cheng BY, Carbonell JG (2007).  Combining N-grams and Alignment in G-Protein Coupling Specificity Prediction.  Advances in Bioinformatics & Computational Biology: the 5th Asia-Pacific Bioinformatics Conference, Imperial College Press, p. 363-372.

·     Cheng BY, Carbonell JG, Klein-Seetharaman J (2005).  A Machine Text-Inspired Machine Learning Approach for Identification of Transmembrane Helix Boundaries.  In LNAI 3488: 15th International Symposium on Methodologies for Intelligent Systems, Springer-Verlag, pp. 29-37.

·     Cheng BY, Carbonell JG, Klein-Seetharaman J (2003).  Document Classification of Protein Sequences.  In proceedings of the 1st Biological Language Conference.

Invited Talks

 

·     Cheng BY (2004).  Language Technologist’s Approach to Understanding G-Protein-GPCR Interaction.  In LTI Student Research Symposium, Language Technologies Institute, Carnegie Mellon University, Pittsburgh, Pennsylvania.  (keynote speaker)

Peer-Reviewed Abstracts for Oral & Poster Presentations

 

·     Cheng BY, Carbonell JG, Klein-Seetharaman J (2005).  Combining Alignment and N-grams in G-protein Coupling Specificity Prediction.  In 13th International Conference on Intelligent Systems for Molecular Biology, Detroit, Michigan, USA.  (poster)

·     Cheng BY, Carbonell JG, Klein-Seetharaman J (2004).  Prediction of G-Protein Coupling Specificity of GPCR.  In 12th International Conference on Intelligent Systems for Molecular Biology / European Conference on Computational Biology, Glasgow, Scotland.  (poster)

·     Cheng BY, Klein-Seetharaman J, Carbonell JG (2003).  Identifying Important Words in the Language of Proteins.  In 35th Central Regional Meeting of the American Chemical Society, Pittsburgh, Pennsylvania, USA.  (oral + poster)

·     Cheng BY, Klein-Seetharaman J, Carbonell JG (2003).  A Linguistic Approach to Identification of Motifs and Pharmaceutical Classification of GPCRs.  In Advancing Practice, Instruction and Innovation Through Informatics, Pittsburgh, Pennsylvania, USA.  (oral + poster)  (Best Scientific Session Award for oral presentation)

·     Cheng BY, Klein-Seetharaman J, Carbonell JG (2003).  Identifying Important Words in the Language of Proteins.  In Science 2003: Improving the Human Condition, University of Pittsburgh, Pittsburgh, Pennsylvania, USA.  (poster)

·     Cheng BY, Klein-Seetharaman J, Carbonell JG (2003).  Identifying Important Words in the Language of Proteins.  In LTI Student Research Symposium, Language Technologies Institute, Carnegie Mellon University, Pittsburgh, Pennsylvania, USA.  (oral)  (Best Presentation Award)

·     Cai Y, Snel I, Bharathi BS, Klein C, Klein-Seetharaman J (2003).  Towards Biomedical Problem Solving in a Game Environment.  In International Conference on Computational Science, Melbourne, Australia.  (oral)  (Best Paper award in the conference)

Other Publications

 

·     Cheng BY (2004).  Prediction of Coupling Specificity of G-Protein Coupled Receptors.  Master’s thesis, Language Technologies Institute, School of Computer Science, Carnegie Mellon University.

Projects

 

Immunology Research, Biological Language Modeling Toolkit

Using machine learning to investigate common reactions in the immune system.  Prediction of resistance to a multi-drug treatment for HIV patients with the goal of personalizing treatments to the unique virus population in each patient.  Supervised by Jaime Carbonell.

2005 – present


 

GPCR Research, Biological Language Modeling Toolkit

Performed bioinformatics research on G-protein coupled receptor sequences to make predictions with respect to their classification, secondary structure and coupling specificity.  Supervised by Jaime Carbonell & Judith Klein-Seetharaman.

2002 – 2004

Mindkin

A social networking website developed with 3 students that incorporates elements of gaming using ASP .NET 2.0, AJAX and JavaScript.  Patent pending and incorporated.  Featured in CMU Tartans newspaper and the Chronicle of Higher Education.

2006 – present

BookSmart

An online fan fiction recommendation system based on Naïve Bayes and bag-of-words approach.  Tested by over 60 users in North America and Europe.

2000

Industry Experience

 

InTime Solutions Inc., Burnaby, BC, Canada

Software Engineer

Designed and implemented a major upgrade to scheduling software, Officer Scheduling, specialized for security guards and healthcare professionals.

May 2001
– Jan 2002

Safeway IT, Vancouver, BC, Canada

Database Programmer & Tester

Modified stored procedures and performed regression testing for the Safeway Promotion Planning & Optimization Tools software.

May 2000
 – Aug 2000

Teaching Experience

 

Carnegie Mellon University, Pittsburgh, PA, USA

 

TA & Co-instructor – “Competition Programming & Problem Solving”

2006-2007

Worked with Greg Kesden and Dr. Eugene Fink in selecting and preparing the CMU teams for ACM Programming Competition by creating practice problem sets and co-running practice sessions.  CMU team placed 2nd in East Central N. America region and competed in World Finals in Tokyo, March 2007.

Vancouver, BC, Canada

 

Private Piano Teacher

2001-2002

Taught children age 5 through 10 in beginner’s to grade 4 piano.

Formosa Academy, Vancouver, BC, Canada

 

Teaching Assistant to Dr. Cary Chien

1996-1999

Prepared materials for class, graded homework and provided extra help to students on high school mathematics and calculus.

Extracurricular Activities & Community Service

 

Carnegie Mellon University

Women@SCS (outreach activities & advisor to undergrad web team)
Language Technologies Institute Ph.D. Student Rep
Language Technologies Institute Student Research Symposium Judge/Reviewer
Language Technologies Institute Activities Committee
Campus Judicial committee board member

 

2002 – present

2005 – 2007

2005 & 2006

2003 – 2005
2007 – 2008

 

Piano

First Class Honours in Royal Conservatory of Music Grade 10 Piano Practical exam
First Class Honours in Royal Conservatory of Music Written Pedagogy exam

1991 – 2002

 

Karate

3rd Kyu (brown belt) in North American Shotokan Karate Association, Japan Karate Association

1999 – 2002