Zhenzhen Kou

Yahoo! Search Sciences

2821 Mission College Blvd, Santa Clara, CA 95054

Email: zzkou@yahoo-inc.com, Phone: (408) 336-0895

 

BRIEF BIOGRAPHY

 

 

Dr. Zhenzhen Kou is a Relevance Scientist in the Search Sciences Department at Yahoo!. She received her Ph.D. (2007) from the Machine Learning Department of Carnegie Mellon University’s School of Computer Science, and joined Yahoo! in 2008.

 

Her research interests include machine learning and data/text mining, especially relational data mining, information retrieval, and collaborative filtering.

  

 

PROFESSIONAL EXPERIENCE

                                               

 

Yahoo! Inc., Feb. 2008 - present

Search Relevance Scientist

Research areas: information retrieval and machine learning

 

Carnegie Mellon University, May 2005 – Dec. 2007

Research Assistant, CALO project

Research areas: information extraction, statistical relational learning, and data mining

 

 

Carnegie Mellon University, Sep. 2002 – May 2005

Research Assistant, SLIF project

Research areas: image analysis, named entity extraction, web interface design for an SQL database

 

IBM TJ Watson Research Center, May 2007 – Aug. 2007

Technical Co-op, Math. Dept.

Research: worked with Yan Liu to develop a link prediction model for movie recommendation, which placed as Second Runner-up in the KDD Cup 07 competition; compared the performance of several learning algorithms applied to job category classification for IBM employees

 

 

Tsinghua University, Aug. 2000 ~ June 2002

Research Assistant, Bioinformatics group

Research: worked on an image analysis system for judging curative effect in Traditional Chinese Medicine. Research involved designing and implementing image processing algorithms for texture analysis and applying Statistical Learning Theory to image classification.

 

Student Research Training (SRT) Program, Tsinghua University, Aug. 1998 ~ Aug. 1999

Designed and maintained the user interface of the Open Bookmark Service System on the Internet.

 

 

ACADEMIC SERVICE

                                               

 

Program Committee Member, ICML 08

Reviewer for IEEE Transactions on Knowledge and Data Engineering

EDUCATION

 

 

 

Carnegie Mellon University, Pittsburgh PA

Ph.D. in Machine Learning, Dec. 2007

Machine Learning Department, School of Computer Science

Dissertation Title: Stacked Graphical Learning

Advisor: William W. Cohen

 

Tsinghua University, Beijing, P. R. China

M.S. in Pattern Recognition and Intelligent System, June 2002

 

Tsinghua University, Beijing, P. R. China

B. E. in Automation, June 2000

 

 

 

AWARDS & FELLOWSHIPS

 

 

Student Travel Award, PSB 07 and SDM 07

Carnegie Mellon Research Fellowship, Aug. 2002 – Dec. 2007

Tsinghua University Outstanding Student Scholarship, Nov. 2001

Honor of Outstanding Graduate (awarded to top 2% of graduates), Jun. 2000

Tsinghua University Outstanding Student Scholarship, 1996 ~ 1999

 

CONTESTS & PRIZES

 

 

Second Runner-up for Task 1 in KDD Cup 2007

 

PUBLICATIONS

 

 

JOURNAL PUBLICATIONS

 

 

Zhenzhen Kou, William W. Cohen and Robert F. Murphy (2005): High-Recall Protein Entity Recognition Using a Dictionary, Bioinformatics. 2005 Jun; 21 Suppl 1:i266-73.

 

Zhenzhen Kou, Liang Ji and Xuegong Zhang (2001): Karyotyping of CGH human metaphase by using support vector machines, Cytometry, December 2001.

 

 

CONFERENCE PUBLICATIONS

 

Yan Liu, Zhenzhen Kou, Claudia Perlich and Richard Lawrence (2008): Intelligent System for Workforce Classification, in KDD 2008 Workshop on Data Mining for Business Applications.

 

Zhenzhen Kou, Vitor R. Carvalho and William W. Cohen (2007): Online Stacked Graphical Learning, in NIPS 2007 Workshop on Efficient Machine Learning.

 

Zhenzhen Kou and William W. Cohen (2007): Stacked Graphical Models for Efficient Inference in Markov Random Fields, in SIAM International Conference on Data Mining (SDM07), Minneapolis, MN, 2007.

 

Yan Liu and Zhenzhen Kou (2007): Predicting Who Rated What in Large-Scale Datasets, in Proceedings of KDD Cup and Workshop 2007, San Jose, CA, 2007.

 

 

Zhenzhen Kou and William W. Cohen (2007): Notes for Stacked Graphical Models for Efficient Inference in Markov Random Fields, Technical Report: CMU-ML-07-101.

 

Zhenzhen Kou, William W. Cohen and Robert F. Murphy (2007): A Stacked Graphical Model for Associating Information from Text And Images In Figures, in Pacific Symposium on Biocomputing (PSB 07), Maui, 2007.

 

Robert Murphy, Zhenzhen Kou, Juchang Hua, Matthew Joffe and William W. Cohen (2005): Extracting Structured Information from Text and Images in On-line Journal Articles for Localization Proteomics, in Proceedings of SIG: Linking Literature, Information and Knowledge for Biology 2005 (BioLINK ’05), in association with ISMB 2005, Detroit, Michigan.

 

Robert F. Murphy, Zhenzhen Kou, Juchang Hua, Matthew Joffe and William W. Cohen (2004): Extracting and Structuring Subcellular Location Information from On-line Journal Articles: The Subcellular Location Image Finder, in Knowledge Sharing and Collaborative Engineering (KSCE 2004), Virgin Islands, 2004.

 

Zhenzhen Kou, William W. Cohen and Robert F. Murphy (2003): Extracting Information from Text and Images for Location Proteomics, in 3rd Workshop on Data Mining in Bioinformatics (BIOKDD), Washington D.C., 2003.

 

Zhenzhen Kou, Jianhua Xu, Xuegong Zhang and Liang Ji (2001): An Improved Support Vector Machine Using Class-Median Vectors, in 8th International Conference on Neural Information Processing, Shanghai, China, 2001.

 

 

 

TEACHING EXPERIENCE

 

 

 

Teaching Assistant, “Machine Learning (10-701)”, Carnegie Mellon University, Jan. – May 2005

Teaching Assistant, “Database Applications (15-415)”, Carnegie Mellon University, Aug. – Dec. 2005

Teaching Assistant, “Signals and System Analysis”, Tsinghua University, Sep. 2000 ~ Jan. 2001

 

 

TALKS

 

 

Oral Presentation, “Predicting Who Rated What in Large-Scale Datasets”, in KDD Cup Workshop, San Jose, CA, Aug. 2007

 

 

Oral Presentation, “A Stacked Graphical Model for Associating Information from Text And Images In Figures”, in Pacific Symposium on Biocomputing (PSB), Maui, Jan. 2007

 

 

Invited talk, “Stacked Graphical Learning”, in Nielsen BuzzMetrics Pittsburgh, Oct. 2006

 

 

Oral Presentation, “High-Recall Protein Entity Recognition Using a Dictionary”, in 13th International Conference on Intelligent Systems for Molecular Biology (ISMB), Detroit, Aug. 2005

 

 

Oral Presentation, “Extracting Information from Text and Images for Location Proteomics”, in 3rd Workshop on Data Mining in Bioinformatics (BIOKDD), Washington D.C., Aug. 2003

 

 

Poster Presentation, “Online Stacked Graphical Learning”, in NIPS 2007 Workshop on Efficient Machine Learning

 

Poster Presentation, “Stacked Graphical Models for Efficient Inference in Markov Random Fields”, in SIAM International Conference on Data Mining (SDM07), Minneapolis, MN, 2007

Poster Presentation, “Extracting Structured Information from Text and Images in On-line Journal Articles for Localization Proteomics”, in Linking Literature, Information and Knowledge for Biology 2005 (BioLINK ’05)

 

REFERENCES

 

William W. Cohen

Associate Research Professor

Machine Learning Department

Carnegie Mellon University

5000 Forbes Ave

Pittsburgh, PA, 15213

412-268-7664

wcohen@cs.cmu.edu

Robert F. Murphy

Professor

Departments of Biological Sciences, Biomedical Engineering and Machine Learning

Carnegie Mellon University

4400 Fifth Avenue

Pittsburgh, PA 15213

412-268-3480

murphy@cmu.edu

 

Tom Mitchell

Professor

Machine Learning Department

Carnegie Mellon University

5000 Forbes Ave

Pittsburgh, PA, 15213

412-268-2611

Tom.Mitchell@cmu.edu

Christos Faloutsos

Professor

Computer Science Department

Carnegie Mellon University

5000 Forbes Ave

Pittsburgh, PA, 15213

412-268-1457

christos@cs.cmu.edu