William W. Cohen's Papers: Matching/Data Integration
- William Yang Wang, Kathryn Mazaitis, Ni Lao, Tom Mitchell, and William W. Cohen (2015): Efficient Inference and Learning in a Large Knowledge Base: Reasoning with Extracted Information using a Locally Groundable First-Order Probabilistic Logic in Machine Learning, 2015.
- Bhavana Dalvi, Einat Minkov, Partha P. Talukdar, and William W. Cohen (2015): Automatic Gloss Finding for a Knowledge Base using Ontological Constraints in WSDM-2015.
- Jay Pujara, Hui Miao, Lise Getoor and William W. Cohen (2014): Using Semantics & Statistics to Turn Data into Knowledge in AI Magazine 2014.
- William Yang Wang, Kathryn Mazaitis, William W. Cohen (2013): Programming with Personalized PageRank: A Locally Groundable First-Order Probabilistic Logic in CIKM-2013 (Honorable Mention for Best Paper at CIKM-2013). (Originally published as: William Yang Wang, Kathryn Mazaitis, William W. Cohen (2013): Programming with Personalized PageRank: A Locally Groundable First-Order Probabilistic Logic in arxiv 1305.2254; William Yang Wang, Kathryn Mazaitis, William W. Cohen (2013): Programming with Personalized PageRank: A Locally Groundable First-Order Probabilistic Logic in ICML 2103 Workshop on Inferning).
- Einat Minkov and William W. Cohen (2010): Improving Graph-Walk Based Similarity with Reranking: Case Studies for Personal Information Management in TOIS-2010.
- William W. Cohen, Natalie Glance, Charles Schafer, Roy Tromble, Yuk Wah Wong (2009): Data Integration for Many Data Sources using Context-Sensitive Similarity Metrics in limbo.
- Einat Minkov and William Cohen (2007): Learning to Rank Typed Graph Walks: Local and Global Approaches in WebKDD-2007.
- Sarah Zelikovitz, William Cohen, and Haym Hirsh (2007): Extending WHIRL with background knowledge for improved text classification in Information Retrieval 10(1) pp 35-67.
- Einat Minkov and William W. Cohen (2006): An Email and Meeting Assistant using Graph Walks in CEAS-2006.
- Einat Minkov, Andrew Ng and William W. Cohen (2006): Contextual Search and Name Disambiguation in Email using Graphs in SIGIR-2006.
- William W. Cohen (2006): A Graph-Search Framework for GeneId Ranking (Extended Abstract) in BioNLP'06.
- William W. Cohen & Einat Minkov (2006): A Graph-Search Framework for Associating Gene Identifiers with Documents in BMC Bioinformatics.
- Einat Minkov, Richard Wang & William Cohen (2004): Extracting Personal Names from Emails: Applying Named Entity Recognition to Informal Text in NAACL-2005.
- Pradeep Ravikumar, William W. Cohen, Stephen E. Fienberg (2004): A Secure Protocol for Computing String Distance Metrics in PSDM-2004.
- Pradeep Ravikumar & William W. Cohen (2004): A Hierarchical Graphical Model for Record Linkage in UAI 2004.
- William W. Cohen & Sunita Sarawagi (2004): Exploiting Dictionaries in Named Entity Extraction: Combining Semi-Markov Extraction Processes and Data Integration Methods in KDD 2004: 89-98.
- Mikael Bilenko, Ray Mooney, William W. Cohen, Pradeep Ravikumar & Steve Fienberg (2003): Adaptive Name-Matching in Information Integration in IEEE Intelligent Systems 18(5): 16-23 (2003).
- William W. Cohen, Pradeep Ravikumar & Stephen Fienberg (2003): A Comparison of String Metrics for Matching Names and Records in KDD Workshop on Data Cleaning and Object Consolidation.
- William W. Cohen, Pradeep Ravikumar & Stephen Fienberg (2003): A Comparison of String Distance Metrics for Name-Matching Tasks in IIWeb 2003: 73-78.
- William W. Cohen & Jacob Richman (2002): Learning to Match and Cluster Large High-Dimensional Data Sets For Data Integration in KDD 2002: 475-480.
- William W. Cohen & Jacob Richman (2001): Learning to Match and Cluster Entity Names in Proc. of the ACM SIGIR-2001 Workshop on Mathematical/Formal Methods in IR.
- Chumki Basu, Haym Hirsh, William W. Cohen & Craig Neville-Manning (2001): Technical Paper Recommendation: A Study in Combining Multiple Information Sources in J. Artif. Intell. Res. (JAIR) 14: 231-252 (2001). (Originally published as: Chumki Basu, Haym Hirsh, William W. Cohen (1998): Recommendation as Classification: Using Social and Content-Based Information in Recommendation. in AAAI/IAAI 1998: 714-720).
- William W. Cohen, David McAllester, and Henry Kautz (2000): Hardening Soft Information Sources in KDD 2000: 255-259.
- William W. Cohen and Wei Fan (2000): Web-Collaborative Filtering: Recommending Music by Crawling The Web in Computer Networks 33(1-6): 685-698 (2000). (Originally published as: William W. Cohen and Wei Fan (2000): Web-Collaborative Filtering: Recommending Music by Crawling The Web in WWW 2000).
- William W. Cohen (2000): Data Integration using Similarity Joins and a Word-based Information Representation Language in ACM Trans. Inf. Syst. 18(3): 288-321 (2000). (Originally published as: William W. Cohen (1998): Integration of Heterogeneous Databases Without Common Domains Using Queries Based on Textual Similarity in SIGMOD Conference 1998: 201-212 (Won Ten-Year "Test of Time" Award at SIGMOD 2008); William W. Cohen (1997): Knowledge Integration for Structured Information Sources Containing Text (Extended Abstract) in SIGIR Workshop on Networked IR (informal proceedings)).
- William W. Cohen (1999): What Can We Learn from the Web in ICML 1999.
- William W. Cohen (1999): Recognizing Structure in Web Pages using Similarity Queries in AAAI/IAAI 1999: 59-66.
- William W. Cohen (2000): WHIRL: A Word-based Information Representation Language in Artif. Intell. 118(1-2): 163-196 (2000).
- William W. Cohen and Wei Fan (1999): Learning Page-Independent Heuristics for Extracting Data from Web Pages in Computer Networks 31(11-16): 1641-1652 (1999). (Originally published as: William W. Cohen and Wei Fan (1999): Learning Page-Independent Heuristics for Extracting Data from Web Pages in WWW 1999).
- William W. Cohen (1999): Reasoning about Textual Similarity in a Web-Based Information Access in Autonomous Agents and Multi-Agent Systems 2(1): 65-86 (1999).
- William W. Cohen (1999): A Demonstration of WHIRL (demonstration abstract) in SIGIR 1999: 327.
- William W. Cohen (1999): Some Practical Observations on Integration of Web Information in WebDB (Informal Proc.) 1999: 55-60.
- William W. Cohen (1998): The WHIRL Approach to Information Integration in IEEE Intelligent Systems, Sept/Oct 1998, pp 20--23.
- William W. Cohen & Haym Hirsh (1998): Joins that Generalize: Text Classification Using WHIRL in KDD 1998: 169-173.
- William W. Cohen (1998): Providing Database-like Access to the Web Using Queries Based on Textual Similarity (demonstration abstract) in SIGMOD Conference 1998: 558-560.
- William W. Cohen (1998): A Web-based Information System that Reasons with Structured Collections of Text in Agents 1998: 400-407.
- William W. Cohen (1998): The WHIRL Approach to Integration: An Overview in IIWeb 1998 (informal proceedings).
[Selected papers| By topic: Deep Learning| Info Extraction/Reading| Topic Modeling| Learning in Graphs| Matching/Data Integration| Text Categorization| Rule Learning| Explanation-Based Learning| Formal Results| Inductive Logic Programming| Collaborative Filtering| Applications| Intelligent Tutoring| GNAT System| By year: All papers]