Turing Center at University of Washington

Investigating problems at the crossroads of natural language processing, data mining, Web search, and the Semantic Web.

Turing Center Home Turing Center People Turing Center Publications Turing Center Press Turing Center Events Turing Center Jobs Turing Center Contact
 

Publications by year

Click on the icon or paper title to retrieve copies of the papers.

2013

"Modeling Missing Data in Distant Supervision for Information Extraction"
Alan Ritter, Luke Zettlemoyer, Mausam, and Oren Etzioni
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing (EMNLP 2013) and reprinted in Transactions of the ACL, 1: 367−378, 2013

"Generating Coherent Event Schemas at Scale"
Niranjan Balasubramanian, Stephen Soderland, Mausam, and Oren Etzioni
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing (EMNLP 2013)

"Paraphrase-Driven Learning for Open Question Answering"
Anthony Fader, Luke Zettlemoyer, and Oren Etzioni
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (ACL 2013)

"Towards Coherent Multi-Document Summarization"
Janara Christensen, Mausam, Stephen Soderland, and Oren Etzioni
Proceedings of the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT 2013)

"Extracting Knowledge from Twitter and The Web"
Alan Ritter
University of Washington Ph.D. Dissertation

"Modeling User Behavior and Attention in Search"
Jeff Huang
University of Washington Ph.D. Dissertation

2012

"RevMiner: An Extractive Interface for Navigating Reviews on a Smartphone"
Jeff Huang, Oren Etzioni, Luke Zettlemoyer, Kevin Clark, and Christian Lee
Proceedings of the 25th ACM Symposium on User Interface Software and Technology (UIST 2012)

"Entity Linking at Web Scale"
Thomas Lin, Mausam, and Oren Etzioni
Proceedings of the Knowledge Extraction Workshop at NAACL-HLT 2012 (AKBC-WEKEX 2012)

"Rel-grams: A Probabilistic Model of Relations in Text"
Niranjan Balasubramanian, Stephen Soderland, Mausam, and Oren Etzioni
Proceedings of the Knowledge Extraction Workshop at NAACL-HLT 2012 (AKBC-WEKEX 2012)

"No Noun Phrase Left Behind: Detecting and Typing Unlinkable Entities"
Thomas Lin, Mausam, and Oren Etzioni
Proceedings of the 2012 Conference on Empirical Methods in Natural Language Processing (EMNLP 2012)

"Open Language Learning for Information Extraction"
Mausam, Michael Schmitz, Robert Bart, Stephen Soderland, and Oren Etzioni
Proceedings of the 2012 Conference on Empirical Methods in Natural Language Processing (EMNLP 2012)

"Open Domain Event Extraction from Twitter"
Alan Ritter, Mausam, Oren Etzioni, and Sam Clark
Proceedings of the 18th ACM SIGKDD international Conference on Knowledge Discovery & Data Mining (KDD 2012)

"Leveraging Knowledge Bases in Web Text Processing"
Thomas Lin
University of Washington Ph.D. Dissertation

2011

"Search needs a shake-up"
Oren Etzioni
Nature, 476: 25-26, August 4, 2011

"Identifying Relations for Open Information Extraction"
Anthony Fader, Stephen Soderland, and Oren Etzioni
Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing (EMNLP 2011)

"Named Entity Recognition in Tweets: An Experimental Study"
Alan Ritter, Sam Clark, Mausam, and Oren Etzioni
Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing (EMNLP 2011)

"Open Information Extraction: the Second Generation"
Oren Etzioni, Anthony Fader, Janara Christensen, Stephen Soderland, and Mausam
Proceedings of the 22nd International Joint Conference on Artificial Intelligence (IJCAI 2011)

"An Analysis of Open Information Extraction based on Semantic Role Labeling"
Janara Christensen, Mausam, Stephen Soderland, and Oren Etzioni
Proceedings of the 6th International Conference on Knowledge Capture (K-CAP 2011)

"Global Learning of Typed Entailment Rules"
Jonathan Berant, Ido Dagan, and Jacob Goldberger
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics (ACL 2011)
Best Student Paper Award

"Markov Logic for Machine Reading"
Hoifung Poon
University of Washington Ph.D. Dissertation

"Inference Over the Web"
Stefan Schoenmackers
University of Washington Ph.D. Dissertation

2010

"Adapting Open Information Extraction to Domain-Specific Relations"
Stephen Soderland, Brendan Roof, Bo Qin, Shi Xu, Mausam, and Oren Etzioni
AI Magazine, 31(3): 93-102, 2010

"Commonsense from the Web: Relation Properties"
Thomas Lin, Mausam, and Oren Etzioni
Proceedings of the 2010 AAAI Fall Symposium on Commonsense Knowledge

"Learning First-Order Horn Clauses from Web Text"
Stefan Schoenmackers, Oren Etzioni, Daniel S. Weld, and Jesse Davis
Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing (EMNLP 2010)

"Identifying Functional Relations in Web Text"
Thomas Lin, Mausam, and Oren Etzioni
Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing (EMNLP 2010)

"Analysis of a Probabilistic Model of Redundancy in Unsupervised Information Extraction"
Doug Downey, Oren Etzioni, and Stephen Soderland
Artificial Intelligence, 174(11): 726-748, 2010

"Panlingual Lexical Translation via Probabilistic Inference"
Mausam, Stephen Soderland, Oren Etzioni, Daniel S. Weld, Kobi Reiter, Michael Skinner, Marcus Sammer, and Jeff Bilmes
Artificial Intelligence, 174(9-10): 619-637, 2010

"PanLex and LEXTRACT: Translating all Words of all Languages of the World"
Timothy Baldwin, Jonathan Pool, and Susan M. Colowick
Proceedings of the 23th International Conference on Computational Linguistics (COLING 2010)

"Panlingual Globalization"
Jonathan Pool
Handbook of Language and Globalization, ed. Nikolas Coupland (Wiley-Blackwell), Chapter 6: 142–161, 2010

"Extracting Sequences from the Web"
Anthony Fader, Stephen Soderland, and Oren Etzioni
Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics (ACL 2010)

"A Latent Dirichlet Allocation method for Selectional Preferences"
Alan Ritter, Mausam, and Oren Etzioni
Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics (ACL 2010)

"Panlingual Lexical Translation via Probabilistic Inference"
Mausam, Stephen Soderland, and Oren Etzioni
Proceedings of the 24th Conference on Artificial Intelligence (AAAI 2010)

"Semantic Role Labeling for Open Information Extraction"
Janara Christensen, Mausam, Stephen Soderland, and Oren Etzioni
Proceedings of the Workshop on Formalisms and Methodology for Learning by Reading (FAM-LbR) at NAACL 2010

"Machine Reading at the University of Washington"
Hoifung Poon, Janara Christensen, Pedro Domingos, Oren Etzioni, Raphael Hoffmann, Chloe Kiddon, Thomas Lin, Xiao Ling, Mausam, Alan Ritter, Stefan Schoenmackers, Stephen Soderland, Dan Weld, Fei Wu, and Congle Zhang
Proceedings of the Workshop on Formalisms and Methodology for Learning by Reading (FAM-LbR) at NAACL 2010

"Evaluating Lemmatic Communication"
Katherine Everitt, Christopher Lim, Oren Etzioni, Jonathan Pool, Susan Colowick, and Stephen Soderland
Journal of Translation and Technical Communication Research, 3: 70-84, 2010

"Translators in a Global Community"
Jonathan Pool
In Humphrey Tonkin and Maria Esposito Frank (eds.), The Translator as Mediator of Cultures, John Benjamins Publishing Company, Amsterdam: 73-85, 2010

"Structure Learning in Markov Logic Networks"
Stanley Kok
University of Washington Ph.D. Dissertation

2009

"Identifying Interesting Assertions from the Web"
Thomas Lin, Oren Etzioni, and James Fogarty
Proceedings of the 18th Conference on Information and Knowledge Management (CIKM 2009)

"Data Integration for the Relational Web"
Michael J. Cafarella, Alon Halevy, and Nodira Khoussainova
PVLDB, 2(1): 1090-1101, 2009

"Lemmatic Machine Translation"
Stephen Soderland, Christopher Lim, Mausam, Bo Qin, Oren Etzioni, and Jonathan Pool
Proceedings of Machine Translation Summit XII, 2009

"Compiling a Massive, Multilingual Dictionary via Probabilistic Inference"
Mausam, Stephen Soderland, Oren Etzioni, Daniel S. Weld, Michael Skinner, and Jeff Bilmes
Proceedings of the 47th Annual Meeting of the Association for Computational Linguistics and 4th International Joint Conference on Natural Language Processing (ACL-IJCNLP 2009)

"A Rose is a Roos is a Ruusu: Querying Translations for Web Image Search"
Janara Christensen, Mausam, and Oren Etzioni
Proceedings of the 47th Annual Meeting of the Association for Computational Linguistics and 4th International Joint Conference on Natural Language Processing (ACL-IJCNLP 2009)

"What Is This, Anyway: Automatic Hypernym Discovery"
Alan Ritter, Stephen Soderland, and Oren Etzioni
Proceedings of the 2009 AAAI Spring Symposium on Learning by Reading and Learning to Read

"Unsupervised Methods for Determining Object and Relation Synonyms on the Web"
Alexander Yates and Oren Etzioni
Journal of Artificial Intelligence Research, 34: 255-296, 2009
appendix

"Learning to Generalize for Complex Selection Tasks"
Alan Ritter and Sumit Basu
Proceedings of the 2009 International Conference on Intelligent User Interfaces (IUI 2009)
Best Student Paper Award

"Extracting and Querying a Comprehensive Web Database"
Michael J. Cafarella
Proceedings of the 4th Biennial Conference on Innovative Data Systems Research (CIDR 2009)

"Extracting and Managing Structured Web Data"
Michael J. Cafarella
University of Washington Ph.D. Dissertation

"Open Information Extraction for the Web"
Michele Banko
University of Washington Ph.D. Dissertation

2008

"Open Information Extraction from the Web"
Oren Etzioni, Michele Banko, Stephen Soderland, and Daniel S. Weld
Communications of the ACM, 51(12): 68-74, 2008

"Web-Scale Extraction of Structured Data"
Michael J. Cafarella, Jayant Madhavan, and Alon Halevy
SIGMOD Record, 37(4): 55-61, 2008

"Look Ma, No Hands: Analyzing the Monotonic Feature Abstraction for Text Classification"
Doug Downey and Oren Etzioni
Proceedings of the 22nd Annual Conference on Neural Information Processing Systems (NIPS 2008)

"Exploiting Hyponymy in Extracting Relations and Enhancing Ontologies"
Bhushan Mandhani and Stephen Soderland
Proceedings of the IEEE/WIC/ACM Conference on Web Intelligence Workshop on Natural Language Processing and Ontology Engineering (NLPOE 2008)

"Scaling Textual Inference to the Web"
Stefan Schoenmackers, Oren Etzioni, and Daniel S. Weld
Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing (EMNLP 2008)

"It's a Contradiction -- No, It's Not: A Case Study using Functional Relations"
Alan Ritter, Doug Downey, Stephen Soderland, and Oren Etzioni
Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing (EMNLP 2008)

"Ontology-driven, unsupervised instance population"
Luke K. McDowell and Michael Cafarella
Journal of Web Semantics, 6(3): 218-236, 2008

"WebTables: Exploring the Power of Tables on the Web"
Michael J. Cafarella, Alon Halevy, Daisy Zhe Wang, Eugene Wu, and Yang Zhang
Proceedings of the 34th International Conference on Very Large Data Bases (VLDB 2008)

"Uncovering the Relational Web"
Michael J. Cafarella, Alon Halevy, Daisy Zhe Wang, Eugene Wu, and Yang Zhang
Proceedings of the 11th International Workshop on Web and Databases (WebDB 2008)

"The Tradeoffs Between Open and Traditional Relation Extraction"
Michele Banko and Oren Etzioni
Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics (ACL 2008)

"Data Management Projects at Google"
Michael Cafarella, Edward Chang, Andrew Fikes, Alon Halevy, Wilson Hsieh, Alberto Lerner, Jayant Madhavan, and S. Muthukrishnan
SIGMOD Record, 37(1): 34-38, 2008

"Redundancy in Web-scale Information Extraction: Probabilistic Model and Experimental Results"
Doug Downey
University of Washington Ph.D. Dissertation
Datasets from Doug Downey's Ph.D. thesis
LEX code

2007

"Strategies for Lifelong Knowledge Extraction from the Web"
Michele Banko and Oren Etzioni
Proceedings of the 4th International Conference on Knowledge Capture (K-CAP 2007)
Best Student Paper Award

"Disambiguating for the Web: A Test of Two Methods"
Jonathan Pool and S. M. Colowick
Proceedings of the 4th International Conference on Knowledge Capture (K-CAP 2007)

"Syntactic Disambiguation for the Semantic Web"
Jonathan Pool and S. M. Colowick
Proceedings of the Semantic Authoring, Annotation and Knowledge Markup Workshop (SAAKM 2007)

"Lexical Translation with Application to Image Search on the Web"
Oren Etzioni, Kobi Reiter, Stephen Soderland, and Marcus Sammer
Proceedings of Machine Translation Summit XI, 2007

"Building a Sense-Distinguished Multilingual Lexicon from Monolingual Corpora and Bilingual Lexicons"
Marcus Sammer and Stephen Soderland
Proceedings of Machine Translation Summit XI, 2007

"Navigating Extracted Data with Schema Discovery"
Michael J. Cafarella, Dan Suciu, and Oren Etzioni
Proceedings of the 10th International Workshop on the Web and Databases (WebDB 2007)

"Sparse Information Extraction: Unsupervised Language Models to the Rescue"
Doug Downey, Stefan Schoenmackers, and Oren Etzioni
Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics (ACL 2007)

"Unsupervised Resolution of Objects and Relations on the Web"
Alexander Yates and Oren Etzioni
Proceedings of Human Language Technologies: Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL-HLT 2007)

"Machine Reading"
Oren Etzioni, Michele Banko, and Michael J. Cafarella
Proceedings of the 2007 AAAI Spring Symposium on Machine Reading

"Moving from Textual Relations to Ontologized Relations"
Stephen Soderland and Bhushan Mandhani
Proceedings of the 2007 AAAI Spring Symposium on Machine Reading

"Open Information Extraction from the Web"
Michele Banko, Michael J. Cafarella, Stephen Soderland, Matt Broadhead, and Oren Etzioni
Proceedings of the 20th International Joint Conference on Artificial Intelligence (IJCAI 2007)

"Locating Complex Named Entities in Web Text"
Doug Downey, Matthew Broadhead, and Oren Etzioni
Proceedings of the 20th International Joint Conference on Artificial Intelligence (IJCAI 2007)

"Structured Querying of Web Text: A Technical Challenge"
Michael J. Cafarella, Christopher Re, Dan Suciu, Oren Etzioni, and Michele Banko
Proceedings of the 3rd Biennial Conference on Innovative Data Systems Research (CIDR 2007)

"Information Extraction from Unstructured Web Text"
Ana-Maria Popescu
University of Washington Ph.D. Dissertation

"Information Extraction from the Web: Techniques and Applications"
Alexander Yates
University of Washington Ph.D. Dissertation

2006

"Structured Queries Over Web Text"
Michael J. Cafarella, Oren Etzioni, and Dan Suciu
IEEE Data Bulletin, 29(4): 45-51, 2006

"Machine Reading"
Oren Etzioni, Michele Banko, and Michael J. Cafarella
Proceedings of the 21st National Conference on Artificial Intelligence (AAAI 2006)

"Ontology-driven Information Extraction with OntoSyphon"
Luke K. McDowell and Michael Cafarella
Proceedings of the 5th International Semantic Web Conference (ISWC 2006)

"Can Controlled Languages Scale to the Web?"
Jonathan Pool
Proceedings of the 5th International Workshop on Controlled Language Applications (CLAW 2006)

"Ambiguity Reduction for Machine Translation: Human-Computer Collaboration"
Marcus Sammer, Kobi Reiter, Stephen Soderland, Katrin Kirchhoff, and Oren Etzioni
Proceedings of the Conference of the Association for Machine Translation in the Americas (AMTA 2006)

"Detecting Parser Errors Using Web-based Semantic Filters"
Alexander Yates, Stefan Schoenmackers, and Oren Etzioni
Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP 2006)

"Relational Web Search"
Michael J. Cafarella, Michele Banko, and Oren Etzioni
UW CSE Tech Report 2006-04-02

2005

"Extracting Product Features and Opinions from Reviews"
Ana-Maria Popescu and Oren Etzioni
Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP 2005)

"KnowItNow: Fast, Scalable Information Extraction from the Web"
Michael J. Cafarella, Doug Downey, Stephen Soderland, and Oren Etzioni
Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP 2005)

"A Probabilistic Model of Redundancy in Information Extraction"
Doug Downey, Oren Etzioni, and Stephen Soderland
Proceedings of the 19th International Joint Conference on Artificial Intelligence (IJCAI 2005)
Distinguished Paper Award

"A Search Engine for Natural Language Applications"
Michael J. Cafarella and Oren Etzioni
Proceedings of the 14th International World Wide Web Conference (WWW 2005)

"Unsupervised named-entity extraction from the Web: An experimental study"
Oren Etzioni, Michael Cafarella, Doug Downey, Ana-Maria Popescu, Tal Shaked, Stephen Soderland, Daniel S. Weld, and Alexander Yates
Artificial Intelligence, 165(1): 91-134, 2005

"Beauty and the Beast: What running a broad-coverage precision grammar over the BNC taught us about the grammar---and the corpus"
Timothy Baldwin, John Beavers, Emily M. Bender, Dan Flickinger, Ara Kim, and Stephan Oepen
Linguistic Evidence: Empirical, Theoretical, and Computational Perspectives, 49-70, 2005

"A Coordination Module for a Crosslinguistic Grammar Resource"
Scott Drellishak and Emily M. Bender
Proceedings of the 12th International Conference on Head-Driven Phrase Structure Grammar (HPSG 2005)

"Rapid Prototyping of Scalable Grammars: Towards Modularity in Extensions to a Language-Independent Core"
Emily M. Bender and Dan Flickinger
Proceedings of the 2nd International Joint Conference on Natural Language Processing (IJCNLP 2005)

2004

"Methods for domain-independent information extraction from the Web: an experimental comparison"
Oren Etzioni, Michael Cafarella, Doug Downey, Ana-Maria Popescu, Tal Shaked, Stephen Soderland, Daniel S. Weld, and Alexander Yates
Proceedings of the 19th National Conference on Artificial Intelligence (AAAI 2004)

"Web-scale information extraction in KnowItAll (preliminary results)"
Oren Etzioni, Michael Cafarella, Doug Downey, Stanley Kok, Ana-Maria Popescu, Tal Shaked, Stephen Soderland, Daniel S. Weld, and Alexander Yates
Proceedings of the 13th International Conference on the World Wide Web (WWW 2004)

"Semantic Email"
Luke McDowell , Oren Etzioni, Alon Halevy, and Henry Levy
Proceedings of the 13th International Conference on the World Wide Web (WWW 2004)
Runner-up for Best Paper Award

"Modern Natural Language Interfaces to Databases: composing statistical parsing with semantic tractability"
Ana-Maria Popescu, Alex Armanasu, Oren Etzioni, David Ko, and Alexander Yates
Proceedings of the 20th International Conference on Computational Linguistics (COLING 2004)

"The specification of agent behavior by ordinary people: A case study"
Luke McDowell, Oren Etzioni, and Alon Halevy
Proceedings of the 3rd International Semantic Web Conference (ISWC 2004)

"Semantic email: Theory and applications"
Luke McDowell, Oren Etzioni, and Alon Halevy
Journal of Web Semantics, 2(2): 153-183, 2004

"Montage: Leveraging Advances in Grammar Engineering, Linguistic Ontologies, and Mark-up for the Documentation of Underdescribed Languages"
Emily Bender, Dan Flickinger, Jeff Good, and Ivan Sag
Proceedings of the Workshop on First Steps for Language Documentation of Minority Languages: Computational Linguistic Tools for Morphology, Lexicon and Corpus Compilation (LREC 2004)

"Meaning for the Masses: Theory and Applications for Semantic Web and Semantic Email Systems"
Luke McDowell
University of Washington Ph.D. Dissertation

2003

"Enticing Ordinary People onto the Semantic Web via Instant Gratification"
Luke McDowell, Oren Etzioni, Steven D. Gribble, Alon Halevy, Henry Levy, William Pentney, Deepak Verma, and Stani Vlasseva
Proceedings of the International Semantic Web Conference (ISWC 2003)

"Semantic Email: Adding Lightweight Data Manipulation Capabilities to the Email Habitat"
Oren Etzioni, Alon Halevy, Henry Levy, and Luke McDowell
Proceedings of the 6th International Workshop on the Web and Databases (WebDB 2003)

"Towards a theory of natural language interfaces to databases"
Ana-Maria Popescu, Oren Etzioni, and Henry Kautz
Proceedings of the Intelligent User Interfaces (IUI 2003)

"A reliable natural language interface to household appliances"
Alexander Yates, Oren Etzioni, and Daniel S. Weld
Proceedings of the International Conference on Intelligent User Interfaces (IUI 2003)

"Compositional Semantics in a Multilingual Grammar Resource"
Dan Flickinger and Emily M. Bender
Proceedings of the Workshop on Ideas and Stratgies for Multilingual Grammar Development (ESSLLI 2003)

2002

"The Grammar Matrix: An Open-Source Starter-Kit for the Rapid Development of Cross-Linguistically Consistent Broad-Coverage Precision Grammars"
Emily M. Bender, Dan Flickinger, and Stephan Oepen
Proceedings of the Workshop on Grammar Engineering and Evaluation (COLING 2002)

1999

"Clustering Web Documents: A Phrase-Based Method for Grouping Search Engine Results"
Oren Zamir
University of Washington Ph.D. Dissertation

Publication lookup sites
Google Scholar
CiteSeerX
ACM Portal
AAAI Publications
DBLP

 
 

Email: | Maps | Directions