|
|||||||||||||||||
Events2008
Computational Linguistics Lab DiscussionsVarious topics in computational linguistics Natural Language Processing Reading GroupCurrent paper scheduleThursdays, 3:30 pm - 4:30 pm, Electrical Engineering M306 Machine Translation Reading GroupRecent papers in primarily statistical machine translation ColloquiumMausam (Turing Center) and Stephen Soderland (Turing Center) Abstract The goal of our project is a system that can translate between arbitrary pairs of languages. Unfortunately, most machine translation methodologies assume aligned corpora or grammar rules, which are available for only a small number of major language pairs. This makes scaling the popular approaches to any-language translation virtually impossible. We propose to scale machine translation to a panlingual level by first attempting to solve the lexical translation problem and then proceeding to translating pairs of words, phrases and then sentences. In this talk we primarily describe a novel approach to lexical translation that employs probabilistic inference over the Translation Graph, a novel lexical resource that combines translations from hundreds of machine readable dictionaries and Wiktionaries. Our inference algorithm opens up several interesting and challenging future directions that we detail in the talk. We will also demo PanImages (http://www.panimages.org), an image search application that uses the Translation Graph. SymposiumFourteenth UW/Microsoft Quarterly Symposium in Computational Linguistics You are invited to take advantage of this opportunity to connect with the computational linguistics community at Microsoft and the University of Washington. Sponsored by the UW Departments of Linguistics, Electrical Engineering, and Computer Science; Microsoft Research; and UW alumni at Microsoft. The symposium consists of two invited talks, followed by a poster presentation and an informal reception. Amar Subramanya and Jeff Bilmes (Electrical Engineering) Colin Cherry (NLP group, Microsoft Research) Michael Tepper (Linguistics) ColloquiumOren Etzioni, Mausam, and Stephen Soderland (Turing Center) Abstract The goal of our project is a system that can translate between arbitrary pairs of languages. Unfortunately, most machine translation methodologies assume aligned corpora or grammar rules, which are available for only a small number of major language pairs. This makes scaling the popular approaches to any-language translation virtually impossible. We propose to scale machine translation to a panlingual level by first attempting to solve the lexical translation problem and then proceeding to translating pairs of words, phrases and then sentences. In this talk we primarily describe a novel approach to lexical translation that employs probabilistic inference over the Translation Graph, a novel lexical resource that combines translations from hundreds of machine readable dictionaries and Wiktionaries. We hope to also discuss whether the Graph can be used to support linguistics research, and to unearth new observations by providing lexical data on a large number of languages in concert. SymposiumFifteenth UW/Microsoft Quarterly Symposium in Computational Linguistics You are invited to take advantage of this opportunity to connect with the computational linguistics community at Microsoft and the University of Washington. Sponsored by the UW Departments of Linguistics, Electrical Engineering, and Computer Science; Microsoft Research; and UW alumni at Microsoft. Transit from UW: Sound Transit Route 545 every 15 minutes from Montlake Freeway Station (2:07 or 2:22 p.m.) to SR 520 and NE 40th Street (2:19 or 2:34 p.m.); cross SR 520 to 148th Avenue NE, turn left, go to NE 36th Street. Carpooling from UW: Contact Dan Jinguji (danjj@u.washington.edu). Talks: Douglas Downey (Computer Science & Engineering) Chris Quirk (Microsoft Research) Demonstrations: Scott Drellishak, Kelly O’Hara, and Emily M. Bender (Linguistics) Michael Gamon, Sumit Basu, Dmitriy Belenko, Danyel Fisher, Matthew Hurst, and Arnd Christian König (Microsoft Research and Microsoft Live Labs) Michael Gamon, Chris Brockett, Dmitriy Belenko, Bill Dolan, Jianfeng Gao, Lucy Vanderwende (Microsoft Research) ColloquiumTimothy Baldwin (Computer Science and Software Engineering, Melbourne) Abstract The ILIAD (Improved Linux Information Access by Data Mining) Project is an attempt to apply language technology to the task of Linux troubleshooting by analysing the underlying information structure of a multi-document text discourse and improving information delivery through a combination of filtering, term identification and information extraction techniques. In this talk, I will outline the overall project design and present results for a variety of thread-level filtering tasks. Speaker Timothy Baldwin is a Senior Lecturer in the Department of Computer Science and Software Engineering, University of Melbourne. Since completing his PhD at the Tokyo Institute of Technology in 2001, he has been involved with research grants from sources including the NSF, NTT, ARC, NICTA and Google. His research interests include web mining, information extraction, deep linguistic processing, multiword expressions, deep lexical acquisition, and biomedical text mining. He is the author of over 130 journal and conference publications, and has held visiting appointments at NTT Communication Science Laboratories and Saarland University. He is the recipient of a number of awards for both teaching and research in the areas of computer science and natural language processing. He is currently on the editorial board of Computational Linguistics, a series editor for CSLI Publications, and a member of the Deep Linguistic Processing with HPSG Initiative (DELPH-IN).
Previous Events, 2005Previous Events, 2006Previous Events, 2007 |
|||||||||||||||||
|
Email: | Maps | Directions |
|||||||||||||||||