You are not connected to the Beloit College network. Access to online content and services may require you to authenticate with your library.
Remote Access

Find a copy in the library

WorldCat
Find it in libraries globally

Finding libraries that hold this item...
Details
Genre/Form: | Electronic books |
---|---|
Additional Physical Format: | Print version: Feldman, Anna. Resource-light approach to morpho-syntactic tagging. Amsterdam : Rodopi, 2010 (OCoLC)497573700 |
Material Type: | Document, Internet resource |
Document Type: | Internet Resource, Computer File |
All Authors / Contributors: |
Anna Feldman; Jirka Hana |
ISBN: | 9789042027695 904202769X 9042027681 9789042027688 |
OCLC Number: | 608352102 |
Description: | 1 online resource (xiv, 185 pages) : illustrations |
Contents: | Preliminary Material -- Introduction -- Common tagging techniques -- Previous resource-light approaches to NLP -- Languages, corpora and tagsets -- Quantifying language properties -- Resource-light morphological analysis -- Cross-language morphological tagging -- Summary and further work -- Bibliography -- Tagsets we use -- Corpora -- Language properties -- Citation Index. |
Series Title: | Language and computers, no. 70. |
Responsibility: | Anna Feldman and Jirka Hana. |
More information: |
Abstract:
While supervised corpus-based methods are highly accurate for different NLP tasks, including morphological tagging, they are difficult to port to other languages because they require resources that are expensive to create. As a result, many languages have no realistic prospect for morpho-syntactic annotation in the foreseeable future. The method presented in this book aims to overcome this problem by significantly limiting the necessary data and instead extrapolating the relevant information from another, related language. The approach has been tested on Catalan, Portuguese, and Russian. Although these languages are only relatively resource-poor, the same method can be in principle applied to any inflected language, as long as there is an annotated corpus of a related language available. Time needed for adjusting the system to a new language constitutes a fraction of the time needed for systems with extensive, manually created resources: days instead of years. This book touches upon a number of topics: typology, morphology, corpus linguistics, contrastive linguistics, linguistic annotation, computational linguistics and Natural Language Processing (NLP). Researchers and students who are interested in these scientific areas as well as in cross-lingual studies and applications will greatly benefit from this work. Scholars and practitioners in computer science and linguistics are the prospective readers of this book.

Reviews
Editorial reviews
Publisher Synopsis
"F[eldman] & H[ana] have opened a very interesting door, showing us a method with many potential applications to less resourced languages. I suspect there are many other methods behind that door that we could put to use leveraging the computational analysis of one language to help analyze related languages. Finally, it is a potential way for field linguists and computational linguists to work together--again, after a lapse of some years." - in: Linguist List, Fri. Dec. 17, 2010 Read more...
User-contributed reviews
Add a review and share your thoughts with other readers.
Be the first.
Add a review and share your thoughts with other readers.
Be the first.
Tags
Similar Items
Related Subjects:(6)
- Grammar, Comparative and general -- Morphosyntax.
- Computational linguistics.
- Language transfer (Language learning)
- Cross-language information retrieval.
- LANGUAGE ARTS & DISCIPLINES -- Grammar & Punctuation.
- LANGUAGE ARTS & DISCIPLINES -- Linguistics -- Syntax.
User lists with this item (1)
- Things I Own(1 items)
by jirka.x1 updated 2010-03-20