Suvarna Garge (Editor)

LinguaStream

Updated on
Edit
Like
Comment
Share on FacebookTweet on TwitterShare on LinkedInShare on Reddit

LinguaStream is a generic platform for Natural Language Processing (NLP), based on incremental enrichment of electronic documents. LinguaStream is developed at the GREYC computer science research group (Université de Caen) since 2001. It is available for free for private use and research purposes.

Contents

Description

LinguaStream allows complex processing streams to be designed and evaluated, assembling analysis components of various types and levels: part-of-speech, syntax, semantics, discourse or statistical. Each stage of the processing stream discovers and produces new information, on which the subsequent steps can rely. At the end of the stream, several tools allow analysed documents and their annotations to be conveniently visualised.

LinguaStream is above all a virtual laboratory targeted to researchers in NLP. It allows for complex experiments on corpora to be realised conveniently, using various types of declarative formalisms, and reducing considerably the development costs. Its uses range from corpora exploration to the development of fully functional automatic analysers. An integrated environment is provided with the platform, where all the steps of the realisation of an experiment can be achieved.

Technology

As a platform, LinguaStream provides an extensive Java API. For example, it can be integrated with Java EE servers to develop web applications based on processing streams. It is also used for teaching, and provides specific modules dedicated to students.

References

LinguaStream Wikipedia