Please use the following text to cite this item:
Rubino, Francesco; Quochi, Valeria and Frontini, Francesca, 2012, Multiword Extractor, CLARIN DSpace, http://hdl.handle.net/20.500.11752/ILC-91
dc.contributor.author | Rubino, Francesco |
dc.contributor.author | Quochi, Valeria |
dc.contributor.author | Frontini, Francesca |
dc.date.accessioned | 2018-09-13T08:17:26Z |
dc.date.available | 2018-09-13T08:17:26Z |
dc.date.issued | 2012-12-12 |
dc.description | This is a lexical acquisition web service for the automatic extraction of multiword expressions (MWEs) from large corpora. The service takes as input a POS-tagged corpus in CoNLL-X format and a pair of POS tags, one for the first and one for the last word of an MWE, and outputs a list of candidate multiword expressions together with associated linguistic and statistical information. The output can then be post-processed with filters that refine the extraction and improve its accuracy, and finally converted to an LMF-compliant XML lexical resource. The tool's source code is openly available at https://github.com/francescafrontini/MWExtractor. Further details can be found in: (1) Quochi, Valeria; Frontini, Francesca and Rubino, Francesco. 2012. A MWE Acquisition and Lexicon Builder Web Service. In Proceedings of the 24th International Conference on Computational Linguistics (COLING 2012), December 10-14, 2012, IIT Bombay, Mumbai, India. (2) Frontini, Francesca; Rubino, Francesco and Quochi, Valeria. 2012. Automatic Creation of quality multi-word Lexica from noisy text data. In Proceedings of the Sixth Workshop on Analytics for Noisy Unstructured Text Data (AND2012), December 9, 2012, IIT Bombay, Mumbai, India (co-located with COLING 2012). |
dc.identifier.uri | http://hdl.handle.net/20.500.11752/ILC-91 |
dc.publisher | Istituto di Linguistica Computazionale “A. Zampolli” - Consiglio Nazionale delle Ricerche (ILC-CNR) |
dc.relation | info:eu-repo/grantAgreement/EC/FP7/327146 |
dc.relation.isreferencedby | http://www.aclweb.org/anthology/C12-1140 |
dc.source.uri | http://www.panacea-lr.eu/system/deliverables/PANACEA_D6.2.pdf |
dc.subject | Multiword Extraction |
dc.subject | Automatic lexical acquisition |
dc.title | Multiword Extractor |
dc.type | toolService |
local.branding | ILC |
local.contact.person | Valeria Quochi; valeri.quochi@ilc.cnr.it; Istituto di Linguistica Computazionale “A. Zampolli” - Consiglio Nazionale delle Ricerche (ILC-CNR) |
local.demo.uri | https://ilc4clarin.ilc.cnr.it/en/services/multiword-extractor |
local.files.count | 0 |
local.files.size | 0 |
local.has.files | no |
local.sponsor | euFunds; FP7-STREP-GA248064; European Commission; Platform for Automatic, Normalized Annotation and Cost-Effective Acquisition of Language Resources for Human Language Technologies; info:eu-repo/grantAgreement/EC/FP7/327146 |
metashare.ResourceInfo#ContentInfo.detailedType | service |
metashare.ResourceInfo#ResourceComponentType#ToolServiceInfo.languageDependent | false |
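
The input/output contract given in dc.description (a POS-tagged CoNLL-X corpus plus a pair of POS tags for the first and last word of an MWE) can be illustrated with a minimal Python sketch. This is not the MWExtractor code: the function names, the `max_len` limit, the use of the POSTAG column, and the use of raw frequency as the only statistic are all assumptions made for illustration. The actual service reports richer linguistic and statistical information.

```python
from collections import Counter

# Tiny hypothetical corpus in CoNLL-X layout (10 tab-separated columns:
# ID, FORM, LEMMA, CPOSTAG, POSTAG, FEATS, HEAD, DEPREL, PHEAD, PDEPREL).
SAMPLE = "\n".join([
    "1\tthe\tthe\tDT\tDT\t_\t_\t_\t_\t_",
    "2\tstock\tstock\tNN\tNN\t_\t_\t_\t_\t_",
    "3\tmarket\tmarket\tNN\tNN\t_\t_\t_\t_\t_",
    "4\tfell\tfall\tVB\tVBD\t_\t_\t_\t_\t_",
    "",
    "1\ta\ta\tDT\tDT\t_\t_\t_\t_\t_",
    "2\tstock\tstock\tNN\tNN\t_\t_\t_\t_\t_",
    "3\tmarket\tmarket\tNN\tNN\t_\t_\t_\t_\t_",
    "4\tcrash\tcrash\tNN\tNN\t_\t_\t_\t_\t_",
])


def read_conllx(text):
    """Yield each sentence as a list of (lemma, postag) pairs."""
    sentence = []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            # A blank line closes the current sentence.
            if sentence:
                yield sentence
                sentence = []
            continue
        cols = line.split("\t")
        sentence.append((cols[2], cols[4]))  # LEMMA, POSTAG
    if sentence:
        yield sentence


def extract_candidates(sentences, first_pos, last_pos, max_len=3):
    """Count lemma n-grams (2..max_len tokens) whose first token carries
    first_pos and whose last token carries last_pos."""
    counts = Counter()
    for sent in sentences:
        for i, (_, pos) in enumerate(sent):
            if pos != first_pos:
                continue
            for n in range(2, max_len + 1):
                j = i + n - 1
                if j >= len(sent):
                    break
                if sent[j][1] == last_pos:
                    counts[tuple(lemma for lemma, _ in sent[i:j + 1])] += 1
    return counts


candidates = extract_candidates(read_conllx(SAMPLE), "NN", "NN")
for ngram, freq in candidates.most_common():
    print(" ".join(ngram), freq)
```

In this sketch the raw counts stand in for the statistical information mentioned in the description. A filtering step like the post-processing the record describes would then operate on such a candidate list.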