DeepNLPF: A Framework for Integrating Third Party NLP Tools. Rodrigues, F., Lima, R., Domingues, W., Fidalgo, R., Chifu, A., Espinasse, B., & Fournier, S. In Calzolari, N., Béchet, F., Blache, P., Choukri, K., Cieri, C., Declerck, T., Goggi, S., Isahara, H., Maegaard, B., Mariani, J., Mazo, H., Moreno, A., Odijk, J., & Piperidis, S., editors, Proceedings of the Twelfth Language Resources and Evaluation Conference, pages 7244–7251, Marseille, France, May, 2020. European Language Resources Association. Paper abstract bibtex Natural Language Processing (NLP) of textual data is usually broken down into a sequence of several subtasks, where the output of one the subtasks becomes the input to the following one, which constitutes an NLP pipeline. Many third-party NLP tools are currently available, each performing distinct NLP subtasks. However, it is difficult to integrate several NLP toolkits into a pipeline due to many problems, including different input/output representations or formats, distinct programming languages, and tokenization issues. This paper presents DeepNLPF, a framework that enables easy integration of third-party NLP tools, allowing the user to preprocess natural language texts at lexical, syntactic, and semantic levels. The proposed framework also provides an API for complete pipeline customization including the definition of input/output formats, integration plugin management, transparent ultiprocessing execution strategies, corpus-level statistics, and database persistence. Furthermore, the DeepNLPF user-friendly GUI allows its use even by a non-expert NLP user. We conducted runtime performance analysis showing that DeepNLPF not only easily integrates existent NLP toolkits but also reduces significant runtime processing compared to executing the same NLP pipeline in a sequential manner.
@inproceedings{rodrigues-etal-2020-deepnlpf,
title = "{D}eep{NLPF}: A Framework for Integrating Third Party {NLP} Tools",
author = "Rodrigues, Francisco and
Lima, Rinaldo and
Domingues, William and
Fidalgo, Robson and
Chifu, Adrian and
Espinasse, Bernard and
Fournier, S{\'e}bastien",
editor = "Calzolari, Nicoletta and
B{\'e}chet, Fr{\'e}d{\'e}ric and
Blache, Philippe and
Choukri, Khalid and
Cieri, Christopher and
Declerck, Thierry and
Goggi, Sara and
Isahara, Hitoshi and
Maegaard, Bente and
Mariani, Joseph and
Mazo, H{\'e}l{\`e}ne and
Moreno, Asuncion and
Odijk, Jan and
Piperidis, Stelios",
booktitle = "Proceedings of the Twelfth Language Resources and Evaluation Conference",
month = may,
year = "2020",
address = "Marseille, France",
publisher = "European Language Resources Association",
url = "https://aclanthology.org/2020.lrec-1.895",
pages = "7244--7251",
abstract = "Natural Language Processing (NLP) of textual data is usually broken down into a sequence of several subtasks, where the output of one the subtasks becomes the input to the following one, which constitutes an NLP pipeline. Many third-party NLP tools are currently available, each performing distinct NLP subtasks. However, it is difficult to integrate several NLP toolkits into a pipeline due to many problems, including different input/output representations or formats, distinct programming languages, and tokenization issues. This paper presents DeepNLPF, a framework that enables easy integration of third-party NLP tools, allowing the user to preprocess natural language texts at lexical, syntactic, and semantic levels. The proposed framework also provides an API for complete pipeline customization including the definition of input/output formats, integration plugin management, transparent ultiprocessing execution strategies, corpus-level statistics, and database persistence. Furthermore, the DeepNLPF user-friendly GUI allows its use even by a non-expert NLP user. We conducted runtime performance analysis showing that DeepNLPF not only easily integrates existent NLP toolkits but also reduces significant runtime processing compared to executing the same NLP pipeline in a sequential manner.",
language = "English",
ISBN = "979-10-95546-34-4",
}
Downloads: 0
{"_id":"DE5RCBSKaQzrAhEav","bibbaseid":"rodrigues-lima-domingues-fidalgo-chifu-espinasse-fournier-deepnlpfaframeworkforintegratingthirdpartynlptools-2020","author_short":["Rodrigues, F.","Lima, R.","Domingues, W.","Fidalgo, R.","Chifu, A.","Espinasse, B.","Fournier, S."],"bibdata":{"bibtype":"inproceedings","type":"inproceedings","title":"DeepNLPF: A Framework for Integrating Third Party NLP Tools","author":[{"propositions":[],"lastnames":["Rodrigues"],"firstnames":["Francisco"],"suffixes":[]},{"propositions":[],"lastnames":["Lima"],"firstnames":["Rinaldo"],"suffixes":[]},{"propositions":[],"lastnames":["Domingues"],"firstnames":["William"],"suffixes":[]},{"propositions":[],"lastnames":["Fidalgo"],"firstnames":["Robson"],"suffixes":[]},{"propositions":[],"lastnames":["Chifu"],"firstnames":["Adrian"],"suffixes":[]},{"propositions":[],"lastnames":["Espinasse"],"firstnames":["Bernard"],"suffixes":[]},{"propositions":[],"lastnames":["Fournier"],"firstnames":["Sébastien"],"suffixes":[]}],"editor":[{"propositions":[],"lastnames":["Calzolari"],"firstnames":["Nicoletta"],"suffixes":[]},{"propositions":[],"lastnames":["Béchet"],"firstnames":["Frédéric"],"suffixes":[]},{"propositions":[],"lastnames":["Blache"],"firstnames":["Philippe"],"suffixes":[]},{"propositions":[],"lastnames":["Choukri"],"firstnames":["Khalid"],"suffixes":[]},{"propositions":[],"lastnames":["Cieri"],"firstnames":["Christopher"],"suffixes":[]},{"propositions":[],"lastnames":["Declerck"],"firstnames":["Thierry"],"suffixes":[]},{"propositions":[],"lastnames":["Goggi"],"firstnames":["Sara"],"suffixes":[]},{"propositions":[],"lastnames":["Isahara"],"firstnames":["Hitoshi"],"suffixes":[]},{"propositions":[],"lastnames":["Maegaard"],"firstnames":["Bente"],"suffixes":[]},{"propositions":[],"lastnames":["Mariani"],"firstnames":["Joseph"],"suffixes":[]},{"propositions":[],"lastnames":["Mazo"],"firstnames":["Hélène"],"suffixes":[]},{"propositions":[],"lastnames":["Moreno"],"firstnames":["Asuncion"],"suffixes":[]},{"propositions":[],"lastnames":["Odijk"],"firstnames":["Jan"],"suffixes":[]},{"propositions":[],"lastnames":["Piperidis"],"firstnames":["Stelios"],"suffixes":[]}],"booktitle":"Proceedings of the Twelfth Language Resources and Evaluation Conference","month":"May","year":"2020","address":"Marseille, France","publisher":"European Language Resources Association","url":"https://aclanthology.org/2020.lrec-1.895","pages":"7244–7251","abstract":"Natural Language Processing (NLP) of textual data is usually broken down into a sequence of several subtasks, where the output of one the subtasks becomes the input to the following one, which constitutes an NLP pipeline. Many third-party NLP tools are currently available, each performing distinct NLP subtasks. However, it is difficult to integrate several NLP toolkits into a pipeline due to many problems, including different input/output representations or formats, distinct programming languages, and tokenization issues. This paper presents DeepNLPF, a framework that enables easy integration of third-party NLP tools, allowing the user to preprocess natural language texts at lexical, syntactic, and semantic levels. The proposed framework also provides an API for complete pipeline customization including the definition of input/output formats, integration plugin management, transparent ultiprocessing execution strategies, corpus-level statistics, and database persistence. Furthermore, the DeepNLPF user-friendly GUI allows its use even by a non-expert NLP user. We conducted runtime performance analysis showing that DeepNLPF not only easily integrates existent NLP toolkits but also reduces significant runtime processing compared to executing the same NLP pipeline in a sequential manner.","language":"English","isbn":"979-10-95546-34-4","bibtex":"@inproceedings{rodrigues-etal-2020-deepnlpf,\n title = \"{D}eep{NLPF}: A Framework for Integrating Third Party {NLP} Tools\",\n author = \"Rodrigues, Francisco and\n Lima, Rinaldo and\n Domingues, William and\n Fidalgo, Robson and\n Chifu, Adrian and\n Espinasse, Bernard and\n Fournier, S{\\'e}bastien\",\n editor = \"Calzolari, Nicoletta and\n B{\\'e}chet, Fr{\\'e}d{\\'e}ric and\n Blache, Philippe and\n Choukri, Khalid and\n Cieri, Christopher and\n Declerck, Thierry and\n Goggi, Sara and\n Isahara, Hitoshi and\n Maegaard, Bente and\n Mariani, Joseph and\n Mazo, H{\\'e}l{\\`e}ne and\n Moreno, Asuncion and\n Odijk, Jan and\n Piperidis, Stelios\",\n booktitle = \"Proceedings of the Twelfth Language Resources and Evaluation Conference\",\n month = may,\n year = \"2020\",\n address = \"Marseille, France\",\n publisher = \"European Language Resources Association\",\n url = \"https://aclanthology.org/2020.lrec-1.895\",\n pages = \"7244--7251\",\n abstract = \"Natural Language Processing (NLP) of textual data is usually broken down into a sequence of several subtasks, where the output of one the subtasks becomes the input to the following one, which constitutes an NLP pipeline. Many third-party NLP tools are currently available, each performing distinct NLP subtasks. However, it is difficult to integrate several NLP toolkits into a pipeline due to many problems, including different input/output representations or formats, distinct programming languages, and tokenization issues. This paper presents DeepNLPF, a framework that enables easy integration of third-party NLP tools, allowing the user to preprocess natural language texts at lexical, syntactic, and semantic levels. The proposed framework also provides an API for complete pipeline customization including the definition of input/output formats, integration plugin management, transparent ultiprocessing execution strategies, corpus-level statistics, and database persistence. Furthermore, the DeepNLPF user-friendly GUI allows its use even by a non-expert NLP user. We conducted runtime performance analysis showing that DeepNLPF not only easily integrates existent NLP toolkits but also reduces significant runtime processing compared to executing the same NLP pipeline in a sequential manner.\",\n language = \"English\",\n ISBN = \"979-10-95546-34-4\",\n}\n\n\n","author_short":["Rodrigues, F.","Lima, R.","Domingues, W.","Fidalgo, R.","Chifu, A.","Espinasse, B.","Fournier, S."],"editor_short":["Calzolari, N.","Béchet, F.","Blache, P.","Choukri, K.","Cieri, C.","Declerck, T.","Goggi, S.","Isahara, H.","Maegaard, B.","Mariani, J.","Mazo, H.","Moreno, A.","Odijk, J.","Piperidis, S."],"key":"rodrigues-etal-2020-deepnlpf","id":"rodrigues-etal-2020-deepnlpf","bibbaseid":"rodrigues-lima-domingues-fidalgo-chifu-espinasse-fournier-deepnlpfaframeworkforintegratingthirdpartynlptools-2020","role":"author","urls":{"Paper":"https://aclanthology.org/2020.lrec-1.895"},"metadata":{"authorlinks":{}}},"bibtype":"inproceedings","biburl":"https://bibbase.org/network/files/DGB8WkveEMMk4fNMF","dataSources":["Qbm9HhNQbp6tv3asL"],"keywords":[],"search_terms":["deepnlpf","framework","integrating","third","party","nlp","tools","rodrigues","lima","domingues","fidalgo","chifu","espinasse","fournier"],"title":"DeepNLPF: A Framework for Integrating Third Party NLP Tools","year":2020}