A Lexical and Semantical Analysis on REST Cloud Computing APIs. Petrillo, F., Merle, P., Palma, F., Moha, N., & Gu�h�neuc, Y. In Ferguson, D., Mu�oz, V. M., Cardoso, J. S., Helfert, M., & Pahl, C., editors, Proceedings of the 8<sup>th</sup> International Conference on Cloud Computing and Services Science (CLOSER), pages 308–332, July, 2018. Springer. 24 pages.Paper abstract bibtex Cloud computing is a popular Internet-based computing paradigm that provides on-demand computational services and resources, generally offered by Cloud providers' REpresentational State Transfer (REST) APIs. Developers use REST APIs by invoking these APIs by their names and, thus, the lexicons used in the APIs are important to ease the developers' comprehension. In this paper, we study the lexicons and the linguistic (anti)patterns from 16 providers of REST Cloud Computing APIs. We observe that, although the 16 REST APIs describe the same domain (Cloud computing), contrary to what one might expect, their lexicons do not share a large number of common terms and 90% of the terms (3,561/3,947) are just used by one provider. Thus, the APIs are lexically heterogeneous and there is not a consensus on which terms to use in Cloud computing. Further, we observe that the majority of the URIs, 54%, follow the Contextualised Resource Names pattern, which is considered a good practice in REST API design. However, a majority of the URIs, 62.82%, suffer from the Non-pertinent Documentation antipattern. Thus, we present three main contributions: 1) a tooled approach, called CloudLex, for extracting and analysing REST Cloud computing lexicons; 2) our analysis of the terms used in 16 REST APIs in 59,677 term occurrences; 3) our analysis of the linguistic (anti)patters in more than 23,000 URIs of the 142 services of the 16 Cloud providers. We also show that CloudLex has an average precision of 84.82%, recall of 63.57%, and F1-measure of 71.03% on one complete API, Docker Engine, which confirms the accuracy of our semantic analyses for the detection of linguistic (anti)patterns.
@INPROCEEDINGS{Petrillo18-CLOSER-LexicalSemanticREST,
AUTHOR = {Fabio Petrillo and Philippe Merle and Francis Palma and
Naouel Moha and Yann-Ga�l Gu�h�neuc},
BOOKTITLE = {Proceedings of the 8<sup>th</sup> International Conference on Cloud Computing and Services Science (CLOSER)},
TITLE = {A Lexical and Semantical Analysis on REST Cloud
Computing APIs},
YEAR = {2018},
OPTADDRESS = {},
OPTCROSSREF = {},
EDITOR = {Donald Ferguson and V�ctor M�ndez Mu�oz and
Jorge S. Cardoso and Markus Helfert and Claus Pahl},
MONTH = {July},
NOTE = {24 pages.},
OPTNUMBER = {},
OPTORGANIZATION = {},
PAGES = {308--332},
PUBLISHER = {Springer},
OPTSERIES = {},
OPTVOLUME = {},
KEYWORDS = {Topic: <b>Code and design smells</b>,
Venue: <c>CLOSER</c>},
URL = {http://www.ptidej.net/publications/documents/CLOSER18.doc.pdf},
PDF = {http://www.ptidej.net/publications/documents/CLOSER18.ppt.pdf},
ABSTRACT = {Cloud computing is a popular Internet-based computing
paradigm that provides on-demand computational services and
resources, generally offered by Cloud providers' REpresentational
State Transfer (REST) APIs. Developers use REST APIs by invoking
these APIs by their names and, thus, the lexicons used in the APIs
are important to ease the developers' comprehension. In this paper,
we study the lexicons and the linguistic (anti)patterns from 16
providers of REST Cloud Computing APIs. We observe that, although the
16 REST APIs describe the same domain (Cloud computing), contrary to
what one might expect, their lexicons do not share a large number of
common terms and 90\% of the terms (3,561/3,947) are just used by one
provider. Thus, the APIs are lexically heterogeneous and there is not
a consensus on which terms to use in Cloud computing. Further, we
observe that the majority of the URIs, 54\%, follow the
Contextualised Resource Names pattern, which is considered a good
practice in REST API design. However, a majority of the URIs,
62.82\%, suffer from the Non-pertinent Documentation antipattern.
Thus, we present three main contributions: 1) a tooled approach,
called CloudLex, for extracting and analysing REST Cloud computing
lexicons; 2) our analysis of the terms used in 16 REST APIs in 59,677
term occurrences; 3) our analysis of the linguistic (anti)patters in
more than 23,000 URIs of the 142 services of the 16 Cloud providers.
We also show that CloudLex has an average precision of 84.82\%,
recall of 63.57\%, and F1-measure of 71.03\% on one complete API,
Docker Engine, which confirms the accuracy of our semantic analyses
for the detection of linguistic (anti)patterns.}
}
Downloads: 0
{"_id":"ckRZWzg7pCEErb8Ti","bibbaseid":"petrillo-merle-palma-moha-guhneuc-alexicalandsemanticalanalysisonrestcloudcomputingapis-2018","downloads":0,"creationDate":"2018-10-22T00:13:29.337Z","title":"A Lexical and Semantical Analysis on REST Cloud Computing APIs","author_short":["Petrillo, F.","Merle, P.","Palma, F.","Moha, N.","Gu�h�neuc, Y."],"year":2018,"bibtype":"inproceedings","biburl":"http://www.yann-gael.gueheneuc.net/Work/Publications/Biblio/complete-bibliography.bib","bibdata":{"bibtype":"inproceedings","type":"inproceedings","author":[{"firstnames":["Fabio"],"propositions":[],"lastnames":["Petrillo"],"suffixes":[]},{"firstnames":["Philippe"],"propositions":[],"lastnames":["Merle"],"suffixes":[]},{"firstnames":["Francis"],"propositions":[],"lastnames":["Palma"],"suffixes":[]},{"firstnames":["Naouel"],"propositions":[],"lastnames":["Moha"],"suffixes":[]},{"firstnames":["Yann-Ga�l"],"propositions":[],"lastnames":["Gu�h�neuc"],"suffixes":[]}],"booktitle":"Proceedings of the 8<sup>th</sup> International Conference on Cloud Computing and Services Science (CLOSER)","title":"A Lexical and Semantical Analysis on REST Cloud Computing APIs","year":"2018","optaddress":"","optcrossref":"","editor":[{"firstnames":["Donald"],"propositions":[],"lastnames":["Ferguson"],"suffixes":[]},{"firstnames":["V�ctor","M�ndez"],"propositions":[],"lastnames":["Mu�oz"],"suffixes":[]},{"firstnames":["Jorge","S."],"propositions":[],"lastnames":["Cardoso"],"suffixes":[]},{"firstnames":["Markus"],"propositions":[],"lastnames":["Helfert"],"suffixes":[]},{"firstnames":["Claus"],"propositions":[],"lastnames":["Pahl"],"suffixes":[]}],"month":"July","note":"24 pages.","optnumber":"","optorganization":"","pages":"308–332","publisher":"Springer","optseries":"","optvolume":"","keywords":"Topic: <b>Code and design smells</b>, Venue: <c>CLOSER</c>","url":"http://www.ptidej.net/publications/documents/CLOSER18.doc.pdf","pdf":"http://www.ptidej.net/publications/documents/CLOSER18.ppt.pdf","abstract":"Cloud computing is a popular Internet-based computing paradigm that provides on-demand computational services and resources, generally offered by Cloud providers' REpresentational State Transfer (REST) APIs. Developers use REST APIs by invoking these APIs by their names and, thus, the lexicons used in the APIs are important to ease the developers' comprehension. In this paper, we study the lexicons and the linguistic (anti)patterns from 16 providers of REST Cloud Computing APIs. We observe that, although the 16 REST APIs describe the same domain (Cloud computing), contrary to what one might expect, their lexicons do not share a large number of common terms and 90% of the terms (3,561/3,947) are just used by one provider. Thus, the APIs are lexically heterogeneous and there is not a consensus on which terms to use in Cloud computing. Further, we observe that the majority of the URIs, 54%, follow the Contextualised Resource Names pattern, which is considered a good practice in REST API design. However, a majority of the URIs, 62.82%, suffer from the Non-pertinent Documentation antipattern. Thus, we present three main contributions: 1) a tooled approach, called CloudLex, for extracting and analysing REST Cloud computing lexicons; 2) our analysis of the terms used in 16 REST APIs in 59,677 term occurrences; 3) our analysis of the linguistic (anti)patters in more than 23,000 URIs of the 142 services of the 16 Cloud providers. We also show that CloudLex has an average precision of 84.82%, recall of 63.57%, and F1-measure of 71.03% on one complete API, Docker Engine, which confirms the accuracy of our semantic analyses for the detection of linguistic (anti)patterns.","bibtex":"@INPROCEEDINGS{Petrillo18-CLOSER-LexicalSemanticREST,\r\n AUTHOR = {Fabio Petrillo and Philippe Merle and Francis Palma and \r\n Naouel Moha and Yann-Ga�l Gu�h�neuc},\r\n BOOKTITLE = {Proceedings of the 8<sup>th</sup> International Conference on Cloud Computing and Services Science (CLOSER)},\r\n TITLE = {A Lexical and Semantical Analysis on REST Cloud \r\n Computing APIs},\r\n YEAR = {2018},\r\n OPTADDRESS = {},\r\n OPTCROSSREF = {},\r\n EDITOR = {Donald Ferguson and V�ctor M�ndez Mu�oz and \r\n Jorge S. Cardoso and Markus Helfert and Claus Pahl},\r\n MONTH = {July},\r\n NOTE = {24 pages.},\r\n OPTNUMBER = {},\r\n OPTORGANIZATION = {},\r\n PAGES = {308--332},\r\n PUBLISHER = {Springer},\r\n OPTSERIES = {},\r\n OPTVOLUME = {},\r\n KEYWORDS = {Topic: <b>Code and design smells</b>, \r\n Venue: <c>CLOSER</c>},\r\n URL = {http://www.ptidej.net/publications/documents/CLOSER18.doc.pdf},\r\n PDF = {http://www.ptidej.net/publications/documents/CLOSER18.ppt.pdf},\r\n ABSTRACT = {Cloud computing is a popular Internet-based computing \r\n paradigm that provides on-demand computational services and \r\n resources, generally offered by Cloud providers' REpresentational \r\n State Transfer (REST) APIs. Developers use REST APIs by invoking \r\n these APIs by their names and, thus, the lexicons used in the APIs \r\n are important to ease the developers' comprehension. In this paper, \r\n we study the lexicons and the linguistic (anti)patterns from 16 \r\n providers of REST Cloud Computing APIs. We observe that, although the \r\n 16 REST APIs describe the same domain (Cloud computing), contrary to \r\n what one might expect, their lexicons do not share a large number of \r\n common terms and 90\\% of the terms (3,561/3,947) are just used by one \r\n provider. Thus, the APIs are lexically heterogeneous and there is not \r\n a consensus on which terms to use in Cloud computing. Further, we \r\n observe that the majority of the URIs, 54\\%, follow the \r\n Contextualised Resource Names pattern, which is considered a good \r\n practice in REST API design. However, a majority of the URIs, \r\n 62.82\\%, suffer from the Non-pertinent Documentation antipattern. \r\n Thus, we present three main contributions: 1) a tooled approach, \r\n called CloudLex, for extracting and analysing REST Cloud computing \r\n lexicons; 2) our analysis of the terms used in 16 REST APIs in 59,677 \r\n term occurrences; 3) our analysis of the linguistic (anti)patters in \r\n more than 23,000 URIs of the 142 services of the 16 Cloud providers. \r\n We also show that CloudLex has an average precision of 84.82\\%, \r\n recall of 63.57\\%, and F1-measure of 71.03\\% on one complete API, \r\n Docker Engine, which confirms the accuracy of our semantic analyses \r\n for the detection of linguistic (anti)patterns.}\r\n}\r\n\r\n","author_short":["Petrillo, F.","Merle, P.","Palma, F.","Moha, N.","Gu�h�neuc, Y."],"editor_short":["Ferguson, D.","Mu�oz, V. M.","Cardoso, J. S.","Helfert, M.","Pahl, C."],"key":"Petrillo18-CLOSER-LexicalSemanticREST","id":"Petrillo18-CLOSER-LexicalSemanticREST","bibbaseid":"petrillo-merle-palma-moha-guhneuc-alexicalandsemanticalanalysisonrestcloudcomputingapis-2018","role":"author","urls":{"Paper":"http://www.ptidej.net/publications/documents/CLOSER18.doc.pdf"},"keyword":["Topic: <b>Code and design smells</b>","Venue: <c>CLOSER</c>"],"metadata":{"authorlinks":{"gu�h�neuc, y":"https://bibbase.org/show?bib=http%3A%2F%2Fwww.yann-gael.gueheneuc.net%2FWork%2FPublications%2FBiblio%2Fcomplete-bibliography.bib&msg=embed"}}},"search_terms":["lexical","semantical","analysis","rest","cloud","computing","apis","petrillo","merle","palma","moha","gu�h�neuc"],"keywords":["topic: <b>code and design smells</b>","venue: <c>closer</c>"],"authorIDs":["5a5fb236a39f2c3645000032","5e60e7f0839e59df010000e8","AfJhKcg96muyPdu7S","ahGA65oGDChNYp7Mb"],"dataSources":["8vn5MSGYWB4fAx9Z4"]}