A Lexical and Semantical Analysis on REST Cloud Computing APIs

A Lexical and Semantical Analysis on REST Cloud Computing APIs. Petrillo, F., Merle, P., Palma, F., Moha, N., & Guéhéneuc, Y. In Ferguson, D., Muñoz, V. M., Cardoso, J. S., Helfert, M., & Pahl, C., editors, Proceedings of the 8th International Conference on Cloud Computing and Services Science (CLOSER), pages 308–332, July, 2018. Springer. 24 pages.


Cloud computing is a popular Internet-based computing paradigm that provides on-demand computational services and resources, generally offered by Cloud providers' REpresentational State Transfer (REST) APIs. Developers use REST APIs by invoking these APIs by their names and, thus, the lexicons used in the APIs are important to ease the developers' comprehension. In this paper, we study the lexicons and the linguistic (anti)patterns from 16 providers of REST Cloud Computing APIs. We observe that, although the 16 REST APIs describe the same domain (Cloud computing), contrary to what one might expect, their lexicons do not share a large number of common terms and 90% of the terms (3,561/3,947) are just used by one provider. Thus, the APIs are lexically heterogeneous and there is not a consensus on which terms to use in Cloud computing. Further, we observe that the majority of the URIs, 54%, follow the Contextualised Resource Names pattern, which is considered a good practice in REST API design. However, a majority of the URIs, 62.82%, suffer from the Non-pertinent Documentation antipattern. Thus, we present three main contributions: 1) a tooled approach, called CloudLex, for extracting and analysing REST Cloud computing lexicons; 2) our analysis of the terms used in 16 REST APIs in 59,677 term occurrences; 3) our analysis of the linguistic (anti)patterns in more than 23,000 URIs of the 142 services of the 16 Cloud providers. We also show that CloudLex has an average precision of 84.82%, recall of 63.57%, and F1-measure of 71.03% on one complete API, Docker Engine, which confirms the accuracy of our semantic analyses for the detection of linguistic (anti)patterns.

@INPROCEEDINGS{Petrillo18-CLOSER-LexicalSemanticREST,
   AUTHOR       = {Fabio Petrillo and Philippe Merle and Francis Palma and 
      Naouel Moha and Yann-Gaël Guéhéneuc},
   BOOKTITLE    = {Proceedings of the 8th International Conference on Cloud Computing and Services Science (CLOSER)},
   TITLE        = {A Lexical and Semantical Analysis on REST Cloud 
      Computing APIs},
   YEAR         = {2018},
   OPTADDRESS   = {},
   OPTCROSSREF  = {},
   EDITOR       = {Donald Ferguson and Víctor Méndez Muñoz and 
      Jorge S. Cardoso and Markus Helfert and Claus Pahl},
   MONTH        = {July},
   NOTE         = {24 pages.},
   OPTNUMBER    = {},
   OPTORGANIZATION = {},
   PAGES        = {308--332},
   PUBLISHER    = {Springer},
   OPTSERIES    = {},
   OPTVOLUME    = {},
   KEYWORDS     = {Topic: <b>Code and design smells</b>, 
      Venue: <c>CLOSER</c>},
   URL          = {http://www.ptidej.net/publications/documents/CLOSER18.doc.pdf},
   PDF          = {http://www.ptidej.net/publications/documents/CLOSER18.ppt.pdf},
   ABSTRACT     = {Cloud computing is a popular Internet-based computing 
      paradigm that provides on-demand computational services and 
      resources, generally offered by Cloud providers' REpresentational 
      State Transfer (REST) APIs. Developers use REST APIs by invoking 
      these APIs by their names and, thus, the lexicons used in the APIs 
      are important to ease the developers' comprehension. In this paper, 
      we study the lexicons and the linguistic (anti)patterns from 16 
      providers of REST Cloud Computing APIs. We observe that, although the 
      16 REST APIs describe the same domain (Cloud computing), contrary to 
      what one might expect, their lexicons do not share a large number of 
      common terms and 90\% of the terms (3,561/3,947) are just used by one 
      provider. Thus, the APIs are lexically heterogeneous and there is not 
      a consensus on which terms to use in Cloud computing. Further, we 
      observe that the majority of the URIs, 54\%, follow the 
      Contextualised Resource Names pattern, which is considered a good 
      practice in REST API design. However, a majority of the URIs, 
      62.82\%, suffer from the Non-pertinent Documentation antipattern. 
      Thus, we present three main contributions: 1) a tooled approach, 
      called CloudLex, for extracting and analysing REST Cloud computing 
      lexicons; 2) our analysis of the terms used in 16 REST APIs in 59,677 
      term occurrences; 3) our analysis of the linguistic (anti)patterns in 
      more than 23,000 URIs of the 142 services of the 16 Cloud providers. 
      We also show that CloudLex has an average precision of 84.82\%, 
      recall of 63.57\%, and F1-measure of 71.03\% on one complete API, 
      Docker Engine, which confirms the accuracy of our semantic analyses 
      for the detection of linguistic (anti)patterns.}
}
