Implementation of search robot's function to collect information in scientometric systems
At present, the World Wide Web is developing rapidly, and every day the problem of automated collection and analysis of information placed on various web resources is becoming increasingly urgent. If in the 90s of the last century, the World Wide Web was a huge amount of poorly structured informatio...
Wedi'i Gadw mewn:
| Prif Awdur: | |
|---|---|
| Fformat: | article |
| Iaith: | eng |
| Cyhoeddwyd: |
2019
|
| Pynciau: | |
| Mynediad Ar-lein: | https://revista.sangregorio.edu.ec/index.php/REVISTASANGREGORIO/article/view/1002 |
| Tagiau: |
Ychwanegu Tag
Dim Tagiau, Byddwch y cyntaf i dagio'r cofnod hwn!
|
| _version_ | 1858437121544552448 |
|---|---|
| author | F. Galimyanov, Anis |
| author_facet | F. Galimyanov, Anis |
| author_role | author |
| collection | Revista Universidad San Gregorio de Portoviejo |
| dc.creator.none.fl_str_mv | F. Galimyanov, Anis |
| dc.date.none.fl_str_mv | 2019-08-09 |
| dc.format.none.fl_str_mv | application/pdf |
| dc.identifier.none.fl_str_mv | https://revista.sangregorio.edu.ec/index.php/REVISTASANGREGORIO/article/view/1002 |
| dc.language.none.fl_str_mv | eng |
| dc.publisher.none.fl_str_mv | Universidad San Gregorio de Portoviejo |
| dc.relation.none.fl_str_mv | https://revista.sangregorio.edu.ec/index.php/REVISTASANGREGORIO/article/view/1002/NINE 10.36097/rsan.v1i32.1002.g533 |
| dc.rights.none.fl_str_mv | Derechos de autor 2019 Revista San Gregorio info:eu-repo/semantics/openAccess |
| dc.source.none.fl_str_mv | Revista San Gregorio; No. 32 (2019): Revista San Gregorio. SPECIAL EDITION-AUGUST 2019; 69-76 Revista San Gregorio; Núm. 32 (2019): Revista San Gregorio. SPECIAL EDITION-AUGUST 2019; 69-76 2528-7907 1390-7247 10.36097/rsan.v1i32 reponame:Revista Universidad San Gregorio de Portoviejo instname:Universidad San Gregorio de Portoviejo instacron:USGP |
| dc.subject.none.fl_str_mv | search robot spider crawler bot parser robot Hirsch index Scopus pytho |
| dc.title.none.fl_str_mv | Implementation of search robot's function to collect information in scientometric systems |
| dc.type.none.fl_str_mv | info:eu-repo/semantics/article info:eu-repo/semantics/publishedVersion Artículo evaluado por pares |
| description | At present, the World Wide Web is developing rapidly, and every day the problem of automated collection and analysis of information placed on various web resources is becoming increasingly urgent. If in the 90s of the last century, the World Wide Web was a huge amount of poorly structured information, to search in which it was difficult for a person. It was then that the first developments in the field of automated agents began to appear, facilitating the task of finding the necessary information on the web. The main part of such systems is a search robot - a software package that navigates through web resources and collects information for a database. In the Kazan (Volga Region) Federal University, a monthly rating of academic staff is compiled based on data placed in the personal offices of employees in the Electronic University system. Now there is a need to move away from manually filling the Hirsch index in a personal account with KFU staff to avoid incorrect data filing and validation of the entered information by the Prospective Development Center. What was required was the creation of a search robot to automatically collect the Hirsch indices of KFU employees from the Scopus system. This article discusses the search robot: What is it? How does he work? How to write your program to collect information? All these issues were addressed in this article. The possible types of search robots and the whole process of their work were considered. The Scopus scientometric system and scientometric indicator - Hirsch index, its purpose, and calculation were considered. For implementation, the Python programming language was used and the tools for implementing HTTP requests and processing HTML pages were considered. |
| eu_rights_str_mv | openAccess |
| format | article |
| id | REVUSGP_f2eb2ade1666bbf868dd77213bbe82f6 |
| instacron_str | USGP |
| institution | USGP |
| instname_str | Universidad San Gregorio de Portoviejo |
| language | eng |
| network_acronym_str | REVUSGP |
| network_name_str | Revista Universidad San Gregorio de Portoviejo |
| oai_identifier_str | oai:ojs.pkp.sfu.ca:article/1002 |
| publishDate | 2019 |
| publisher.none.fl_str_mv | Universidad San Gregorio de Portoviejo |
| reponame_str | Revista Universidad San Gregorio de Portoviejo |
| repository.mail.fl_str_mv | . |
| repository.name.fl_str_mv | Revista Universidad San Gregorio de Portoviejo - Universidad San Gregorio de Portoviejo |
| repository_id_str | 0 |
| rights_invalid_str_mv | Derechos de autor 2019 Revista San Gregorio |
| spelling | Implementation of search robot's function to collect information in scientometric systemsF. Galimyanov, Anissearch robotspidercrawlerbotparserrobotHirsch indexScopuspythoAt present, the World Wide Web is developing rapidly, and every day the problem of automated collection and analysis of information placed on various web resources is becoming increasingly urgent. If in the 90s of the last century, the World Wide Web was a huge amount of poorly structured information, to search in which it was difficult for a person. It was then that the first developments in the field of automated agents began to appear, facilitating the task of finding the necessary information on the web. The main part of such systems is a search robot - a software package that navigates through web resources and collects information for a database. In the Kazan (Volga Region) Federal University, a monthly rating of academic staff is compiled based on data placed in the personal offices of employees in the Electronic University system. Now there is a need to move away from manually filling the Hirsch index in a personal account with KFU staff to avoid incorrect data filing and validation of the entered information by the Prospective Development Center. What was required was the creation of a search robot to automatically collect the Hirsch indices of KFU employees from the Scopus system. This article discusses the search robot: What is it? How does he work? How to write your program to collect information? All these issues were addressed in this article. The possible types of search robots and the whole process of their work were considered. The Scopus scientometric system and scientometric indicator - Hirsch index, its purpose, and calculation were considered. For implementation, the Python programming language was used and the tools for implementing HTTP requests and processing HTML pages were considered.Universidad San Gregorio de Portoviejo2019-08-09info:eu-repo/semantics/articleinfo:eu-repo/semantics/publishedVersionArtículo evaluado por paresapplication/pdfhttps://revista.sangregorio.edu.ec/index.php/REVISTASANGREGORIO/article/view/1002Revista San Gregorio; No. 32 (2019): Revista San Gregorio. SPECIAL EDITION-AUGUST 2019; 69-76Revista San Gregorio; Núm. 32 (2019): Revista San Gregorio. SPECIAL EDITION-AUGUST 2019; 69-762528-79071390-724710.36097/rsan.v1i32reponame:Revista Universidad San Gregorio de Portoviejoinstname:Universidad San Gregorio de Portoviejoinstacron:USGPenghttps://revista.sangregorio.edu.ec/index.php/REVISTASANGREGORIO/article/view/1002/NINE10.36097/rsan.v1i32.1002.g533Derechos de autor 2019 Revista San Gregorioinfo:eu-repo/semantics/openAccess2019-08-27T11:15:01Zoai:ojs.pkp.sfu.ca:article/1002Portal de revistashttps://revista.sangregorio.edu.ec/Universidad privadahttps://sangregorio.edu.ec/..Ecuador.2528-79071390-7247opendoar:02019-08-27T11:15:01Revista Universidad San Gregorio de Portoviejo - Universidad San Gregorio de Portoviejofalse |
| spellingShingle | Implementation of search robot's function to collect information in scientometric systems F. Galimyanov, Anis search robot spider crawler bot parser robot Hirsch index Scopus pytho |
| status_str | publishedVersion |
| title | Implementation of search robot's function to collect information in scientometric systems |
| title_full | Implementation of search robot's function to collect information in scientometric systems |
| title_fullStr | Implementation of search robot's function to collect information in scientometric systems |
| title_full_unstemmed | Implementation of search robot's function to collect information in scientometric systems |
| title_short | Implementation of search robot's function to collect information in scientometric systems |
| title_sort | Implementation of search robot's function to collect information in scientometric systems |
| topic | search robot spider crawler bot parser robot Hirsch index Scopus pytho |
| url | https://revista.sangregorio.edu.ec/index.php/REVISTASANGREGORIO/article/view/1002 |