Wikigrep distribuido: búsquedas avanzadas en la wikipedia
In this project we created a regular expressions search engine that uses the Wikipedia database of articles. The system allows the use of to enter a regular expression and makes an asynchronous request to initialize an EC2 cluster; it searches for the pattern inside all the Wikipedia and then return...
محفوظ في:
| المؤلف الرئيسي: | |
|---|---|
| مؤلفون آخرون: | , |
| التنسيق: | article |
| اللغة: | spa |
| منشور في: |
2009
|
| الموضوعات: | |
| الوصول للمادة أونلاين: | http://www.dspace.espol.edu.ec/handle/123456789/7701 |
| الوسوم: |
إضافة وسم
لا توجد وسوم, كن أول من يضع وسما على هذه التسجيلة!
|
| _version_ | 1858337358885158912 |
|---|---|
| author | Varas Palomeque, Irene Carolina |
| author2 | Paladines Herrera, Gabriel Antonio Abad, Cristina |
| author2_role | author author |
| author_facet | Varas Palomeque, Irene Carolina Paladines Herrera, Gabriel Antonio Abad, Cristina |
| author_role | author |
| collection | Repositorio Escuela Superior Politécnica del Litoral |
| dc.creator.none.fl_str_mv | Varas Palomeque, Irene Carolina Paladines Herrera, Gabriel Antonio Abad, Cristina |
| dc.date.none.fl_str_mv | 2009-10-15 2009-10-15 2009-10-15 |
| dc.format.none.fl_str_mv | application/pdf application/postscript |
| dc.identifier.none.fl_str_mv | http://www.dspace.espol.edu.ec/handle/123456789/7701 |
| dc.language.none.fl_str_mv | spa |
| dc.rights.none.fl_str_mv | info:eu-repo/semantics/openAccess |
| dc.source.none.fl_str_mv | reponame:Repositorio Escuela Superior Politécnica del Litoral instname:Escuela Superior Politécnica del Litoral instacron:ESPOL |
| dc.subject.none.fl_str_mv | HADDOP CLOUD COMPUTING MAPREDUCE ELASTIC MAPREDUCE SIMPLE STORAGE SERVICE S3 WIKIPEDIA DATASET CLÚSTER EC2. |
| dc.title.none.fl_str_mv | Wikigrep distribuido: búsquedas avanzadas en la wikipedia |
| dc.type.none.fl_str_mv | info:eu-repo/semantics/publishedVersion info:eu-repo/semantics/article |
| description | In this project we created a regular expressions search engine that uses the Wikipedia database of articles. The system allows the use of to enter a regular expression and makes an asynchronous request to initialize an EC2 cluster; it searches for the pattern inside all the Wikipedia and then returns the result, displaying a list of all the occurrences of the pattern and a link to the Wikipedia Article. We used the Amazon Web Services, Java libraries to manipulate Wikipedia Articles, the Hadoop framework and a dataset of the Wikipedia Articles. We tested some regular expressions that couldn’t be searched for using neither traditional search engines nor the Wikipedia Search Engine. Our tests show that an advanced search engine could be cheap to implement providing high scalability through the use of cloud computing and data-intensive computing techniques. |
| eu_rights_str_mv | openAccess |
| format | article |
| id | ESPOL_aaf6d3beba34232edd475fac63286148 |
| instacron_str | ESPOL |
| institution | ESPOL |
| instname_str | Escuela Superior Politécnica del Litoral |
| language | spa |
| network_acronym_str | ESPOL |
| network_name_str | Repositorio Escuela Superior Politécnica del Litoral |
| oai_identifier_str | oai:www.dspace.espol.edu.ec:123456789/7701 |
| publishDate | 2009 |
| reponame_str | Repositorio Escuela Superior Politécnica del Litoral |
| repository.mail.fl_str_mv | . |
| repository.name.fl_str_mv | Repositorio Escuela Superior Politécnica del Litoral - Escuela Superior Politécnica del Litoral |
| repository_id_str | 1479 |
| spelling | Wikigrep distribuido: búsquedas avanzadas en la wikipediaVaras Palomeque, Irene CarolinaPaladines Herrera, Gabriel AntonioAbad, CristinaHADDOPCLOUD COMPUTINGMAPREDUCEELASTIC MAPREDUCESIMPLE STORAGE SERVICE S3WIKIPEDIADATASETCLÚSTER EC2.In this project we created a regular expressions search engine that uses the Wikipedia database of articles. The system allows the use of to enter a regular expression and makes an asynchronous request to initialize an EC2 cluster; it searches for the pattern inside all the Wikipedia and then returns the result, displaying a list of all the occurrences of the pattern and a link to the Wikipedia Article. We used the Amazon Web Services, Java libraries to manipulate Wikipedia Articles, the Hadoop framework and a dataset of the Wikipedia Articles. We tested some regular expressions that couldn’t be searched for using neither traditional search engines nor the Wikipedia Search Engine. Our tests show that an advanced search engine could be cheap to implement providing high scalability through the use of cloud computing and data-intensive computing techniques.2009-10-152009-10-152009-10-15info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleapplication/pdfapplication/postscripthttp://www.dspace.espol.edu.ec/handle/123456789/7701spainfo:eu-repo/semantics/openAccessreponame:Repositorio Escuela Superior Politécnica del Litoralinstname:Escuela Superior Politécnica del Litoralinstacron:ESPOL2018-04-04T13:09:05Zoai:www.dspace.espol.edu.ec:123456789/7701Institucionalhttps://www.dspace.espol.edu.ec/Universidad públicahttps://www.espol.edu.ec/.https://www.dspace.espol.edu.ec/oaiEcuador...opendoar:14792018-04-04T13:09:05falseInstitucionalhttps://www.dspace.espol.edu.ec/Universidad públicahttps://www.espol.edu.ec/.https://www.dspace.espol.edu.ec/oai.Ecuador...opendoar:14792018-04-04T13:09:05Repositorio Escuela Superior Politécnica del Litoral - Escuela Superior Politécnica del Litoralfalse |
| spellingShingle | Wikigrep distribuido: búsquedas avanzadas en la wikipedia Varas Palomeque, Irene Carolina HADDOP CLOUD COMPUTING MAPREDUCE ELASTIC MAPREDUCE SIMPLE STORAGE SERVICE S3 WIKIPEDIA DATASET CLÚSTER EC2. |
| status_str | publishedVersion |
| title | Wikigrep distribuido: búsquedas avanzadas en la wikipedia |
| title_full | Wikigrep distribuido: búsquedas avanzadas en la wikipedia |
| title_fullStr | Wikigrep distribuido: búsquedas avanzadas en la wikipedia |
| title_full_unstemmed | Wikigrep distribuido: búsquedas avanzadas en la wikipedia |
| title_short | Wikigrep distribuido: búsquedas avanzadas en la wikipedia |
| title_sort | Wikigrep distribuido: búsquedas avanzadas en la wikipedia |
| topic | HADDOP CLOUD COMPUTING MAPREDUCE ELASTIC MAPREDUCE SIMPLE STORAGE SERVICE S3 WIKIPEDIA DATASET CLÚSTER EC2. |
| url | http://www.dspace.espol.edu.ec/handle/123456789/7701 |