Wikigrep distribuido: búsquedas avanzadas en la wikipedia

In this project we created a regular expressions search engine that uses the Wikipedia database of articles. The system allows the use of to enter a regular expression and makes an asynchronous request to initialize an EC2 cluster; it searches for the pattern inside all the Wikipedia and then return...

ver descrição completa

Na minha lista:
Detalhes bibliográficos
Autor principal: Varas Palomeque, Irene Carolina (author)
Outros Autores: Paladines Herrera, Gabriel Antonio (author), Abad, Cristina (author)
Formato: article
Idioma:spa
Publicado em: 2009
Assuntos:
Acesso em linha:http://www.dspace.espol.edu.ec/handle/123456789/7701
Tags: Adicionar Tag
Sem tags, seja o primeiro a adicionar uma tag!
_version_ 1858337358885158912
author Varas Palomeque, Irene Carolina
author2 Paladines Herrera, Gabriel Antonio
Abad, Cristina
author2_role author
author
author_facet Varas Palomeque, Irene Carolina
Paladines Herrera, Gabriel Antonio
Abad, Cristina
author_role author
collection Repositorio Escuela Superior Politécnica del Litoral
dc.creator.none.fl_str_mv Varas Palomeque, Irene Carolina
Paladines Herrera, Gabriel Antonio
Abad, Cristina
dc.date.none.fl_str_mv 2009-10-15
2009-10-15
2009-10-15
dc.format.none.fl_str_mv application/pdf
application/postscript
dc.identifier.none.fl_str_mv http://www.dspace.espol.edu.ec/handle/123456789/7701
dc.language.none.fl_str_mv spa
dc.rights.none.fl_str_mv info:eu-repo/semantics/openAccess
dc.source.none.fl_str_mv reponame:Repositorio Escuela Superior Politécnica del Litoral
instname:Escuela Superior Politécnica del Litoral
instacron:ESPOL
dc.subject.none.fl_str_mv HADDOP
CLOUD COMPUTING
MAPREDUCE
ELASTIC MAPREDUCE
SIMPLE STORAGE SERVICE S3
WIKIPEDIA
DATASET
CLÚSTER EC2.
dc.title.none.fl_str_mv Wikigrep distribuido: búsquedas avanzadas en la wikipedia
dc.type.none.fl_str_mv info:eu-repo/semantics/publishedVersion
info:eu-repo/semantics/article
description In this project we created a regular expressions search engine that uses the Wikipedia database of articles. The system allows the use of to enter a regular expression and makes an asynchronous request to initialize an EC2 cluster; it searches for the pattern inside all the Wikipedia and then returns the result, displaying a list of all the occurrences of the pattern and a link to the Wikipedia Article. We used the Amazon Web Services, Java libraries to manipulate Wikipedia Articles, the Hadoop framework and a dataset of the Wikipedia Articles. We tested some regular expressions that couldn’t be searched for using neither traditional search engines nor the Wikipedia Search Engine. Our tests show that an advanced search engine could be cheap to implement providing high scalability through the use of cloud computing and data-intensive computing techniques.
eu_rights_str_mv openAccess
format article
id ESPOL_aaf6d3beba34232edd475fac63286148
instacron_str ESPOL
institution ESPOL
instname_str Escuela Superior Politécnica del Litoral
language spa
network_acronym_str ESPOL
network_name_str Repositorio Escuela Superior Politécnica del Litoral
oai_identifier_str oai:www.dspace.espol.edu.ec:123456789/7701
publishDate 2009
reponame_str Repositorio Escuela Superior Politécnica del Litoral
repository.mail.fl_str_mv .
repository.name.fl_str_mv Repositorio Escuela Superior Politécnica del Litoral - Escuela Superior Politécnica del Litoral
repository_id_str 1479
spelling Wikigrep distribuido: búsquedas avanzadas en la wikipediaVaras Palomeque, Irene CarolinaPaladines Herrera, Gabriel AntonioAbad, CristinaHADDOPCLOUD COMPUTINGMAPREDUCEELASTIC MAPREDUCESIMPLE STORAGE SERVICE S3WIKIPEDIADATASETCLÚSTER EC2.In this project we created a regular expressions search engine that uses the Wikipedia database of articles. The system allows the use of to enter a regular expression and makes an asynchronous request to initialize an EC2 cluster; it searches for the pattern inside all the Wikipedia and then returns the result, displaying a list of all the occurrences of the pattern and a link to the Wikipedia Article. We used the Amazon Web Services, Java libraries to manipulate Wikipedia Articles, the Hadoop framework and a dataset of the Wikipedia Articles. We tested some regular expressions that couldn’t be searched for using neither traditional search engines nor the Wikipedia Search Engine. Our tests show that an advanced search engine could be cheap to implement providing high scalability through the use of cloud computing and data-intensive computing techniques.2009-10-152009-10-152009-10-15info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleapplication/pdfapplication/postscripthttp://www.dspace.espol.edu.ec/handle/123456789/7701spainfo:eu-repo/semantics/openAccessreponame:Repositorio Escuela Superior Politécnica del Litoralinstname:Escuela Superior Politécnica del Litoralinstacron:ESPOL2018-04-04T13:09:05Zoai:www.dspace.espol.edu.ec:123456789/7701Institucionalhttps://www.dspace.espol.edu.ec/Universidad públicahttps://www.espol.edu.ec/.https://www.dspace.espol.edu.ec/oaiEcuador...opendoar:14792018-04-04T13:09:05falseInstitucionalhttps://www.dspace.espol.edu.ec/Universidad públicahttps://www.espol.edu.ec/.https://www.dspace.espol.edu.ec/oai.Ecuador...opendoar:14792018-04-04T13:09:05Repositorio Escuela Superior Politécnica del Litoral - Escuela Superior Politécnica del Litoralfalse
spellingShingle Wikigrep distribuido: búsquedas avanzadas en la wikipedia
Varas Palomeque, Irene Carolina
HADDOP
CLOUD COMPUTING
MAPREDUCE
ELASTIC MAPREDUCE
SIMPLE STORAGE SERVICE S3
WIKIPEDIA
DATASET
CLÚSTER EC2.
status_str publishedVersion
title Wikigrep distribuido: búsquedas avanzadas en la wikipedia
title_full Wikigrep distribuido: búsquedas avanzadas en la wikipedia
title_fullStr Wikigrep distribuido: búsquedas avanzadas en la wikipedia
title_full_unstemmed Wikigrep distribuido: búsquedas avanzadas en la wikipedia
title_short Wikigrep distribuido: búsquedas avanzadas en la wikipedia
title_sort Wikigrep distribuido: búsquedas avanzadas en la wikipedia
topic HADDOP
CLOUD COMPUTING
MAPREDUCE
ELASTIC MAPREDUCE
SIMPLE STORAGE SERVICE S3
WIKIPEDIA
DATASET
CLÚSTER EC2.
url http://www.dspace.espol.edu.ec/handle/123456789/7701