Use este identificador para citar ou linkar para este item: http://www.repositorio.ufop.br/jspui/handle/123456789/1676
Título: Genealogical trees on the web : a search engine user perspective.
Autor(es): Yates, Ricardo Baeza
Pereira Junior, Álvaro Rodrigues
Ziviani, Nivio
Palavras-chave: Web
Text
Content evolution
Search engine
Web mining
Data do documento: 2008
Referência: YATES, R. B.; PEREIRA JÚNIOR, A. R.; ZIVIANI, N. Genealogical trees on the web : a search engine user perspective. In. 17th International World Wide Web Conference, 17,. 2008. Beijing. Anais... Beijing: International World Wide Web Conference, 2008. Disponível em: <http://homepages.dcc.ufmg.br/~nivio/papers/www08.pdf>. Acesso em: 18 out. 2012.
Resumo: This paper presents an extensive study about the evolution of textual content on the Web, which shows how some new pages are created from scratch while others are created using already existing content. We show that a significant fraction of the Web is a byproduct of the latter case. We introduce the concept of Web genealogical tree, in which every page in a Web snapshot is classified into a component. We study in detail these components, characterizing the copies and identifying the relation between a source of content and a search engine, by comparing page relevance measures, documents returned by real queries performed in the past, and click-through data. We observe that sources of copies are more frequently returned by queries and more clicked than other documents.
URI: http://www.repositorio.ufop.br/handle/123456789/1676
Aparece nas coleções:DECOM - Trabalhos apresentados em eventos

Arquivos associados a este item:
Arquivo Descrição TamanhoFormato 
EVENTO_GenealogicalTreesWeb.pdf567,71 kBAdobe PDFVisualizar/Abrir


Os itens no repositório estão protegidos por copyright, com todos os direitos reservados, salvo quando é indicado o contrário.