Published on Snurblog (http://snurb.info)

Home > Non-Content Features of Material in the Internet Archive

Non-Content Features of Material in the Internet Archive

Tue, 24/05/2016 - 01:11 — Snurb
Internet Content Preservation [1]
WebSci '16 [2]

The third presenter in this Web Science 2016 [3] session is Tu Ngoc Nguyen, who reintroduces us to the Internet Archive's Wayback Machine. This is a useful service, but searching it is not necessarily straightforward. Is it possible to draw on the non-content features to improve search results?

The project drew on the full archive for the German Web, and utilised a number of assessment techniques to assess and rank documents based on twenty non-content features. I'm frankly unable to understand the numerical data presented in the tables here, but from what I do understand the use of these additional features does improve the retrievability of relevant information. Sorry!

[Creative Commons Attribution-NonCommercial-ShareAlike 2.0 License]
Except where otherwise noted, this work is licensed under a Creative Commons License. -->

Source URL:http://snurb.info/node/2098

Links
[1] http://snurb.info/taxonomy/term/30 [2] http://snurb.info/taxonomy/term/160 [3] http://www.websci16.org/