Article overview
Formats over Time: Exploring UK Web History
Author: Andrew N. Jackson
Date: 5 Oct 2012
Abstract: Is software obsolescence a significant risk? To explore this issue, we
analysed a corpus of over 2.5 billion resources corresponding to the UK Web
domain, as crawled between 1996 and 2010. Using the DROID and Apache Tika
identification tools, we examined each resource and captured the results as
extended MIME types, embedding version, software and hardware identifiers
alongside the format information. The combined results form a detailed temporal
format profile of the corpus, which we have made available as open data. We
present the results of our initial analysis of this dataset. We look at image,
HTML and PDF resources in some detail, showing how the usage of different
formats, versions and software implementations has changed over time.
Furthermore, we show that software obsolescence is rare on the web and uncover
evidence indicating that network effects act to stabilise formats against
obsolescence.
Source: arXiv, 1210.1714