
Web ARChive

From Wikipedia, the free encyclopedia
This is an old revision of this page, last edited on 25 April 2014 at 23:07 by 64.147.222.129 (talk) (Software). It may differ significantly from the current revision.

The Web ARChive (WARC) archive format specifies a method for combining multiple digital resources into an aggregate archive file together with related information. The WARC format is a revision of the Internet Archive's ARC File Format [ARC_IA] that has traditionally been used to store "web crawls" as sequences of content blocks harvested from the World Wide Web. The WARC format generalizes the older format to better support the harvesting, access, and exchange needs of archiving organizations. Besides the primary content currently recorded, the revision accommodates related secondary content, such as assigned metadata, abbreviated duplicate detection events, and later-date transformations.[1]
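The idea of aggregating several resources, each with its related information, into one file can be illustrated with a minimal Python sketch. It is not taken from the article: the record layout (a `WARC/1.0` version line, named header fields, a blank line, the content block, then two CRLF pairs) and the field names follow the published WARC 1.0 specification as commonly described, while the helper names `build_record` and `build_archive` and the example URI are hypothetical. It is a rough illustration under those assumptions, not a conforming writer.

```python
import uuid
from datetime import datetime, timezone
from typing import Iterable, Optional

CRLF = b"\r\n"


def build_record(record_type: str, payload: bytes,
                 target_uri: Optional[str] = None) -> bytes:
    """Serialize one record: version line, header fields, blank line, block.

    Hypothetical helper; field names are assumed from the WARC 1.0 spec.
    """
    headers = [
        ("WARC-Type", record_type),
        ("WARC-Record-ID", f"<urn:uuid:{uuid.uuid4()}>"),
        ("WARC-Date",
         datetime.now(timezone.utc).strftime("%Y-%m-%dT%H:%M:%SZ")),
        ("Content-Length", str(len(payload))),
    ]
    if target_uri is not None:
        headers.append(("WARC-Target-URI", target_uri))

    head = b"WARC/1.0" + CRLF
    for name, value in headers:
        head += f"{name}: {value}".encode("utf-8") + CRLF

    # A blank line separates the header from the block; two CRLF pairs
    # terminate the record.
    return head + CRLF + payload + CRLF + CRLF


def build_archive(records: Iterable[bytes]) -> bytes:
    """Concatenate serialized records into one aggregate byte string."""
    return b"".join(records)


if __name__ == "__main__":
    primary = build_record("resource", b"<html>hello</html>",
                           "http://example.org/")
    secondary = build_record("metadata", b"note: example crawl")
    archive = build_archive([primary, secondary])
    print(archive.decode("utf-8"))
```

Here a primary content record and a secondary metadata record end up in the same byte stream, which mirrors the distinction the paragraph draws between primary and related secondary content.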

References



  1. http://www.digitalpreservation.gov/formats/fdd/fdd000236.shtml