Citizendia
Your Ad Here

Digital preservation refers to the management of digital information over time. A digital system uses discrete (discontinuous values usually but not always Symbolized Numerically (hence called "digital" to represent information for Preservation of digital information is widely considered to require more constant and ongoing attention than preservation of other media[1]. This constant input of effort, time, and money to handle rapid technological and organisational advance is considered the main stumbling block for preserving digital information. Indeed, while we are still able to read our written heritage from several thousand years ago, the digital information created merely a decade ago is in serious danger of being lost, creating a digital Dark Age. The Digital Dark Age is a term used to describe a possible future situation where it will be difficult or impossible to read historical documents because they have been stored in an obsolete

Digital preservation can therefore be seen as the set of processes and activities that ensure continued access to information and all kinds of records, scientific and cultural heritage existing in digital formats. This includes the preservation of materials resulting from Digital Reformatting, but particularly information that is born-digital and has no analog counterpart. Digital reformatting is the process of converting analogue materials into a digital format as a surrogate of the original Born-digital is a term from Digital preservation describing digital materials that originated in the digital realm and have no print or analog counterpart In the language of digital imaging and electronic resources, preservation is no longer just the product of a program but an ongoing process. In this regard the way digital information is stored is important in ensuring their longevity. The long-term storage of digital information is assisted by the inclusion of Preservation Metadata. Preservation metadata is an essential component of most Digital preservation strategies

Digital preservation is defined as: long-term, error-free storage of digital information, with means for retrieval and interpretation, for the entire time span the information is required for. Long-term is defined as "long enough to be concerned with the impacts of changing technologies, including support for new media and data formats, or with a changing user community. Long Term may extend indefinitely”[2]. "Retrieval" means obtaining needed digital files from the long-term, error-free digital storage, without possibility of corrupting the continued error-free storage of the digital files. "Interpretation" means that the retrieved digital files, files that, for example, are of texts, charts, images or sounds, are decoded and transformed into usable representations. This is often interpreted as "rendering", i. e. making it available for a human to access. However, in many cases it will mean able to be processed by computational means.

Contents

Why active preservation is necessary

Society's heritage has been presented on many different materials, including stone, vellum, bamboo, silk, paper and etc. Now a large quantity of information exists in digital forms, including emails, blogs, social networking websites, national elections websites, web photo albums, and sites which change their content over time. According to a report by the US Library of Congress, 44% of the sites available on the internet in 1998 had vanished one year later[3]. The Library of Congress is the De facto National library of the United States and the research arm of the United States Congress

The unique characteristic of digital forms makes it easy to create content and keep it up-to-date, but at the same time brings many difficulties in the preservation of this content. Margaret Hedstrom points out that “digital preservation raises challenges of a fundamentally different nature which are added to the problems of preserving traditional format materials. Margaret Hedstrom is an Information science researcher and a pioneer of research into the area of longevity of digital materials including electronic records[4]

Physical deterioration

The first challenge digital preservation faces is that the media on which digital contents stand are more vulnerable to deterioration and catastrophic loss. While acid paper are prone to deterioration in terms of brittleness and yellowness, the deterioration does not become apparent in at least six decades; and when the deterioration really happens, it happens over decades too. It is also highly possible to retrieve all information without loss after deterioration is spotted. The recording media for digital data deteriorate at a much more rapid pace, and once the deterioration starts, in most cases there is already data loss. This characteristic of digital forms leaves a very short time frame for preservation decisions and actions.

Digital obsolescence

Another challenge, perhaps a more serious and important one, is the problem of long-term access. Digital technology is developing extremely fast, and one retrieval and playback technology can become obsolete in a matter of years. When faster, more capable and cheaper storage and processing devices are developed, the older version gets replaced almost immediately. When a software or decoding technology is abandoned, or a hardware device is no longer in production, records created under the environment of such technologies are at great risk of loss, simply because they are not tangible any more. This process is known as digital obsolescence. Digital obsolescence is a situation where a digital resource is no longer readable because the physical media the reader required to read the media the hardware or the software that

This challenge is exacerbated by the lack of established standards, protocols, and proven methods for preserving digital information[5]. We used to save copies of data on tapes, but media standards for tapes have changed considerably over the last five to ten years, and there is no guarantee that tapes will be readable in the future[6]. Hedstrom further explained that almost all digital library researches have been focused on “architectures and systems for information organization and retrieval, presentation and visualization, and administration of intellectual property rights” and that “digital preservation remains largely experimental and replete with the risks associated with untested methods”. While the rapid advance of technology threats access of digital contents in length, the lack of digitizing standards affects the issue in width.

Strategies

In 2006, the Online Computer Library Center (OCLC) developed a four-point strategy for the long-term preservation of digital objects that consisted of:

There are several additional strategies that individuals and organizations may use to actively combat the loss of digital information.

Refreshing

Refreshing is the transfer of data between two types of the same storage medium so there are no bitrate changes or alteration of data [8]. In Telecommunications and Computing, bitrate (sometimes written bit rate, data rate or as a Variable R or f b For example, transferring census data from a gold preservation CD to a new one. A census is the procedure of acquiring information about every member of a given population A Compact Disc (also known as a CD) is an Optical disc used to store digital data, originally developed for storing digital audio This strategy may need to be combined with migration when the software or hardware required to read the data is no longer available or is unable to understand the format of the data. Typical PC hardware A typical Personal computer consists of a case or chassis in a tower shape (desktop and the following parts Motherboard Refreshing will likely always be necessary due to the deterioration of physical media.

Migration

Migration is the transferring of data to newer system environments (Garrett et al. , 1996). This may include conversion of resources from one format to another (e. g. , conversion of Microsoft Word to PDF or OpenDocument), from one operating system to another (e. Microsoft Word is Microsoft 's flagship word processing software. The OpenDocument format (ODF is a File format for electronic office documents such as Spreadsheets Charts presentations and g. , Solaris to Linux) or from one programming language to another (e. Solaris is a Unix -based Operating system introduced by Sun Microsystems in 1992 as the successor to SunOS. Linux (commonly pronounced ˈlɪnəks A programming language is an Artificial language that can be used to write programs which control the behavior of a machine particularly a Computer. g. , C to Java) so the resource remains fully accessible and functional. tags please moot on the talk page first! --> In Computing, C is a general-purpose cross-platform block structured Resources that are migrated run the risk of losing some type of functionality since newer formats may be incapable of capturing all the functionality of the original format, or the converter itself may be unable to interpret all the nuances of the original format. The latter is often a concern with proprietary data formats.

The National Archives Electronic Records Archives and Lockheed Martin are jointly developing a migration system that will preserve any type of document, created on any application or platform, and delivered to the archives on any type of digital media (Reagan, 2006). The United States National Archives and Records Administration ( NARA) is an independent agency of the United States federal government charged Lockheed Martin ( is a large multinational Aerospace manufacturer and advanced technology Company formed in 1995 by the merger of In the system, files are translated into flexible formats, such as XML; they will therefore be accessible by technologies in the future (Reagan, 2006). Lockheed Martin argues that it would be impossible to develop an emulation system for the National Archives ERA because the volume of records and cost would be prohibitive (Reagan, 2006).

Replication

Creating duplicate copies of data on one or more systems is called replication. Data that exists as a single copy in only one location is highly vulnerable to software or hardware failure, intentional or accidental alteration, and environmental catastrophes like fire, flooding, etc. Digital data is more likely to survive if it is replicated in several locations. Replicated data may introduce difficulties in refreshing, migration, versioning, and access control since the data is located in multiple places. Access control is the ability to permit or deny the use of a particular resource by a particular entity

Emulation

Emulation is the replicating of functionality of an obsolete system (Rothenberg, 1998). For example, emulating an Atari 2600 on a Windows system or emulating WordPerfect 1.0 on a Macintosh. The Atari 2600 is a Video game console released in October 1977 Microsoft Windows is a series of Software Operating systems and Graphical user interfaces produced by Microsoft. WordPerfect is a proprietary Word processing application At the height of its popularity in the late 1980s and early 1990s it was the De facto Macintosh, commonly nicknamed Mac is a Brand name which covers several lines of Personal computers designed developed and marketed by Apple Inc Emulators may be built for applications, operating systems, or hardware platforms. An emulator duplicates (provides an emulation of the functions of one System using a different system so that the second system behaves like (and appears to Emulation has been a popular strategy for retaining the functionality of old video game systems. The feasibility of emulation as a catch-all solution has been debated in the academic community (Granger, 2000).

Raymond A. Lorie has suggested a Universal Virtual Computer (UVC) could be used to run any software in the future on a yet unknown platform (Lorie, 2001). A Universal Virtual Computer (UVC is much like a Virtual machine (VM in computing by means that it creates a layer between the underlying Computer platform The UVC strategy uses a combination of emulation and migration. The UVC strategy has not yet been widely adopted by the digital preservation community.

Jeff Rothenberg, a major proponent of Emulation for digital preservation in libraries, working in partnership with Koninklijke Bibliotheek and National Archief of the Netherlands, has recently helped launch Dioscuri, a modular emulator that succeeds in running MS-DOS, WordPerfect 5. 1, DOS games, and more (Hoeven, 2007).

Metadata attachment

Metadata is data on a digital file that includes information on creation, access rights, restrictions, preservation history, and rights management [9]. Metadata ( meta data, or sometimes metainformation) is "data about data" of any sort in any media Metadata attached to digital files may be affected by file format obsolescence. ASCII is considered to be the most durable format for metadata [10] because it is widespread, backwards compatible when used with Unicode, and utilizes human-readable characters, not numeric codes. American Standard Code for Information Interchange ( ASCII) It retains information, but not the structure information is presented in. For higher functionality, SGML or XML should be used. The Standard Generalized Markup Language ( ISO 88791986 SGML) is an ISO Standard Metalanguage in which one can define Markup languages Don't change "Extensible" Both markup languages are stored in ASCII format, but contain tags that denote structure and format.

Trustworthy digital objects

Digital objects that can speak to their own authenticity are called trustworthy digital objects (TDOs). TDOs were proposed by Henry M. Gladney to enable digital objects to maintain a record of their change history so future users can know with certainty that the contents of the object are authentic (Gladney, 2004). Other preservation strategies like replication and migration are necessary for the long-term preservation of TDOs.

Digital sustainability

Digital sustainability encompasses a range of issues and concerns that contribute to the longevity of digital information[11]. Unlike traditional, temporary strategies and more permanent solutions, digital sustainability implies a more active and continuous process. Digital sustainability concentrates less on the solution and technology and more on building an infrastructure and approach that is flexible with an emphasis on interoperability, continued maintenance and continuous development[12]. Digital sustainability incorporates activities in the present that will facilitate access and availability in the future.

Digital preservation standards

To standardize digital preservation practice and provide a set of recommendations for preservation program implementation, the Reference Model for an Open Archival Information System (OAIS) was developed. An Open Archival Information System (or OAIS) is an Archive, consisting of an organization of people and systems that has accepted the responsibility to preserve The reference model (ISO 14721:2003) includes the following responsibilities that an OAIS archive must abide by:

OAIS is concerned with all technical aspects of a digital object’s life cycle: ingest into and storage in a preservation infrastructure, data management, accessibility, and distribution. The model also addresses metadata issues and recommends that five types of metadata be attached to a digital object: reference (identification) information, provenance (including preservation history), context, fixity (authenticity indicators), and representation (formatting, file structure, and what “imparts meaning to an object’s bitstream” [14]. Prior to Gladney's proposal of TDOs was the Research Library Group's (RLG) development of “attributes and responsibilities” that denote the practices of a “Trusted Digital Repository” (TDR) The seven attributes of a TDR are: “compliance with the Reference Model for an Open Archival Information System (OAIS), Administrative responsibility, Organizational viability, Financial sustainability, Technological and procedural suitability, System security, Procedural accountability. " Among RLG’s attributes and responsibilities were recommendations calling for the collaborative development of digital repository certifications, models for cooperative networks, and sharing of research and information on digital preservation with regards to intellectual property rights. [15].

Digital sound preservation standards

In January 2004, the Council on Library and Information Resources (CLIR) hosted a roundtable meeting of audio experts discussing best practices, which culminated in a report delivered March 2006. This report investigated procedures for reformatting sound from analog to digital, summarizing discussions and recommendations for best practices for digital preservation. Participants made a series of 15 recommendations for improving the practice of analog audio transfer for archiving:

Examples of digital preservation initiatives

Large-Scale digital preservation initiatives (LSDIs)

Many research libraries and archives have begun or are about to begin Large-Scale digital preservation initiatives (LSDI’s). The main players in LSDIs are cultural institutions, commercial companies such as Google and Microsoft, and non-profit groups including the Open Content Alliance (OCA) and the Million Book Project (MBP). The Open Content Alliance (OCA is a consortium of non-profit and for-profit groups dedicated to building a free archive of digital text and multimedia The Million Book Project (or the Universal Library) led by Carnegie Mellon University School of Computer Science and University Libraries aims to digitize a The primary motivation of these groups is to expand access to scholarly resources.

LSDIs: Library Perspective

Approximately 30 cultural entities, including the 12-member Committee on Institutional Cooperation (CIC), have signed digitization agreements with either Google or Microsoft. The Committee on Institutional Cooperation (CIC, also known as the "Academic Big Ten" was established in 1958 and is a consortium of twelve primarily Midwestern Several of these cultural entities are participating in the Open Content Alliance (OCA) and the Million Book Project (MBP). The Open Content Alliance (OCA is a consortium of non-profit and for-profit groups dedicated to building a free archive of digital text and multimedia The Million Book Project (or the Universal Library) led by Carnegie Mellon University School of Computer Science and University Libraries aims to digitize a Some libraries are involved in only one initiative and others have diversified their digitization strategies through participation in multiple initiatives. The three main reasons for library participation in LSDIs are: Access, Preservation and Research and Development. It is hoped that digital preservation will ensure that library materials remain accessible for future generations. Libraries have a perpetual responsibility for their materials and a commitment to archive their digital materials. Libraries plan to use digitized copies as backups for works in case they go out of print, deteriorate, or are lost and damaged.


References

  1. ^ Lifecycle Information for E-literature. LIFE. Retrieved on 2007-06-14. Year 2007 ( MMVII) was a Common year starting on Monday of the Gregorian calendar in the 21st century. Events 1276 - While taking exile in Fuzhou in southern China, away from the advancing Mongol invaders, the remnants of the
  2. ^ Consultative Committee for Space Data Systems. (2002). Reference Model for an Open Archival Information System (OAIS). Washington, DC: CCSDS Secretariat, p. 1-1
  3. ^ "U.S. Congress Approves Library of Congress Plan for Preservation of Digital Materials", Library of Congress, 2003-02-07. Year 2003 ( MMIII) was a Common year starting on Wednesday of the Gregorian calendar. Events 457 - Leo I becomes emperor of the Byzantine Empire. 1074 - Battle of Montesarchio in which the Prince Retrieved on 2008-04-23
  4. ^ Hedstrom, M. 2008 ( MMVIII) is the current year in accordance with the Gregorian calendar, a Leap year that started on Tuesday of the Common Events 215 BC - A temple is built on the Capitoline Hill dedicated to Venus Erycina to commemorate the Roman defeat at (1997). Digital preservation: a time bomb for Digital Libraries. Retrieved on December 4th, 2007, from http://www.uky.edu/~kiernan/DL/hedstrom.html.
  5. ^ Levy, D. M. & Marshall, C. C. (1995). Going digital: a look at assumptions underlying digital libraries," Communications of the ACM, 58, No. 4: 77-84.
  6. ^ Flugstad, Myron. (2007). Website Archiving: the Long-Term Preservation of Local Born Digital Resources. Arkansas Libraries v. 64 no. 3 (Fall 2007) p. 5-7
  7. ^ Online Computer Library Center, Inc. (2006). OCLC Digital Archive Preservation Policy and Supporting Documentation, p. 5
  8. ^ Cornell University Library. (2005) Digital Preservation Management: Implementing Short-term Strategies for Long-term Problems.
  9. ^ NISO Framework Advisory Group. (2004). A Framework of Guidance for Building Good Digital Collections, 2nd edition, p. 27,
  10. ^ National Initiative for a Networked Cultural Heritage. (2002). NINCH Guide to Good Practice in the Digital Representation and Management of Cultural Heritage Materials
  11. ^ Bradley, K. (Summer 2007). Defining digital sustainability. Library Trends v. 56 no 1 p. 148-163.
  12. ^ Sustainability of Digital Resources. (2008). TASI: Technical Advisory Service for Images.
  13. ^ Consultative Committee for Space Data Systems. (2002). Reference Model for an Open Archival Information System (OAIS). Washington, DC: CCSDS Secretariat, p. 3-1
  14. ^ Cornell University Library. (2005) Digital Preservation Management: Implementing Short-term Strategies for Long-term Problems
  15. ^ Research Libraries Group. (2002). [http://www.oclc.org/programs/ourwork/past/trustedrep/repositories.pdf/ Trusted Digital Repositories: Attributes and Responsibilities]
  16. ^ Council on Library and Information Resources. Publication 137: Capturing Analog Sound for Digital Preservation: Report of a Roundtable Discussion of Best Practices for Transferring Analog Discs and Tapes March 2006


See also

External links

Digital Asset Management consists of tasks and decisions surrounding ingesting annotating cataloguing storage and retrieval of Digital assets such as Digital photographs Data Format Management ( DFM) is the application of a systematic approach to the selection and use of the data formats used to encode information for storage on a computer Digital curation is the Curation, Preservation, maintenance and collection and Archiving of Digital assets Digital obsolescence is a situation where a digital resource is no longer readable because the physical media the reader required to read the media the hardware or the software that A digital object identifier ( DOI) is a permanent identifier given to an Electronic document. Digital reformatting is the process of converting analogue materials into a digital format as a surrogate of the original Enterprise content management ( ECM) is a set of technologies used to capture store preserve and deliver content and documents and content related to organizational A file format is a particular way to encode information for storage in a Computer file. Information Lifecycle Management refers to a wide-ranging set of strategies for administering Storage Systems on Computing devices The Library of Congress National Digital Library Program (NDLP is assembling a digital Library of reproductions of Primary source materials to support the study of Metadata ( meta data, or sometimes metainformation) is "data about data" of any sort in any media The National Digital Information Infrastructure and Preservation Program is a national strategic program being led by the Library of Congress to preserve digital content New media art preservation, a form of Art conservation, is the study and practice of techniques for sustaining artworks created using digital biological performative and other Preservation metadata is an essential component of most Digital preservation strategies The Section 108 Study Group is a select committee of copyright experts convened by the Library of Congress, and charged with updating for the digital world the United States Trustworthy Repositories Audit & Certification (TRAC is a checklist that developed from work done by the OCLC / RLG Programs and National Archives and Records A Universal Virtual Computer (UVC is much like a Virtual machine (VM in computing by means that it creates a layer between the underlying Computer platform A web crawler (also known as a web spider, web robot, or—especially in the FOAF community— web scutter) is a program or automated
© 2009 citizendia.org; parts available under the terms of GNU Free Documentation License, from http://en.wikipedia.org
Dapyx Software network: MP3 Explorer | Ebook Manager | Zenithic