WP32 Cost/benefit data collection and modelling: amending the DoW

The idea is to use this page to amend the text for the WP description in the DoW. Please edit this wiki page directly, and use the "insert" and "delete" mark-up to show changes, as illustrated:

Here is some unchanged text.
<ins>Here is some text to be inserted.</ins>
<del>Here is some text to be deleted.</del>
Here is some more unchanged text

Start month End month WP leader
11 26 BL

Objectives

Create models for the cost of preservation of digital objects Evaluate, test, validate and standardise cost models for the preservation of digital objects The objective of this WP is to evaluate and test cost models for the preservation of digital objects

Description of work and role of partners

Task 3210 Cost parameters

The data holders in the consortium encompass a wide variety of repositories and cost drivers. Many of the cost models which have been published, such as LIFE [37] and [38], are based on a library model with a limited number of parameters collected. The same is true of the work by Beagrie et al [39]. |Fontaine et al [40] collected a very large number of cost parameters; while this limited the ability to produce an explicit models of cost dependencies because there were many more parameters collected for each repository than there were repositories providing data and therefore the number of degrees of freedom was too limited, nevertheless there used a technique of finding similarities for any new proposed repository. Even so, limited attention was paid to do anything other than bit preservation and access; the requirements for preserving the usability of the information were not part of the costings.

There are many related points of view. For example CERN could provide an interesting insight to a discipline where the costs of producing data are enormous, into the billions of Euros, but scant, if any, examples of data preservation and re-use exist. This scenario could stress-test any kind of economic model in extreme conditions. Other data, such as observations of the Earth at particular times, cannot be reproduced because the earth constantly changes.

Cost parameters from selected cost models will be mapped to the trusted repositories ISO or OAIS model to ascertain where cost parameters lie within the digital landscape. This will enable a gap analysis to be carried out as well as an assessment of whether a standard model could be developed in the future. A number of cost models have been published which cover different elements of costs associated with repositories. A review of published cost models will be undertaken to map cost parameters against the ISO for Trusted Repositories (ISO16363). This will show how cost model parameters are concentrated or where areas of activity are not covered.

Task 3220 Cost data collection

Based on the list of cost parameters from task 3210, we will collect cost information, with appropriate anonymisation, from the consortium members and others. If we have had enough data we may be able to model certain aspects of the costs, failing that we will use techniques such as those of Fountain at al, to make cost predictions and compare those to a controlled set of repository data. Based on the list of cost parameters from task 3210, we will collect cost information, with appropriate anonymisation, from the consortium members and others and test published cost models. If we do not have enough data we will review the cost parameters against the ISO for Trusted Repositories further and identify areas for investigation and development.

List of deliverables

  • D32.1 Report on cost parameters for digital repositories (M26)
D32.2 Report on testing of cost models and further analysis of cost parameters (M30)

Description of deliverables

D32.1) Report on cost parameters for digital repositories: This report will present which we believe are important, a summary of the data collected and results of the test using the control set. It will include costing data collected from numerous archives covering a great variety of sizes, types, data holdings and governance. It will contain an investigation of the effect of the various parameters. [month 26] This report will include a high level analysis of published cost models. Results of the mapping exercise between cost parameters and the ISO on Trusted Repositories (ISO16363) will provide commonalities and differences between the cost models. D32.2)The results of the testing of models (with the limited data we have) will be included within this report. Where enough cost data is not available, further analysis of the mapping of cost parameters to the ISO on Trusted Repositories will identify areas for further investigation and development. A review of the results of the survey (included within WP36) related to cost models will also be provided as well as links to the new Coordination Action. [month 30]


Further revisions in response to feedback from Project Officer 12th/14th December

Objective amended to (DRAFT): The objective of this WP is to evaluate and test cost models for the preservation of digital objects

Task3210 - DRAFT A number of cost models have been published which cover different elements of costs associated with repositories. A review of published cost models will be undertaken to map cost parameters against the ISO for Trusted Repositories (ISO16363). This will show how cost model parameters are concentrated or where areas of activity are not covered.

Task 3220 - DRAFT Based on the list of cost parameters from task 3210, we will collect cost information, with appropriate anonymisation, from the consortium members and others and test published cost models. If we do not have enough data we will review the cost parameters against the ISO for Trusted Repositories further and identify areas for investigation and development

Deliverable D32.1 - to be split into two as follows: D32.1 Report on cost parameters for digital repositories due M26 This report will include a high level analysis of published cost models. Results of the mapping exercise between cost parameters and the ISO on Trusted Repositories (ISO16363) will provide commonalities and differences between the cost models.

D32.2 Report on testing of cost models and further analysis of cost parameters due M30 The results of the testing of models (with the limited data we have) will be included within this report. Where enough cost data is not available, further analysis of the mapping of cost parameters to the ISO on Trusted Repositories will identify areas for further investigation and development. A review of the results of the survey (included within WP36) related to cost models will also be provided as well as links to the new Coordination Action.

-- SimonLambert - 2012-10-17

Edit | Attach | Watch | Print version | History: r5 < r4 < r3 < r2 < r1 | Backlinks | Raw View | Raw edit | More topic actions
Topic revision: r5 - 2013-01-09 - KirnnKaur
 
This site is powered by the TWiki collaboration platform Powered by PerlCopyright © 2008-2019 by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
Ideas, requests, problems regarding TWiki? Send feedback