Documentos de Académico
Documentos de Profesional
Documentos de Cultura
CarlWilson
TheBritishLibrary
www.openplanetsfoundation.org
Outline
Whyplan?
Institutionalcontext
Whatisapreservationplan?
Concepts
Thecontentsofapreservationplan
Howtocreateapreservationplan
Howtomakeastart
AlookatPLATO
JISCNewspapers,themakingofapreservationplan
Whatwedid,whywedidit,&whatwelearned
Alternativeapproaches
www.openplanetsfoundation.org
WhyPlan?
Whatisplanning?
Theorganisationalprocessofcreatingaplan
Theprocessofthinkingabouttheactivitiesrequiredtoachieve a
desiredgoal
Individualpeoplearentgoodatplanning
Groupsarebetteratplanning
Facilitatesconsensusthroughcommunication
Recordsdecisionsandtherationalbehindthem
Ifitallgoeswrongattributionandretribution.
www.openplanetsfoundation.org
ReasonstoPlan
Itsgoodtothinkaboutthings
Itprovidesacleargoal
Whereyouwanttogo
Whatareyouroptions
Whatsyourbestoptionandwhy
Thatwarmfuzzyfeeling
Itsgoodtofeelprepared
Youcantplanforeverything
Somethingscantbeforeseenorcontrolled
Limitedplanningresource
Aplannotputintoactionisjustaplan
www.openplanetsfoundation.org
WorkingWithinConstraints
Whynotuseestablishedbestpractice
Letsomebodywhoknowsbetterdoourthinkingforus
Butonesizedoesnotfitall
Thisisinstitutionalcontext
Differentaims
Differentobjectives
Differentbudgets
Thebottomlineisusuallythebottomline
Costisalwaysanissue
Preservationplanningisacostbenefitanalysisexercise
Gettingasclosetoouridealaswecanafford
www.openplanetsfoundation.org
WhyPreservationPlanning?
Issues
Enormousandrapidlyincreasingamountofdigitalinformation
Fragileresourcesandrapidevolutionoftechnology
Obsolescence,corruption,lossofvaluableinformation
Proactive andongoingattention/maintenancerequired
Potentialsolutionsstillfragmented
Stakeholders
Memoryinstitutions(contentholders):archives,libraries
(Scientific)datacentres
Governmentorganisations(recordcreators)
Businesscompanies(recordcreators,intellectualcapital)
Individuals(e.g.familypictures)
Allrequiretrustindigitalresources
www.openplanetsfoundation.org
TrustworthinessinDigitalRepositories
Producersneedtrustindigitalrepositories
Consumersneedtrustindigitalrepositories
Repositoriesneedtrustinexternalproviders
www.openplanetsfoundation.org
TrustworthinessinDigitalRepositories
RLG NationalArchivesandRecordsAdministrationDigitalRepository
CertificationTaskForce
TrustworthyRepositoriesAudit&Certification:CriteriaandChecklist
(TRAC)
NetworkofExpertiseinLongtermSTOrageofDigitalResources(NESTOR)
CatalogueofCriteriaofTrustedDigitalRepositories
DigitalRepositoryAuditMethodBasedonRiskAssessment(DRAMBORA)
Selfassessment
www.openplanetsfoundation.org
TRACandPreservationPlanningI
A3.2Repositoryhasproceduresandpoliciesinplace,andmechanismsfortheir
review,update,anddevelopmentastherepositorygrowsandastechnology
andcommunitypracticeevolve.
Policies,plans,monitoring
A3.6 Repository has a documented history of the changes to its operations,
procedures, software, and hardware that, where appropriate, is linked to
relevantpreservationstrategiesanddescribespotentialeffectsonpreserving
digitalcontent.
Preservationplansneedtraceability
www.openplanetsfoundation.org
TRACandPreservationPlanningII
B3.1Repositoryhasdocumentedpreservationstrategies.
PreservationPlan
B3.3Repositoryhasmechanismstochangeitspreservationplansasaresult
ofitsmonitoringactivities.
Monitorenvironment
Updatepreservationplans
www.openplanetsfoundation.org
NestorCriteria&PreservationPlanning
8.Thedigitalrepositoryhasastrategicplanforitstechnical preservation
measures.
9.2Thedigitalrepositoryidentifieswhichcharacteristicsofthedigitalobjects
aresignificantforinformationpreservation.
Cf.TRACB1.1:
Repositoryidentifiespropertiesitwillpreservefordigitalobjects
www.openplanetsfoundation.org
PreservationPolicy
PreservationPlanningisthepolicyonthecoalface.
Whatisapreservationpolicy?
Highlevelandabstract?
Framework(requirements)areidentifiedatahighlevel
Maycontainexplicitrequirements,e.g.
Emulationovermigration
MigrationtomandatedformatssuchasPDF/A
Institutionshaveformulatedvariousrequirements ascanbe
discoveredindifferenttypesofdocuments
JISCDigitalPreservationPoliciesStudy
http://www.jisc.ac.uk/media/documents/programmes/preservation/jiscpolicy
_p1finalreport.pdf
www.openplanetsfoundation.org
PreservationPolicy:Examples
Examplepolicystatementsofinstitutionswithadigitalpreservation
programme
UKDataArchive
NationalArchivesofAustralia
ISO/TR18492:2005
Longtermpreservationofelectronicdocumentbasedinformation
www.openplanetsfoundation.org
UKDataArchive
UKDataArchivePreservationPolicy
http://www.data
archive.ac.uk/news/publications/UKDAPreservationPolicy0308.pdf
p.11:TheUKDAhaschosentoimplementapreservationstrategy
baseduponopenandavailablefileformats,datamigrationandmedia
refreshment.
Whatdoesthischoicemeaninpractice?Twoexamples:
Emulationisapparently notapreservationstrategythatwillbe
chosen;allobsoletefileswillbemigrated.
Migrationtoopenfileformatswillbepreferred.
www.openplanetsfoundation.org
NationalArchivesofAustralia
AnApproachtothePreservationofDigitalRecords
http://www.naa.gov.au/images/anapproachgreenpaper_tcm2888.pdf
p.14:Thedigitalpreservationprogrammustbeabletopreserveany
digitalrecordthatisbroughtintoNationalArchives custodyregardlessof
theapplicationorsystemitisfromordataformatitisstored in.
Whatdoesthischoicemeaninpractice?Oneexample:
allrecordsthatareaccepted,shouldbepreserved,regardlessfile
format,medium,application,etc.
transformtoopenstandard+keeporiginal format
www.openplanetsfoundation.org
ISO/TR18492:2005
Internationalstandard:Longtermpreservationofelectronicdocument
basedinformation
p.12:Migrationtostandardformats
Storagerepositoriesshouldconsidermigratingelectronicdocument
basedinformationfromthewidevarietyofformatsusedbycreatorsor
recipientstoasmallernumberofstandardized formatsupontheir
transfertothecustodyoftherepository.
Standardized formats couldbeaconsensusonformatsthatarewidely
usedandarelikelytocoveramajorityofaparticularclassof electronic
documentbasedinformation.Proprietaryfileformatsshouldbeavoided.
Amongthetechnologyneutralformatsthatmeritconsiderationare
PDF/A1,XML,TIFFandJPEG.
www.openplanetsfoundation.org
DefinitionofaPreservationPlan
Apreservationplan definesaseriesofpreservationactionstobetaken
byaresponsibleinstitutiontoaddressanidentifiedriskfora givensetof
digitalobjectsorrecords(calledcollection).
ThePreservationPlantakesintoaccountthepreservationpolicies,legal
obligations,organisationalandtechnicalconstraints,userrequirements
andpreservationgoal.Italsodescribesthepreservationcontext,the
evaluatedalternativepreservationstrategiesandtheresultingdecision
foronestrategy,includingtherationaleofthedecision.
www.openplanetsfoundation.org
CharacteristicsofaPreservationPlan
Translationofapreservationpolicy
Specificationofhowtotreatacollectioninagiveninstitutionalsetting
Monitoredfor
changesintechnology
changesinorganisationalsetting
changesinuserrequirements
changesinavailabletools
changesinpreservationmethods
Speciesconcreteaction
Thepreservationactionplancanbeanexecutableworkflow
definition,detailingactionsandrequiredtechnicalenvironment
Thepreservationplanprovidesthecontext/backgroundofthe
preservationactionplan
www.openplanetsfoundation.org
ContentofaPreservationPlan
1.
2.
Identification
Status
Whatwastheimmediatereasonforthisplan?
Hasitbeenapprovedandifso,whenandbywhom
Howdoesitrelatetootherplansrelatedtoaspecifictypeofobjects?
3. Descriptionofinstitutionalsetting
4. Descriptionofthecollection(digitalobjects)
5. Purposeandrequirements
6. Evidenceofdecisionforaspecificpreservationaction
whatisthefoundationofthedecision
descriptionofevaluationofpossibleactions
7. Costsconsiderations
8. Triggerforreevaluation
9. Rolesandresponsibilities
10. Preservationworkflowdocumentation
www.openplanetsfoundation.org
HowToPrepare?
Understandtheorganisationalcontext
mandate/legislation
theorganisationalpolicy
usercommunity
Understandtheobjects
(collectionof)digitalobjects:characteristics
Understandtheinfrastructure
technology(past,present,future),infrastructure
people,knowledge,skills
Availableoptions
potentialmethods/strategies
Decisionmakingprocess:preservationplanning
www.openplanetsfoundation.org
Potentialresources
Documentation
Mandate/vision/missionstatements
Policydocuments(iftheyexist)
Projectplans
Guidelines
Procedures/rules
People
Administrators,managers
Producers
Domainexperts,e.g.Curators,ITStaff,Lawyers
Consumers,users
www.openplanetsfoundation.org
ObjectivesofPreservationPlanning
Supportdecisionmakingaboutdigitalpreservation
Identifycriteriaforpreservation
Workflowforevaluatingalternativesanddefiningpreservationplans
Developmethodologiesforassessingtherisksofapplyingdifferent
preservationstrategiesfordifferenttypesofdigitalobjects
Produceandevaluateprototypepreservationworkflows
Enableformulation,evaluationandexecutionofhighqualitycost
effectivepreservationworkflowsthatsuittheorganisationalneeds
Supporttheongoingevaluationoftheresultsofexecutingpreservation
workflowsandprovideafeedbackmechanism
Documenttheplanningprocesscarefully
www.openplanetsfoundation.org
PreservationPlanningandOAIS
www.openplanetsfoundation.org
PreservationPlanningEnvironment
www.openplanetsfoundation.org
PreservationPlanninginPlato
WebbasedplanningtoolimplementingthePlanetspreservationplanning
workflow
Publiclyavailable
Automationoftheplanningprocess
Integrationofregistriesandservicesfor
Fileformatidentification
Preservationaction(migration,emulation)
Characterisationandcomparison
Knowledgebasetosupportplanning
Release3.0.1
http://www.ifs.tuwien.ac.at/dp/plato
www.openplanetsfoundation.org
DemonstrationsScenario:
NewspapersandPreservationattheBL
NewspapersareofhighstrategicimportancefortheBL
Seriesofmassdigitisationprojectsongoing
4millionpagesayear
Economicandenvironmentalcostisbecomingincreasinglysignificant
Balancingriskwithcost:thebigchallenge!
www.openplanetsfoundation.org
TheCollection:JISC1Newspapers
19th centurynewspapersdigitisedaspartoftheJISC1Project
Digitalmasters:
Highvolume:80TB,3millionpages
TIFFimages,losslesscompression
~30MBperpage
300dpi,8bitgreyscale
Serviceimages,XML,OCRdtext
FocusonTIFFmasters
Master (TIF)
www.openplanetsfoundation.org
Preingest:AimandKeyRequirements
PlanetsCaseStudy
ExerciseextendedtodeliverresultstotheBLDigitalLibraryProgramme
Aim:
Considerpreingestmigrationtoreduceprojectedstoragecosts
KeyRequirements:
Reducecosts
Ensurequalitylevelsufficientforfutureusecases
Minimizepreservationrisk
Ensuresignificantpropertiesareretained
ImplementationofresultsatBL
www.openplanetsfoundation.org
Considerations
CostBenefitAnalysis
Balancingthelongtermpreservationcostsagainsttheoneoffprocess
costsofformatmigration
Bettercostingmethodsrequired
LIFE3resultsforlongtermpreservationcosts
Testingandexperienceforpreservationworkflowcosts
PreservationRequirements
OpenInternationalStandardspreferredforfileformats
Filesize,smallerischeaper
AutomatedQualityAssuranceofprocessanecessity
www.openplanetsfoundation.org
TheResults
JPEG2000ChosenoverTIFF
Losslesscompression,saveonstoragecosts
AutomatedQApossible
OptionsRejected
Donothing,TIFFfilesarelargewithpoorcompression
ConversiontoJPEG,smallfilesizesbutlossycompression
ConversiontoPNG,orBMP,bothconsideredpoorarchivalformats
duetopoorcompression,andlimitedcolourspace
www.openplanetsfoundation.org
WhatWeLearned
Testyourworkflow
Thentestitagain
TIFFimageresolutiondatanotretainedbyKakadu
ItisstillintheaccompanyingMETSfile
Talkingaboutqualityassuranceisnotenough
Automated Testingaddressedintheplan
Automated Testingabsentfromthefinalworkflow
Testingishard
APLATOplanisnotaperfectrecord
Doesntrecordrationaleforallcriteria,e.g.imagequality
www.openplanetsfoundation.org
ItsAllAbouttheProcess
Thebesttoolsarenosubstituteforgoodprocess
PreservationPlanningspecifiesthepreservationworkflow
Prototypeworkflowsonsmallsamplesarenotthesameasproduction
workflows
Contentneedsachampion
Thatchampionneedstobeinvolvedfromstarttofinish
www.openplanetsfoundation.org
MightWeNeedAlternatives?
WhatsgoodaboutPLATO
Itmakesyouthinkabouttherightthings
Itprovidesastructurefortheprocess
Availableonline
Agreatintroductiontopreservationplanning
Plansbecomeapublicresource
Whatsnotsogood?
Onlycopeswithsmallsamplesofhomogenouscontent
Arbitraryweightingandscoringofcriteria,smallchangescanhave
hugeeffects
DependentonavailabilityofPLATOtorenderplansinthefuture
Insufficientnarrative
www.openplanetsfoundation.org
Howtoproceed?
Discuss
Documenttemplates?
Makesharedpreservationplansavailable?
Forumtodiscusswarstories?
Bestpractise?
ProvideasecondPLATOinstance?
www.openplanetsfoundation.org