This item is only available as the following downloads:
The Data Life Cycle and Securing Access to Open Research DataRolando Garcia Milian and Hannah Norton UF Health Science Center Library Open Access Week 2012Image modified from: http://www.fotopedia.com/items/flickr 2194127541
Benefits of sharing data: Data can be used by other researchers with different objectives Accelerate the time of discovery by building upon previous research Results can be reproduced more easily and accurately Researchers receive the credit theyre due Data producers have a new channel by which to promote their work (increase impact of research)
Panton Principles for Open Data in Science Science is based on building on, reusing and openly criticisingthe published body of scientific knowledge. For science to effectively function, and for society to reap the full benefits from scientific endeavours, it is crucial that science data be made open.
Barriers to data sharing No time Lack of funding No place to put it Dont have rights to make data public No standards Others dont need my data
Data Lifecycle** Based on Data Documentation Initiative (DDI) version 3.0 Combined Life Cycle Model Study Concept Data Collection Data Processing Data Distribution Data Archiving Data Discovery Data Analysis Repurposing Data Analysis
Data Lifecycle*Based on Data Documentation Initiative (DDI) version 3.0 Combined Life Cycle Model Image credit: http://openstate.eu/our mission/open data networks/ Study Concept Data Collection Data Processing Data Distribution Data Archiving Data Discovery Data Analysis Repurposing Data Analysis
Data Lifecycle** Based on Data Documentation Initiative (DDI) version 3.0 Combined Life Cycle Model Study Concept Data Collection Data Processing Data Distribution Data Archiving Data Discovery Data Analysis Repurposing Data Analysis Data Management Planning Metadata/annotation
Why do researchers need a DMP?8 To ensure that valuable data resources will be accessible in the future to members of the research team and the broader community. To make life easier by planning ahead and documenting data throughout its life cycle, researchers can save time and focus on research. To increase the visibility of research. To satisfy funders requirements.
Funders requirements Proposals submitted or due on or after January 18, 2011, must include a supplementary document of no more than two pages labeled Data Management Plan This supplementary document should describe how the proposal will conform to NSF policy on the dissemination and sharing of research results. The NIH expects and supports the timely release and sharing of final research data from NIH supported studies for use by other researchers. Starting with the October 1, 2003 receipt date, investigators submitting an NIH application seeking $500,000 or more in direct costs in any single year are expected to include a plan for data sharing or state why data sharing is not possible .
DMP Templates and ToolsTemplates can give you a place to start, as long as you customize them for your project. UF HPC Center links: http://www.hpc.ufl.edu/proposals/ https://dmp.cdlib.org/
Who are you willing to share your data with? 6.3% 16.7% 35.4% 35.4% 95.8%0%20%40%60%80%100% Anyone Others outside of my field Others in my field Others in my department or institute Immediate collaborators Percentage of Respondents n=48
How are you sharing or planning to share your data? 10.0% 46.0% 22.0% 4.0% 68.0% 26.0%0%10%20%30%40%50%60%70% I do not share data Making them available informally to peers on request Making them available online via a project or institutional website Depositing them in UFs Institutional Repository (http://ufdc.ufl.edu/ir) Submitting them to a journal to support a publication Depositing them in a discipline specific data center or repository Percentage of Respondents n=50
Securing Access to Open Research Data
Considerations When Submitting Research Data To An Open Repository The repository adds Digital Object Identifiers (DOIs) / permanent links Metadata/ Standards File format and data structure Subject/research domain http://dx.doi.org/10.5061/dryad.585t4 http://ufdc.ufl.edu/IR00000452/00001 Repository have the technical capacity to embargo or sequester access to data until the content has been approved for release to the public.
Metadata / StandardsExcel file illustrating the structure of a GEO archive Affymetrixdata submissions.
GEO2R compares groups of samples in order to identify differentially expressed genes across experimental conditions The Web Image Browser (Open Cell Centered Database) for viewing and annotating images. WIB reads and manages images similar to Google Maps PhysioToolkit is a library of software for physiologic signalanalysis,and detection of physiologically significant eventsData Repositories With Associated Visualization And Analysis Tools
What repositories are appropriate for a researcher to submit the data to?
Research Data Management at UF Guide http://guides.uflib.ufl.edu/datamanagement
Research Data Management at UF Guide
Research Data Management at UF Guide
Training on Best Practices in Data Management
Publication rate of poster presentations DisciplineReference 47%OrthopaedicDoneganet al. (2010) Publication Rates of Presentations at an Annual Meeting of the American Academy of OrthopaedicSurgeons56%OphthalmologyJuzychet al. (1992) Whatever Happened to Abstracts From Different Sections of the Association for Research in Vision and Ophthalmology?42%NephrologyHarel, et al. (2011) Frequency and factors influencing publication of abstracts presented at three major nephrology meetings42% Reproduction EmbryologyEvers, 2000 Publication bias in reproductive research19% Dental EducationGalanget al. (2011) Factors Influencing Publication Rates of Abstracts Presented at the ADEA Annual Session & Exhibition A Great Number of Poster Presentations Are Never Published
Research papers refer to poster presentation results
Provides permanent links for all or your submitted items (ideal for inclusion in your CV, tenure packet) Receive monthly reports on the usage of your items. Share those items openly and ensure their permanent preservation. Indexed by online search engines (e.g. Google) The UF Institutional Repository
AcknowledgementThis project has been funded in part with federal funds from the National Library of Medicine, National Institutes of Health, under Contract # HHS N 276 2011 00004 C. This presentation is available for re use under a creative commons attribution license.
References Panton Principles, Principles for open data in science. Murray Rust, Peter; Neylon, Cameron; Pollock, Rufus; Wilbanks, John; (19 Feb 2010). Retrieved 10/18/2012 from http://pantonprinciples.org/ C Tenopir, S Allard, K Douglass, AU Aydinoglu, L Wu, E Read, M Manoff, M Frame. 2011. Data Sharing by Scientists: Practices and Perceptions. PLoSONE 6(6): e21101. Data Documentation Initiative (DDI) version 3.0 Combined Life Cycle Model: http://www.ddialliance.org/what