IC Talk: Data and Code Sharing for Open Science
Event details
Date | 16.10.2019 |
Hour | 14:30 › 15:45 |
Location | |
Category | Conferences - Seminars |
By: Merce Crosas - Harvard University
Abstract:
A critical element of Open Science is open access to research data and code. Successful implementation of widely used solutions for sharing open data must combine technology, standards, and incentives. For the last 15 years, the Dataverse project has focused on these three aspects to enhance research data sharing and enable access to tens of thousands of research datasets around the word. Currently, the Dataverse project is working to add two crucial components to continue supporting open science: 1) sharing code associated with the data and reproducible packages to verify scientific results, and 2) standardized data policies and differential privacy features to enable sharing and analysis of sensitive data. This talk will summarize the achievements made so far and outline the future improvements.
Bio:
Mercè Crosas is Harvard University’s Research Data Officer, with Harvard University Information Technology (HUIT), and Chief Data Science and Technology Officer at Harvard’s Institute for Quantitative Social Science (IQSS).
In her role at HUIT, Dr. Crosas provides leadership to mature Harvard’s data management and governance practices. She works in close collaboration with key constituencies in Research, Information Technology, and the Library to coordinate support for the data lifecycle and guide university policy, process, and procedures for research data. Dr. Crosas brings to this role a wealth of experience in data management architecture and international community data standards as well as the vision to make data more accessible for research while preserving privacy.
At IQSS, Dr. Crosas guides the vision and strategic direction of data sharing and data analysis projects developed at the Institute. She has led the Dataverse project, an open source software platform for sharing and archiving research data, since 2006. She now, also, leads the development of the text analysis tool Consilience, the DataTags project for sharing sensitive data, and supervises the user experience, data curation, and data science services teams.
In the last ten years, Dr. Crosas has been PI and co-PI of multiple research grants and collaborations related to data privacy, data provenance, research reproducibility, and data sharing in social science, biomedicine, and astronomy. She is part of numerous committees and working groups focused on research data management, data citation, and data standards, and is a co-author of the FAIR (Findable, Accessible, Interoperable, Reusable) data principles as well as the Joint Declaration of Data Citation Principles.
Before re-joining Harvard in 2004, Dr. Crosas worked for six years in the educational software and biotech industries, initially as a software developer, and subsequently as director of the software development team. She contributed to the development of lab information management systems (LIMS) for SNP discovery and genotyping and mass spectrometry. Before that, she spent six years at the Harvard-Smithsonian Center for Astrophysics, first as a pre-doctoral fellow for her Ph.D. in Astrophysics from Rice University, and later as a post-doctoral fellow, researcher, and software engineer with the Radioastronomy division. There she worked on Monte Carlo simulations of radiative transfer in evolved stars and contributed to the software for the Submillimeter Array interferometer. She earned a B.S. in Physics from the Universitat de Barcelona, Spain.
More information
Abstract:
A critical element of Open Science is open access to research data and code. Successful implementation of widely used solutions for sharing open data must combine technology, standards, and incentives. For the last 15 years, the Dataverse project has focused on these three aspects to enhance research data sharing and enable access to tens of thousands of research datasets around the word. Currently, the Dataverse project is working to add two crucial components to continue supporting open science: 1) sharing code associated with the data and reproducible packages to verify scientific results, and 2) standardized data policies and differential privacy features to enable sharing and analysis of sensitive data. This talk will summarize the achievements made so far and outline the future improvements.
Bio:
Mercè Crosas is Harvard University’s Research Data Officer, with Harvard University Information Technology (HUIT), and Chief Data Science and Technology Officer at Harvard’s Institute for Quantitative Social Science (IQSS).
In her role at HUIT, Dr. Crosas provides leadership to mature Harvard’s data management and governance practices. She works in close collaboration with key constituencies in Research, Information Technology, and the Library to coordinate support for the data lifecycle and guide university policy, process, and procedures for research data. Dr. Crosas brings to this role a wealth of experience in data management architecture and international community data standards as well as the vision to make data more accessible for research while preserving privacy.
At IQSS, Dr. Crosas guides the vision and strategic direction of data sharing and data analysis projects developed at the Institute. She has led the Dataverse project, an open source software platform for sharing and archiving research data, since 2006. She now, also, leads the development of the text analysis tool Consilience, the DataTags project for sharing sensitive data, and supervises the user experience, data curation, and data science services teams.
In the last ten years, Dr. Crosas has been PI and co-PI of multiple research grants and collaborations related to data privacy, data provenance, research reproducibility, and data sharing in social science, biomedicine, and astronomy. She is part of numerous committees and working groups focused on research data management, data citation, and data standards, and is a co-author of the FAIR (Findable, Accessible, Interoperable, Reusable) data principles as well as the Joint Declaration of Data Citation Principles.
Before re-joining Harvard in 2004, Dr. Crosas worked for six years in the educational software and biotech industries, initially as a software developer, and subsequently as director of the software development team. She contributed to the development of lab information management systems (LIMS) for SNP discovery and genotyping and mass spectrometry. Before that, she spent six years at the Harvard-Smithsonian Center for Astrophysics, first as a pre-doctoral fellow for her Ph.D. in Astrophysics from Rice University, and later as a post-doctoral fellow, researcher, and software engineer with the Radioastronomy division. There she worked on Monte Carlo simulations of radiative transfer in evolved stars and contributed to the software for the Submillimeter Array interferometer. She earned a B.S. in Physics from the Universitat de Barcelona, Spain.
More information
Practical information
- General public
- Free
- This event is internal
Contact
- Host: Christine Choirat - Swiss Data Science Center