Harvard Data Commons Project

An ecosystem of integrated tools for research data

Mission & Vision

The vision of the Harvard Data Commons is to improve the researcher experience by automating the flow of research data from research computing environments to management, publication, discovery, and preservation environments. This will increase the ability to meet sponsor requirements for data integrity, data provenance, and the reproducibility of research.

Project Background

Since 2019, a collaborative team from divisions and schools across Harvard has been pursuing the idea of a Data Commons to support the lifecycle of research at Harvard. We are currently building a small proof of concept, the “Harvard Data Commons Minimum Viable Product (MVP)”, using internal capital funding to start connecting key systems in the research data lifecycle.

The Harvard Data Commons MVP began work in July 2021 and is pursuing three primary objectives:

Objective 1

Automating the technical pipeline between the research computing infrastructures and Dataverse.

Objective 2

Enhancing Dataverse to support machine-actionable workflows of various types.

Objective 3

Automating connections between key research and library systems used for archiving and publication.

In addition to the 8 members of the Harvard Data Commons leadership team, 24 staff from across Harvard are contributing to the effort, making this an inspiring and truly cross-institutional collaboration.

Resources

Briefings & Presentations

Harvard Data Commons Project Briefing

HUIT Tech Talk by Mercè Crosas

Workshop Materials

Harvard Data Commons Workshop Slides

Workshop Recording, Transcript, and Zoom Chat

People

Leadership Team

Bill Barnett

Senior Director for Research Computing | Harvard Medical School

Paul DiBello

Senior Director, Research Computing Services | Harvard Business School

Emre Keskin

University Research Data Officer | Harvard University Information Technology (HUIT)

Stefano Iacus

Director of Data Science and Product Research | The Institute for Quantitative Social Science

Ardys Kozbial

Assistant University Librarian for Content Strategies and Associate Librarian for Faculty of Arts and Sciences | Harvard Library

Stuart Snydman

Associate University Librarian and Managing Director Library Technology Services | Harvard University Information Technology (HUIT)

Len Wisniewski

Director of Engineering | The Institute for Quantitative Social Science

Scott Yockel

University Research Computing Officer | Harvard University Information Technology (HUIT)

Project Sponsorship, Leadership, and Objective Teams