Prepare for launch in 3… 2… 1…

In about two weeks we will launch the new DMPTool on Tues, 27 Feb. The much-anticipated third version of the tool represents an exciting next step in what has always been a community-driven project. We’ve now successfully merged the primary US- and UK-based data management planning tools into a single codebase (DMP Roadmap): the engine under the new DMPTool hood.

Why are we doing this?

A little background for those who haven’t been following along with our codevelopment journey: in 2016 the University of California Curation Center (UC3) decided to join forces with the Digital Curation Centre (DCC) to maintain a single open-source platform for DMPs. We took this action to extend our reach beyond national boundaries and move best practices forward, with a lofty goal to begin making DMPs machine actionable (i.e., useful for managing data). We’ll continue to run our own branded services (DMPTool, DMPonline, DMPTuuli, DMPMelbourne) on the shared codebase, and incorporate partners in Canada, Argentina, South Africa, and throughout Europe who are already running their own instances (full list).

In parallel with our co-development efforts we’ve been making the rounds of Research Data Alliance, Force11, IDCC, and disciplinary meetings to collect use cases for machine-actionable DMPs (details here) and help define common standards (RDA Working Group; just posted pre-print for 10 Simple Rules for Machine-Actionable DMPs). We also got an NSF EAGER grant so we can begin prototyping muy pronto.

The new version of the DMPTool will enable us to implement and test machine-actionable things in a truly global open science ecosystem. Successful approaches to making DMPs a more useful exercise will require input from and adoption by many stakeholders so we look forward to working with our existing DMP Roadmap community (an estimated 50k+ users, 400+ participating institutions, and a growing list of funder contacts across the globe) and welcoming others into the fold!

Preparing for Launch

To help DMPTool administrators prepare themselves and their institutional users for the upcoming launch, we will host a webinar on:

Mon, 26 Feb 2018, 9-10 AM Pacific Time
Zoom link (recording on Vimeo; Q&A and slides)

By that time we’ll have a new user guide for administrators, a new Quick Start Guide for researchers, and refreshed promo materials. Everyone will have seamless access to their existing DMPTool accounts, just through a new user interface that looks and feels more like DMPonline (spoiler alert: we made it blue). And one of the most exciting things about the new tool is that it contains 34 freshly updated funder templates with links to additional funder guidance.

Stay tuned to the DMPTool communication channels in the coming weeks (blog, admin email list, Twitter) for more news and updates. We look forward to seeing you at the webinar and welcome your feedback at any point.

On the right track(s) – DCC release draws nigh

blog post by Sarah Jones

Eurostar photo

Eurostar from Flickr by red hand records CC-BY-ND

Preliminary DMPRoadmap out to test

We’ve made a major breakthrough this month, getting a preliminary version of the DMPRoadmap code out to test on DMPonline, DMPTuuli and DMPMelbourne. This has taken longer than expected but there’s a lot to look forward to in the new code. The first major difference users will notice is that the tool is now lightning quick. This is thanks to major refactoring to optimise the code and improve performance and scalability. We have also reworked the plan creation wizard, added multi-lingual support, ORCID authentication for user profiles, on/off switches for guidance, and improved admin controls to allow organisations to upload their own logos and assign admin rights within their institutions. We will run a test period for the next 1-2 weeks and then move this into production for DCC-hosted services.

Work also continues on additional features needed to enable the DMPTool team to migrate to the DMPRoadmap codebase. This includes additional enhancements to existing features, adding a statistics dashboard, email notifications dashboard, enabling a public DMP library, template export, creating plans and templates from existing ones, and flagging “test” plans (see the Roadmap to MVP on the wiki to track our progress). We anticipate this work will be finished in August and the DMPTool will migrate over the summer. When we issue the full release we’ll also provide a migration path and documentation so those running instances of DMPonline can join us in the DMPRoadmap collaboration.

Machine-actionable DMPs

Stephanie and Sarah are also continuing to gather requirements for machine-actionable DMPs. Sarah ran a DMP workshop in Milan last month where we considered what tools and systems need to connect with DMPs in an institutional context, and Stephanie has been working with Purdue University and UCSD to map out the institutional landscape. The goal is to produce maps/diagrams for two specific institutions and extend the exercise to others to capture more details about practices, workflows, and systems. All the slides and exercise from the DMP workshop in Milan are on the Zenodo RDM community collection, and we’ll be sharing a write-up of our institutional mapping in due course. I’m keen to replicate the exercise Stephanie has been doing with some UK unis, so if you want to get involved, drop me a line. We have also been discussing potential pilot projects with the NSF and Wellcome Trust, and have seen the DMP standards and publishing working groups proposed at the last RDA plenary host their initial calls. Case statements will be out for comment soon – stay tuned for more!

We have also been discussing DMP services with the University of Queensland in Australia who are doing some great work in this area, and will be speaking with BioSharing later this month about connecting up so we can start to trial some of our machine-actionable DMP plans.

The travelling roadshow

Our extended network has also been helping us to disseminate DMPRoadmap news. Sophie Hou of NCAR (National Center for Atmospheric Research) took our DMP poster to the USGS Community for Data Integration meeting (Denver, CO 16–19 May) and Sherry Lake will display it next at the Dataverse community meeting (Cambridge, MA 14-16 June). We’re starting an inclusive sisterhood of the travelling maDMPs poster. Display the poster, take a picture, and go into the Hall of Fame! Robin Rice and Josh Finnell have also been part of the street team taking flyers to various conferences on our behalf. If you would like a publicity pack, Stephanie will send out stateside and Sarah will share through the UK and Europe. Just email us your contact details and we’ll send you materials. The next events we’ll be at are the Jisc Research Data Network in York, the EUDAT and CODATA summer schools, the DataONE Users Group and Earth Science Information Partners meetings (Bloomington, IN), the American Library Association Annual Conference (Chicago, IL), and the Ecological Society of America meeting (Portland, OR) . Catch up with us there!

RDA-DMP movings and shakings

RDA Plenary 9

We had another productive gathering of #ActiveDMPs enthusiasts at the Research Data Alliance (RDA) plenary meeting in Barcelona (5-7 Apr). Just prior to the meeting we finished distilling all of the community’s wonderful ideas for machine-actionable DMP use cases into a white paper that’s now available in RIO Journal. Following on the priorities outlined in the white paper, the RDA Active DMPs Interest Group session focused on establishing working groups to carry things forward. There were 100+ participants packed into the session, both physically and virtually, representing a broad range of stakeholders and national contexts and many volunteered to contribute to five proposed working groups (meeting notes here):

  • DMP common standards: define a standard for expression of machine-readable and -actionable DMPs
  • Exposing DMPs: develop use cases, workflows, and guidelines to support the publication of DMPs via journals, repositories, or other routes to making them open
  • Domain/infrastructure specialization: explore disciplinary tailoring and the collection of specific information needed to support service requests and use of domain infrastructure
  • Funder liaison: engage with funders, support DMP review ideas, and develop specific use cases for their context
  • Software management plans: explore the remit of DMPs and inclusion of different output types e.g. software and workflows too

The first two groups are already busy drafting case statements. And just a note about the term “exposing” DMPs: everyone embraced using this term to describe sharing, publishing, depositing, etc. activities that result in DMPs becoming open, searchable, useful documents (also highlighted in a recent report on DMPs from the University of Michigan by Jake Carlson). If you want to get involved, you can subscribe to the RDA Active DMPs Interest Group mailing list and connect with these distributed, international efforts.

Another way to engage is by commenting on recently submitted Horizon2020 DMPs exposed on the European Commission website (unfortunately, the commenting period is closed here and here — but one remains open until 15 May).

DMPRoadmap update

Back at the DMPRoadmap ranch, we’re busy working toward our MVP (development roadmap and other documentation available on the GitHub wiki). The MVP represents the merging of our two tools with some new enhancements (e.g., internationalization) and UX contributions to improve usability (e.g., redesign of the create plan workflow) and accessibility. We’ve been working through fluctuating developer resources and will update/confirm the estimated timelines for migrating to the new system in the coming weeks; current estimates are end of May for DMPonline and end of July for DMPTool. Some excellent news is that Bhavi Vedula, a seasoned contract developer for UC3, is joining the team to facilitate the DMPTool migration and help get us to the finish line. Welcome Bhavi!

In parallel, we’re beginning to model some active DMP pilot projects to inform our work on the new system and define future enhancements. The pilots are also intertwined with the RDA working group activities, with overlapping emphases on institutional and repository use cases. We will begin implementing use cases derived from these pilots post-MVP to test the potential for making DMPs active and actionable. More details forthcoming…

Upcoming events

The next scheduled stop on our traveling roadshow for active DMPs is the RDA Plenary 10 meeting in Montreal (19–21 Sept 2017), where working groups will provide progress updates. We’re also actively coordinating between the RDA Active DMPs IG and the FORCE11 FAIR DMPs group to avoid duplication of effort. So there will likely be active/FAIR/machine-actionable DMP activities at the next FORCE11 meeting in Berlin (25–27 Oct)—stay tuned for details.

And there are plenty of other opportunities to maintain momentum, with upcoming meetings and burgeoning international efforts galore. We’d love to hear from you if you’re planning your own active DMP things and/or discover anything new so we can continue connecting all the dots. To support this effort, we registered a new Twitter handle @ActiveDMPs and encourage the use of the #ActiveDMPs hashtag.

Until next time.

Active, actionable DMPs

IDCC workshop participants

Roadmap project IDCC debriefing
We had a spectacularly productive IDCC last month thanks to everyone who participated in the various meetings and events focused on the DMPRoadmap project and machine-actionable DMPs. Thank you, thank you! Sarah has since taken the traveling road show onward to a meeting at CERN (slides) and Stephanie discussed institutional infrastructure for DMPs at a meeting of California data librarians. In the midst of travels we’ve been wrangling the mountain of inputs into a draft white paper on machine-actionable DMP use cases. For now, we offer a preview of the report and an invitation to keep the momentum going at the RDA plenary in Barcelona, which is just around the corner (5–7 April).

The white paper represents the outputs of the IDCC workshop: ”A postcard from the future: Tools and services from a perfect DMP world” (slides, etc. here). We convened 47 participants from 16 countries representing funders, educational institutions, data service providers, and the research community. There was so much interest in the topic that we added an overflow session to accommodate everyone who wanted to weigh in. We’re gratified to discover how many folks have been thinking about DMPs as much as we have, and aim to continue synthesizing your stakeholder-balanced, community-driven solutions for improving the data management enterprise.

mind map exercise

Solving DMPs with rainbow stickies

The contributions from IDCC align with previously gathered information and drive the agenda summarized here. Consensus emerged to:

  • Focus on integrating existing systems (Interoperability was top-voted topic for the workshop)
  • Integrate DMPs into active research workflows to emphasize benefits of planning to researchers, but keep in mind that funders still drive demand.
  • Consider the potential of persistent identifiers (ORCID iDs, Crossref Funder Registry, etc.)
  • Explore ways to offer tailored, discipline-specific guidance at appropriate points

Next steps…
All stakeholders expressed a need for common standards and protocols to enable information to flow between plans and systems in a standardized manner. This would support APIs to both read and write to DMPs, as well as creating a framework for the development of new use cases over time. Therefore, it is a top priority to define a minimum data model with a core set of elements for DMPs. The model should incorporate existing standards and avoid inventing something new; it could potentially be based on a template structure and/or use the DMPRoadmap themes. Additional requirements in this area include that it:

  • Must make use of existing vocabularies and ontologies whenever possible
  • Must employ common exchange protocols (e.g., json)
  • Must be open to support new data types, models, and descriptions
  • Should be available in a format that can be rendered for human use
  • Should accommodate versioning to support actively updated DMPs

At the RDA 9th Plenary meeting in Barcelona during the Active DMPs IG session (6 April, 9:30-11:00) we propose establishing a working group to develop standards for DMPs. This isn’t our particular area of expertise so once again we’re relying on all of you to help steer the DMP ship. We hope that additional working groups might spin out from the session and invite your ideas and contributions (e.g., publishing DMPs).

…and beyond
The DCC and UC3 will continue to pursue international collaborations related to DMPRoadmap through pilot projects. As part of an iterative process for developing, implementing, testing, and refining these use cases we’re beginning to model domain-specific and institutional pilot projects to determine what information can realistically move between stakeholders, systems, and research workflows. We have some existing funds to support a subset of this work and are actively seeking additional sources of funding to carry the project forward. In addition to technical solutions, these projects will expand our capacity to connect with key stakeholders, with particular emphasis on addressing the needs and practices of researchers and funders. Stay tuned for more details in the coming weeks and months.

You can also track our progress and find oodles of documentation on the DMPRoadmap GitHub wiki.

DMPTool and RDM consultants support humanities grant submission

The following is a guest post by Quinn Dombrowski of the UC Berkeley RDM Program. The original is available at http://researchdata.berkeley.edu/stories

sarcophagus photo

When preparing a proposal to a funding agency, researchers focus on the grant narrative, framing their work in the most innovative and compelling way possible. Crafting a narrative that can stand as a surrogate for a scholar’s research for reviewers to evaluate is itself a time-consuming process; for the National Endowment for the Humanities (NEH) Digital Humanities grants, it’s only one of nine components of the application. Grant proposals must include a data management plan, a document that Assistant Professor of Near Eastern Studies Rita Lucarelli had not encountered prior to preparing her grant submission last fall. “I found the instructions to be clear, but I hadn’t thought about those issues before,” Professor Lucarelli said in a recent Research Data Management (RDM) workshop on DMPTool for the humanities.

The short version of the NEH guidelines states:

Prepare a data management plan for your project (not to exceed two pages). The members of your project team should consult this document throughout the life of the project and beyond the grant period. The plan should describe how your project team will manage and disseminate data generated or collected by the project. For example, projects in this category may generate data such as software code, algorithms, digital tools, reports, articles, research notes, or websites.

In addition, proposals of the type Professor Lucarelli was submitting require a sustainability plan. Following the basic prompts provided by the NEH, Professor Lucarelli drafted a brief paragraph for the data management plan and the sustainability plan, and sent the materials to the RDM team for review.

Starting early proved to be key. By having a draft done two months in advance, Lucarelli was able to send her proposal to the NEH for feedback, where she learned that her proposal — to fund a workshop, and development of a portal that would bring together a number of Egyptology projects that are building 3D models — would be eligible for a “level 2” grant, but not a “level 3” grant as Professor Lucarelli originally drafted: “level 3” grants are intended for projects that already had a finished prototype. “It’s important to figure out what level grant you’re applying to early,” Lucarelli reflected. “Deciding on that sooner would have saved me from drafting the sustainability plan that wasn’t applicable to the grant I ended up applying for.”

Involving the RDM team in the process early also allowed Lucarelli to work with an RDM consultant to refine her data management plan. Rick Jaffe, an RDM consultant, met with Lucarelli and talked through the scope and nature of the project she was proposing. After their first meeting, Jaffe logged into DMPTool, the Data Management Planning tool developed and supported by the California Digital Library (CDL), which provides templates and additional guidance for preparing data management plans for most major funding agencies. He pulled up the template for the NEH, and began to organize and expand upon his notes from the meeting, using the headers and prompts suggested by the DMPTool. Jaffe used the DMPTool’s private sharing function to make the draft data management plan visible and editable by Lucarelli and her collaborator at the University of Memphis, Joshua Roberson.

Drafting a data management plan in the DMPTool interface is convenient because it juxtaposes the questions and guidance for each section with a text box where you can write your responses. At a certain point in the process, it may be easier to download your draft data management plan and move it into Microsoft Word for editing. While it may be tempting to answer each of the questions in the prompt at great length, the overall two-page limitation forces grant applicants to be brief and specific. Quinn Dombrowski, another RDM consultant, worked with Lucarelli on winnowing the six-page version drafted in DMPTool into the required two pages.

“Even if I don’t get this grant, it was hugely valuable to prepare a data management plan,” explained Lucarelli. “When you’re working a new project, you never think about things like what will happen if you’re not involved with the project anymore — it’s hard to even imagine that! But a data management plan makes you think through all the details about what data you’ll actually get in your project, how you’ll store it, and how you’ll manage it in the long term. I was lucky to be working with a collaborator who knew some of the technical details about how to store audio files, because I would have been at a loss, myself. And it was very helpful to be able to sit down with RDM consultants who can help you think through all the issues involved in running a project like this. I feel much better prepared now for the next time I put together a grant application, whether or not a data management plan is required.”

Hang A DMPTool Poster!

In addition to working hard on the new version of the DMPTool (to be released in May), we are also working on outreach and education materials that promote the use of the DMPTool. Our latest addition to these materials is a generic poster about the DMPTool, including information about what’s to come in the new version. You can download a PDF version, or a PPTX version that you can customize for your institution. We plan on updating this poster when the new version of the DMPTool is released, so keep an eye out!

“DMPTool: Expert Resources & Support for Data Management Planning”. 30″x38″ poster

Slide1
Posters available as:

  • PDF (cannot be customized)
  • PPTX (can be customized)

DMPTool adds 100th institution!

From Flickr by Anvica

From Flickr by Anvica

We are pleased to announce that as of September 23rd, with the addition of Baylor University100 institutions have taken the step of customizing the DMPTool to provide local guidance and resources for their researchers. Check out the full list of participating institutions.

While institutions do not have to customize the DMPTool for their researchers to take advantage of the tool, taking that step can provide many benefits to their researchers as well as their data management and stewardship programs. These include:

  • Integration with Shibboleth so that researchers can use their institutional credentials;
  • Ability to add help text and links to institutional resources;
  • Ability to add contact information for the units that support data management; and
  • Ability to add text that can be copied into a data management plan.

With the release of the new and improved DMPTool in early 2014, there will be even more functionality for those institutions who integrate with and customize the tool. These features include:

  • An interface to manage all of the customizations directly;
  • Improved institutional branding;
  • The ability to add institution specific data management requirements;
  • Reviews of DMPs on a case by case basis or as a required step for all researchers; and
  • Multiple roles for administrative users including as editors of requirements and reviewers of DMPs, so that you can have appropriate teams working on the DMPTool.

If you are interested in talking to us more about this process, please contact us. It is a straightforward process that we are happy to walk you through.

Report on DMPTool at ESA 2013

Last week in Minneapolis, about 4,000 ecologists got together to geek out and enjoy the midwest for eight days. The DMPTool had a couple of appearances in the course of this 2013 Ecological Society of America Meeting– a workshop on managing ecological data and a session on data management planning and the DMPTool. It was also mentioned in numerous presentations about the DataONE Investigator Toolkit, of which the DMPTool is a part.

Here I want to briefly mention the special session on the DMPTool, which occurred on the first official day of #ESA2013. The session was 75 minutes long, and Bill Michener of DataONE and I were sharing the podium. He planned to introduce DMPs generally, followed by my explanation and demonstration of the DMPTool, including the new version of the tool due out in Winter 2013-2014.

Fifteen minutes before the presentations were due to start, the room was packed. Attendees were sitting on the floor in the aisles by 5 minutes before, and there was a nonstop trickle of attendees entering throughout the 75 minute session. The catch? We had no power.

That’s right: Bill and I were forced to talk about data management plans, demo the DMPTool, and discuss future plans to a packed room, all without a microphone, a projector, or even a chalk board. We managed to get through the session, and folks seemed to appreciate the impromptu soliloquies on data management by myself and Bill. The power came on about 5 minutes before the close of the session (of course), at which point I scurried behind the podium to show screen shots of the DMPTool. By the time I looked up from my laptop, about 60% of the audience had left. Apparently slides were not the big draw for our session.

My takeaway lessons? When giving a talk, be prepared for anything; people enjoy the element of surprise and improvisation; and researchers are dying to learn about DMPs, regardless of the potential hurdles put before them. This is most likely due to funder requirements for DMPs, but I’d like to think it also relates to my and Bill’s dulcet tones.

The slides I didn’t get to show are available on slideshare.

Screen-Shot-2013-08-12-at-8.37.16-AM

Library Outreach Update

Photo from Flickr by Glyn Lowe

Since our Kickoff Meeting at Berkeley, we’ve hit the ground running to begin working on building a foundation for our IMLS funded library outreach project. Internally we’ve been fine tuning the original IMLS meeting report,  converting it into a calendar of tasks and laying to groundwork for the success of our later objectives. We’ve already begun planning and researching for a series of educational webinars and putting together a wiki to house online resources on data management planning. These resources will stand alone as useful tools, but are also vital first steps toward accomplishing some of our larger goals.

Over the next few weeks our priority is going to be developing educational materials – continuing to assemble research for the coming webinars, putting together outreach materials and talking points, and adding to our list of useful outside resources. Many of these projects will be ongoing, with some major updates coming further down the pipeline to better assist librarians undertaking data management responsibilities.

We’re also going to be gunning for feedback as these projects develop and are finalized. If you’re interested in being involved, please leave a comment or send me an email.

Library Outreach: Call for DMPTool Guides

Hello, everyone! My name is Dan Phipps. I’m coming to the DMPTool project from UCLA’s Graduate School of Education and Information Studies. My academic focus has been on informatics, especially the preservation and curation of disaster data. Aside from digitizing maps for the UCLA Digital Libraries project, I’ve also worked at the UCLA Social Science Data Archive to help researchers better archive their data.

I’m working with California Digital Libraries as part of the IMLS funded Librarian Outreach project. Our focus is going to be specific to librarians and the role they play in the development of data management plans. While this is a relatively new hat for some librarians to wear, there is already a lot of resources from data archivists, repository institutions, grant departments and other librarians. We’re hoping to use the DMPTool as both a data management resource as well as a hub for information specialists to find useful materials.

The management of data is a major undertaking for any institution, and involves support everywhere from IT departments to individual researchers to granting offices and beyond. Librarians, by training, are uniquely suited to work within this environment – it is a field that has been focused on providing people with knowledge and support for centuries. Data management and preservation is a relatively new area of focus, but one which will be more and more important in the coming years.

One of the major goals of the Libraries Outreach project is to provide librarians with easy access to educational materials. Over the next few weeks we’ll be highlighting Libguides, wikis, webpages, and other useful online resources that have made using or teaching the DMPTool easier. If there are any references you find particularly useful, please email me your suggestions.

-Dan Phipps