Some directorates offer more specific guidelines. Engineering (ENG), for example, includes “analyzed” data in that directorate’s policy, meaning those data that are published in articles, dissertations, or supplementary materials. Note that figures within a publication aren’t sufficient – tables of the numbers used to create figures should be made available. In the guidance we’ve seen so far, sharing raw data is not typically required. Where no specific guidance is available, we recommend researchers keep in mind two things when deciding which data to share.
Some directorates offer more specific guidelines. Those we’ve reviewed typically require researchers to share data within 2-3 years of collection, the end of the award, or publication, so those are some benchmarks to consider if your directorate or the solicitation does not offer more specific guidelines.
Some directorates offer more specific guidelines (ENG specifies data should be kept at least for three years). If you are depositing your data in a data center or archive with a long-term commitment to providing access to the data, then you should simply state this in your plan. If you plan to host the data yourself or pay a service provider to host it for you, then you should specify a time period that is reasonable and that your budget can sustain, and explain that in your data management plan.
The NSF does not maintain a general purpose data repository, although some directorates and programs recommend certain repositories (see for example the data policy of the Division of Ocean Sciences). If no recommendations are provided, some of your options include disciplinary repositories, Cornell’s institutional repository (eCommons, for smaller data sets), the CISER data archive, publishers (for data related to publications), or a custom solution for your project. Please contact email@example.com for specific recommendations
That depends on what you mean. Data in papers in tables and figures is not really considered to be an adequate means of sharing. If, on the other hand, you already publish in journals which require data sets related to papers to be deposited in a data center and manuscripts to include citations or accession numbers for those data sets, that would be a reasonable data sharing plan for those data sets. More and more journals are beginning to accept data sets as supplementary materials and/or to require that authors make their data available.
NSF does allow for costs associated with data management (typically line G2, with an explanation in the budget justification). If you are depositing your data in a data center or archive, then your data will probably be available for the long term. Most data centers or repositories either accept data free of charge (if it is within their collection scope) or charge a one-time fee at the time of deposit, making budgeting fairly straightforward. Currently, Cornell doesn’t offer any services which allow up-front payment for longer term storage, although the RDMSG is aware of the need for such a service and is considering different options.
No, although some other agencies and institutions do provide some (see links below), and some data repositories or centers provide language for data management plans specifying the use of that repository for data access and preservation. We’re less inclined to provide sample plans until we see what kinds of plans are reviewed favorably. If your data management plan is receives favorable reviews and you are willing to share it with colleagues, please let us know. Sample data management plans: