An analysis of characteristics and structures embedded in data papers: a preliminary study

Ya-Ning Chen



Research data, or datasets, can be regarded as a catalyst for new research when existing data are repurposed or combined, and funding institutions now request that grant applicants include a data management plan in their research project proposals. In addition to the metadata approach, data papers may mirror the scientific publication model as an alternative means of describing and managing research data. However, there is no common standard for data papers across communities. This study aimed to build a common structural framework for investigating the characteristics and structures embedded in the content of data papers, using a content analysis approach; 26 data journals from 16 publishers were selected as subjects. The study proposes a common framework and elaborates the concept map of Candela et al. (2015) into more concrete components for the structure of data papers.


data papers; research data; datasets; research data management




Akers, Katherine. 2012. "Data journals: Incentivizing research data dissemination." CLIR Blog, 12 December. Accessed November 19, 2015.

Akers, Katherine. 2014. "A growing list of data journals." Data@MLibrary Blog, 9 May. Accessed January 7, 2015.

Atici, Levent, Sarah Whitcher Kansa, Justin Lev-Tov, and Eric C. Kansa. 2013. "Other people's data: A demonstration of the imperative of publishing primary data." Journal of Archaeological Method and Theory 20, 4: 663-681.

Borgman, Christine L., Jillian C. Wallis, Matthew S. Mayernik, and Alberto Pepe. 2007. "Drowning in data: Digital library architecture to support scientific use of embedded sensor networks." In Proceedings of the 7th ACM/IEEE-CS joint conference on Digital libraries, 269-277. New York, NY: ACM.

Breure, Leen. 2014. "Enhanced data journal: Next generation science." E-data & research, Special Issue. Accessed November 19, 2015.

Callaghan, Sarah, Jonathan Tedds, Rebecca Lawrence, Fiona Murphy, Timothy Roberts, and Will Wilcox. 2014. "Cross-linking between journal publications and data repositories: A selection of examples." International Journal of Digital Curation 9, 1: 164-75.

Callaghan, Sarah, Steve Donegan, Sam Pepler, Mark Thorley, Nathan Cunningham, Peter Kirsch, Linda Ault, Patrick Bell, Rod Bowie, Adam Leadbetter, Gwen Moncoiffé, Kate Harrison, Ben Smith-Haddon, Anita Weatherby, and D. Wright. 2012. "Making data a first class scientific output: Data citation and publication by NERC's Environmental Data Centres." International Journal of Digital Curation 7, 1: 107-13. doi: 10.2218/ijdc.v7i1.218

Candela, Leonardo, Donatella Castelli, Paolo Manghi, and Alice Tani. 2015. "Data journals: A survey." Journal of the Association for Information Science and Technology 66, 9: 1747-62.

Chao, Tiffany C. 2015. "Mapping methods metadata for research data." International Journal of Digital Curation 10, 1: 82-94.

Chavan, Vishwas, and Lyubomir Penev. 2011. "The data paper: A mechanism to incentivize data publishing in biodiversity science." BMC Bioinformatics 12, S15: S2. doi: 10.1186/1471-2105-12-S15-S2

Costello, Mark J. 2009. "Motivating online publication of data." BioScience 59, 5: 418-427. doi: 10.1525/bio.2009.59.5.9

De Schutter, Erik, Giorgio A. Ascoli, and David N. Kennedy. 2009. "Review of papers describing neuroinformatics software." Neuroinformatics 7, 4: 211-212. doi: 10.1007/s12021-009-9058-x

Gorgolewski, Krzysztof J., Daniel S. Margulies, and Michael P. Milham. 2013. "Making data sharing count: A publication-based solution." Frontiers in Neuroscience 7, Article 9. doi:

Gray, Stephen. 2015. "Case study: Publishing a data paper." Accessed December 28, 2015.

Kansa, Eric C., and Sarah Whitcher Kansa. 2013. "We all know that a 14 is a sheep: Data publication and professionalism in archaeological communication." Journal of Eastern Mediterranean Archaeology and Heritage Studies 1, 1: 1-14. Accessed November 19, 2015. doi:

Kennedy, David N., Giorgio A. Ascoli, and Erik De Schutter. 2011. "Next steps in data publishing." Neuroinformatics 9, 4: 317-320.

Kratz, John, and Carly Strasser. 2014. "Data publication consensus and controversies." Version 3. F1000Research 3, 94: 1-21. doi: 10.12688/f1000research.3979.3

Niu, Jinfang, and Margaret Hedstrom. 2008. "Documentation evaluation model for social science data." Proceedings of the American Society for Information Science and Technology 45, 1: 1-11. doi: 10.1002/meet.2008.1450450223

Penev, Lyubomir, Daniel Mietchen, Vishwas Chavan, Gregor Hagedorn, David Remsen, Vincent Smith, and David Shotton. 2015. "Pensoft data publishing policies and guidelines for biodiversity data." Accessed October 06, 2015. http://

Rees, Jonathan. 2010. "Recommendations for independent scholarly publication of data sets." Accessed October 06, 2015.

Smith, Vincent S. 2009. "Data publication: Towards a database of everything." BMC Research Notes 2, 113: 1-3. doi: 10.1186/1756-0500-2-113

Strasser, Carly. 2015. "Research data management." Accessed December 25, 2015.



Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 International License.

Libellarium (Online). ISSN 1846-9213 © 2008

