<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Moving Forward with Ecological Informatics and Reproducibility</title>
	<atom:link href="http://www.esa.org/esablog/research/moving-forward-with-ecological-informatics-and-reproducibility/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.esa.org/esablog/research/moving-forward-with-ecological-informatics-and-reproducibility/</link>
	<description>Ecological science, news, and policy from the Ecological Society of America</description>
	<lastBuildDate>Wed, 19 Jun 2013 14:42:43 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.5.1</generator>
	<item>
		<title>By: Henry Walker</title>
		<link>http://www.esa.org/esablog/research/moving-forward-with-ecological-informatics-and-reproducibility/comment-page-1/#comment-10207</link>
		<dc:creator>Henry Walker</dc:creator>
		<pubDate>Tue, 10 Jul 2007 18:17:48 +0000</pubDate>
		<guid isPermaLink="false">http://www.esa.org/esablog/?p=62#comment-10207</guid>
		<description><![CDATA[Dr Hollister:   With a bit more participation this could become a very interesting Blog.   I&#039;m sure there are interesting examples of Ecological Informatics and Reproducible Research out there.

In addition to KNB, EML, Kepler and Analytical Webs, there is the basic business of referencing and sharing data.  

I&#039;ve recently come accoss &quot;Dataverse Network&quot;  G. King 2007. who points out the possibility of citing the data used in a paper using a Universal Numeric Fingerprint (UNF). The R package  UNF by Micah Altman computes a UNF based on the data. People can then search for the UNF if they want to obtain the identical data set used in a paper. even if it moves to a different URL.   After downloading a dataset, one can regenerate the UNF to be sure it is the same data used in a paper.   

Recent open source R packages also help make it possible to share data, script, and even cashed R computations: Eckel and Peng (2006), Peng (2007), Peng and Eckel (2007)   

Approaches involving UNFs for data, the use of open source script, and cashed computations for the analysis, and reproduction of published figures and tables will  facilitate : 
(1) more critical peer review of research results, 
(2) technology transfer of computation methods that others can adapt, and 
(3) a shift from advertisment or research results and advocacy arguments over alternative interpretations data, to quantitative weight-of-evidence approaches based on Information Theory , Burnham and Anderson (2002). 

We can make more credible advances in ecology and ecoinformatincs, based on (1) more effective data sharing, and (2) the adoption of reproducible research approaches. 

References: 

Gary King 2007  Dataverse Network. http://gking.harvard.edu/talks/dvn-nsfP.pdf  &amp; @ http://thedata.org/index.html    

Sandrah P. Eckel &amp; Roger Peng 2006
INTERACTING WITH LOCAL AND REMOTE DATA RESPOSITORIES USING THE stashR PACKAGE.  John Hopkins Working Paper 127. http://www.bepress.com/cgi/viewcontent.cgi?article=1127&amp;context=jhubiostat

Roger Peng 2007. A REPRODUCIBLE RESEARCH TOOLKIT
FOR R.  John Hopkins Working Paper 142. A REPRODUCIBLE RESEARCH TOOLKIT FOR R

Roger Peng and Sandrah P. Eckel 2007. DISTRIBUTED REPRODUCIBLE  RESEARCH USING CACHED
COMPUTATIONS John Hopkins Working Paper  147. http://www.bepress.com/cgi/viewcontent.cgi?article=1148&amp;context=jhubiostat

Burnham and Anderson (2002)  Model Selection and Multi-Model Inference. Springer. 

-------------------------------------------------

Henry A. Walker, PhD
EPA ORD NHEERL Atlantic Ecology Division
Narragansett, R.I.   02882]]></description>
		<content:encoded><![CDATA[<p>Dr Hollister:   With a bit more participation this could become a very interesting Blog.   I&#8217;m sure there are interesting examples of Ecological Informatics and Reproducible Research out there.</p>
<p>In addition to KNB, EML, Kepler and Analytical Webs, there is the basic business of referencing and sharing data.  </p>
<p>I&#8217;ve recently come accoss &#8220;Dataverse Network&#8221;  G. King 2007. who points out the possibility of citing the data used in a paper using a Universal Numeric Fingerprint (UNF). The R package  UNF by Micah Altman computes a UNF based on the data. People can then search for the UNF if they want to obtain the identical data set used in a paper. even if it moves to a different URL.   After downloading a dataset, one can regenerate the UNF to be sure it is the same data used in a paper.   </p>
<p>Recent open source R packages also help make it possible to share data, script, and even cashed R computations: Eckel and Peng (2006), Peng (2007), Peng and Eckel (2007)   </p>
<p>Approaches involving UNFs for data, the use of open source script, and cashed computations for the analysis, and reproduction of published figures and tables will  facilitate :<br />
(1) more critical peer review of research results,<br />
(2) technology transfer of computation methods that others can adapt, and<br />
(3) a shift from advertisment or research results and advocacy arguments over alternative interpretations data, to quantitative weight-of-evidence approaches based on Information Theory , Burnham and Anderson (2002). </p>
<p>We can make more credible advances in ecology and ecoinformatincs, based on (1) more effective data sharing, and (2) the adoption of reproducible research approaches. </p>
<p>References: </p>
<p>Gary King 2007  Dataverse Network. <a href="http://gking.harvard.edu/talks/dvn-nsfP.pdf" rel="nofollow">http://gking.harvard.edu/talks/dvn-nsfP.pdf</a>  &amp; @ <a href="http://thedata.org/index.html" rel="nofollow">http://thedata.org/index.html</a>    </p>
<p>Sandrah P. Eckel &amp; Roger Peng 2006<br />
INTERACTING WITH LOCAL AND REMOTE DATA RESPOSITORIES USING THE stashR PACKAGE.  John Hopkins Working Paper 127. <a href="http://www.bepress.com/cgi/viewcontent.cgi?article=1127&#038;context=jhubiostat" rel="nofollow">http://www.bepress.com/cgi/viewcontent.cgi?article=1127&#038;context=jhubiostat</a></p>
<p>Roger Peng 2007. A REPRODUCIBLE RESEARCH TOOLKIT<br />
FOR R.  John Hopkins Working Paper 142. A REPRODUCIBLE RESEARCH TOOLKIT FOR R</p>
<p>Roger Peng and Sandrah P. Eckel 2007. DISTRIBUTED REPRODUCIBLE  RESEARCH USING CACHED<br />
COMPUTATIONS John Hopkins Working Paper  147. <a href="http://www.bepress.com/cgi/viewcontent.cgi?article=1148&#038;context=jhubiostat" rel="nofollow">http://www.bepress.com/cgi/viewcontent.cgi?article=1148&#038;context=jhubiostat</a></p>
<p>Burnham and Anderson (2002)  Model Selection and Multi-Model Inference. Springer. </p>
<p>&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;-</p>
<p>Henry A. Walker, PhD<br />
EPA ORD NHEERL Atlantic Ecology Division<br />
Narragansett, R.I.   02882</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Henry Walker</title>
		<link>http://www.esa.org/esablog/research/moving-forward-with-ecological-informatics-and-reproducibility/comment-page-1/#comment-10019</link>
		<dc:creator>Henry Walker</dc:creator>
		<pubDate>Tue, 03 Jul 2007 14:00:08 +0000</pubDate>
		<guid isPermaLink="false">http://www.esa.org/esablog/?p=62#comment-10019</guid>
		<description><![CDATA[Scientists in diverse fields of study are increasing recognizing the value of transparency, reproducibility.  By making data available and computational methods transparent and reproducible, one can anticipate more rapid research advancements. It makes it possible for peer reviewers to examine the data and computations in more detail during the review process, reducing the likelihood of publication of fraudulent results, a journal&#039;s worst nightmare (Laine et al, 2007).   The approach is gaining traction in the biomedical field.  In addition to researchers learning the practice, for the approach to succeed more broadly major journals will need to embrace it.

Henry A. Walker  U.S. EPA, National Health and Ecological Effects Research Laboratory, Atlantic Ecology Division, Narragansett, R.I. 

Christine Laine, et al 2007. Reproducible Research: Moving toward Research the Public Can Really Trust.   Annals of Internal Medicine.  Volume 146 Issue 6 &#124; Pages 450-453 http://www.annals.org/cgi/content/full/146/6/450.]]></description>
		<content:encoded><![CDATA[<p>Scientists in diverse fields of study are increasing recognizing the value of transparency, reproducibility.  By making data available and computational methods transparent and reproducible, one can anticipate more rapid research advancements. It makes it possible for peer reviewers to examine the data and computations in more detail during the review process, reducing the likelihood of publication of fraudulent results, a journal&#8217;s worst nightmare (Laine et al, 2007).   The approach is gaining traction in the biomedical field.  In addition to researchers learning the practice, for the approach to succeed more broadly major journals will need to embrace it.</p>
<p>Henry A. Walker  U.S. EPA, National Health and Ecological Effects Research Laboratory, Atlantic Ecology Division, Narragansett, R.I. </p>
<p>Christine Laine, et al 2007. Reproducible Research: Moving toward Research the Public Can Really Trust.   Annals of Internal Medicine.  Volume 146 Issue 6 | Pages 450-453 <a href="http://www.annals.org/cgi/content/full/146/6/450" rel="nofollow">http://www.annals.org/cgi/content/full/146/6/450</a>.</p>
]]></content:encoded>
	</item>
</channel>
</rss>
