Open source mark-recapture data management platform

questions concerning analysis/theory using program MARK

Post by holmbergius » Thu Feb 25, 2010 2:56 pm

Hello! I'm the programmer for the ECOCEAN Whale Shark Photo-identification Library (http://www.whaleshark.org). Our library is a Web-based, collaborative mark-recapture platform for the study of a single species (Rhincodon typus). We have been developing our web-based mark-recapture software since 2003, and along the way we have received a number of requests to apply it to other species. In an effort to allow others to use it, I am working to open source it.

I am posting to ask: would a standard mark-recapture data management platform be of use to the Phi-Dot community?

In a nutshell, our software is a Java web application that can run on a laptop or on a web server. You can use it for very small, single-researcher projects, or you can deploy it to a web server and use its built-in security system to coordinate a small or large collaborating team. For whaleshark.org, we rely on a globally distributed network of researchers to contribute data and to process it using a simple set of guidelines posted on a wiki (http://www.whaleshark.org/wiki/doku.php). Location-based security ensures that data is visible to all researchers worldwide but that permission to modify data is limited to the relevant members (e.g. Seychelles members cannot edit Maldives data without obtaining higher privileges). Because the software can be deployed to the web, it also supports public participation, which lets us collect far more data and work with more intricate models.
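To make the location-based rule concrete, here is a minimal sketch of how such a check could look. It is not the platform's actual code: the class names `Researcher` and `LocationPolicy`, the administrator flag, and the method names are all assumptions made for illustration.

```java
import java.util.Arrays;
import java.util.Collections;
import java.util.HashSet;
import java.util.Set;

/** A collaborator and the study locations they belong to (hypothetical model). */
final class Researcher {
    private final String username;
    private final boolean administrator;
    private final Set<String> locations;

    Researcher(String username, boolean administrator, String... locations) {
        this.username = username;
        this.administrator = administrator;
        this.locations = Collections.unmodifiableSet(new HashSet<>(Arrays.asList(locations)));
    }

    String username() { return username; }
    boolean isAdministrator() { return administrator; }
    Set<String> locations() { return locations; }
}

/** Hypothetical rule set: every researcher can read, only local members or administrators can write. */
final class LocationPolicy {
    private LocationPolicy() {}

    /** Data is visible to every known researcher, whatever its location. */
    static boolean canView(Researcher researcher, String recordLocation) {
        return researcher != null;
    }

    /** Editing requires membership in the record's location or higher (administrator) privileges. */
    static boolean canEdit(Researcher researcher, String recordLocation) {
        if (researcher == null) {
            return false;
        }
        return researcher.isAdministrator() || researcher.locations().contains(recordLocation);
    }
}
```

With these assumed names, a researcher created as `new Researcher("ana", false, "Seychelles")` could view a Maldives record, but `LocationPolicy.canEdit` would refuse the edit unless the administrator flag were set.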

We built our Java-based software on top of DataNucleus (http://www.datanucleus.org/), which is an object-relational mapping layer. This allows us to use many different types of data sources, from small Excel files, to relational databases, all the way up to Amazon S3. This potentially means that existing mark-recapture databases could be *relatively* easily mapped into and managed by our software. A complete list of supported datastore types is available from http://www.datanucleus.org/products/acc ... tores.html. In addition, for relational databases our basic data model is based on the Darwin Core (http://rs.tdwg.org/dwc/terms/), allowing us to easily use TapirLink (http://wiki.tdwg.org/twiki/bin/view/TAPIR/TapirLink) to expose mark-recapture data to larger biodiversity frameworks, such as GBIF (http://www.gbif.org) and OBIS (http://www.iobis.org/).
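As a rough illustration of what a Darwin Core-based data model means in practice, the sketch below maps one capture record onto a handful of Darwin Core term names. The `Encounter` class, its fields, and the choice of terms are assumptions for the example, not our actual schema; only the term names themselves (occurrenceID, basisOfRecord, scientificName, eventDate, locality, decimalLatitude, decimalLongitude) come from the Darwin Core standard.

```java
import java.util.LinkedHashMap;
import java.util.Map;

/** One capture event. Hypothetical fields, chosen only for this example. */
final class Encounter {
    final String encounterId;
    final String date;       // ISO 8601 date, e.g. "2009-04-17"
    final String locality;
    final double latitude;   // decimal degrees
    final double longitude;  // decimal degrees

    Encounter(String encounterId, String date, String locality, double latitude, double longitude) {
        this.encounterId = encounterId;
        this.date = date;
        this.locality = locality;
        this.latitude = latitude;
        this.longitude = longitude;
    }
}

/** Flattens an Encounter into Darwin Core term/value pairs. */
final class DarwinCoreMapper {
    private DarwinCoreMapper() {}

    static Map<String, String> toDarwinCore(Encounter encounter, String scientificName) {
        // LinkedHashMap keeps the terms in insertion order, which is handy for column-based export.
        Map<String, String> record = new LinkedHashMap<>();
        record.put("occurrenceID", encounter.encounterId);
        record.put("basisOfRecord", "HumanObservation");
        record.put("scientificName", scientificName);
        record.put("eventDate", encounter.date);
        record.put("locality", encounter.locality);
        record.put("decimalLatitude", Double.toString(encounter.latitude));
        record.put("decimalLongitude", Double.toString(encounter.longitude));
        return record;
    }
}
```

Keeping the stored fields close to these terms is what should make exposure through a Darwin Core-aware provider mostly a matter of configuration rather than new code.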

A list of available functionality (in a whale shark context) is available here: http://www.ecocean.org/forum/index.php/topic,309.0.html

But as a summary, the platform allows you to:

-Quickly start a new mark-recapture project with existing, easy-to-install software. You can also tailor it (Java programming required) to meet your specific project's needs.
-Access different mark-recapture datastore types (e.g. relational databases, Excel files, etc.) through a common Java API. This could allow for pluggable functionality, such as allowing the community to create and share plug-ins to support data export for specific models for use in MARK, or for customized data mining.
-Manage mark-recapture data through a Web interface and safely collaborate by relying on authentication, authorization, and auditing to control and monitor access.
-Allow the public to collect and contribute data through a web interface.
-Add computer-assisted identification techniques as they become available. In a whale shark context, we use two pattern recognition algorithms to sift through a global database and suggest matches for new data to previously identified whale sharks.
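To give a feel for the kind of export plug-in mentioned in the list above, here is a minimal sketch that turns capture records into MARK-style encounter histories (one line per individual: a string of 1s and 0s, a frequency, and a terminating semicolon, with the individual ID in a comment). The class name `EncounterHistoryExporter` and its methods are hypothetical; no such plug-in API exists yet, and the sketch assumes IDs do not contain the comment terminator `*/`.

```java
import java.util.Map;
import java.util.TreeMap;

/** Builds one encounter history per individual for a fixed number of sampling occasions. */
final class EncounterHistoryExporter {
    private final int occasions;
    // TreeMap keeps individuals sorted by ID so the output is stable between runs.
    private final Map<String, boolean[]> histories = new TreeMap<>();

    EncounterHistoryExporter(int occasions) {
        if (occasions < 1) {
            throw new IllegalArgumentException("At least one occasion is required");
        }
        this.occasions = occasions;
    }

    /** Records that an individual was captured on the given occasion (1-based). */
    void addCapture(String individualId, int occasion) {
        if (occasion < 1 || occasion > occasions) {
            throw new IllegalArgumentException("Occasion out of range: " + occasion);
        }
        histories.computeIfAbsent(individualId, id -> new boolean[occasions])[occasion - 1] = true;
    }

    /** Returns lines such as: a comment with the ID, the history, a frequency of 1, and ';'. */
    String export() {
        StringBuilder out = new StringBuilder();
        for (Map.Entry<String, boolean[]> entry : histories.entrySet()) {
            out.append("/* ").append(entry.getKey()).append(" */ ");
            for (boolean captured : entry.getValue()) {
                out.append(captured ? '1' : '0');
            }
            out.append(" 1;\n");
        }
        return out.toString();
    }
}
```

For example, with three occasions, an individual captured on occasions 1 and 3 would be written with the history `101`. Grouping variables and covariates are left out of this sketch.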

Back to my question: would an open source, Java-based, and generic mark-recapture platform be useful as a community maintained project moving forward? I welcome your feedback. If you would specifically like to try the software when it's ready, please contact me off-list at jason at whaleshark dot org.

Thanks,
Jason Holmberg
ECOCEAN Whale Shark Photo-identification Library
http://www.whaleshark.org
holmbergius
 
Posts: 5
Joined: Tue Sep 26, 2006 2:08 pm
Location: Portland, OR

Re: Open source mark-recapture data management platform

Post by jlaake » Thu Feb 25, 2010 9:32 pm

I think it might be a useful tool for those doing photo-id, but will it also work with tags, etc.? I encourage you to make it available. What may determine how much it is used is how easily users can learn the system and set it up for their own problem. regards--jeff
jlaake
 
Posts: 1480
Joined: Fri May 12, 2006 12:50 pm
Location: Escondido, CA

Re: Open source mark-recapture data management platform

Post by holmbergius » Thu Feb 25, 2010 11:44 pm

It will definitely work with tags. It has good photo management, but photos are not required to assign identity or to record a capture.

I completely agree that usability is key. As I refine the software, this is the basic installation procedure I have as a goal, focusing on a simple laptop deployment.

1. Download the Tomcat web server and install it on your laptop/server.
2. Download three WAR files from the open source repository and add them to the webapps directory of Tomcat.
3. Start Tomcat.
4. Open a web browser to http://localhost:8080/cmr

The web-based platform runs from there in its generic form (i.e. no customized graphics, etc.).
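For anyone who would rather script step 2 than copy files by hand, a small helper along these lines could do it. This is only a sketch: the `WarDeployer` class is hypothetical, the WAR file names are not fixed yet, and the only thing it relies on from Tomcat is the `webapps` directory named in the steps above.

```java
import java.io.IOException;
import java.nio.file.DirectoryStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardCopyOption;
import java.util.ArrayList;
import java.util.List;

/** Copies every .war file from a download folder into Tomcat's webapps directory. */
final class WarDeployer {
    private WarDeployer() {}

    static List<Path> deploy(Path downloadDir, Path tomcatHome) throws IOException {
        Path webapps = tomcatHome.resolve("webapps");
        if (!Files.isDirectory(webapps)) {
            throw new IOException("No webapps directory under " + tomcatHome);
        }
        List<Path> copied = new ArrayList<>();
        // The glob restricts the listing to files whose names end in .war.
        try (DirectoryStream<Path> wars = Files.newDirectoryStream(downloadDir, "*.war")) {
            for (Path war : wars) {
                Path target = webapps.resolve(war.getFileName());
                Files.copy(war, target, StandardCopyOption.REPLACE_EXISTING);
                copied.add(target);
            }
        }
        return copied;
    }
}
```

After the copy, starting Tomcat (step 3) should be enough for it to pick up the WAR files placed in `webapps`.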

Re: Open source mark-recapture data management platform

Post by Eurycea » Fri Feb 26, 2010 5:36 pm

I could see potential use for this in my application. Right now all our data is managed through an Oracle database and is fairly streamlined. In addition to the usual records, we also have a large db of photographs which we use to double check our field calls. I would definitely be interested in trying out this type of application. Unfortunately, I have absolutely no programming skills, so I would be of zero use as far as helping along the open source project.

I'm definitely interested in anything related to your photo matching algorithms. I'm matching spot patterns on salamander heads, and doing this by eye, even for double checking marks, is extremely time consuming. Cheers,

Nate
Eurycea
 
Posts: 103
Joined: Thu Feb 25, 2010 11:21 am

