→ Slides doi:10.7490/f1000research.1112586.1
Nancy Ide, Keith Suderman, James Pustejovsky, Marc Verhagen, Eric Nyberg, Chris Cieri Abstract
The NSF/SI2-funded Language Applications (LAPPS) Grid project (http://www.lappsgrid.org) is a collaborative effort among Brandeis University, Vassar College, Carnegie-Mellon University (CMU), and the Linguistic Data Consortium (LDC) at the University of Pennsylvania, which has developed an open, web service-based infrastructure through which massive and distributed language resources can be accessed, and tailored language services can be composed, evaluated, disseminated and consumed by researchers, developers, and students.
We recently adopted Galaxy as the primary workflow management system for the LAPPS Grid. We have worked with the Galaxy development team to adapt the system to our domain and continue this collaboration to enhance the capabilities we require and contribute to the expansion of Galaxy to domains outside the life sciences.
We have contributed a “Galaxy Flavor" including all LAPPS Grid services and resources, and have developed or are developing the following capabilities for use in Galaxy : (1) exploitation of our web service metadata to automatically detect input/output requirements and invoke converters where necessary; (2) incorporation of authentication procedures for protected data using OAuth; and (3) addition of a visualization plugin for linguistic analyses.
An additional outcome of the LAPPS/Galaxy collaboration is that it provides researchers in the life sciences with access to a wide array of NLP tools. So, for example, biologists will be able to take advantage of bio-oriented NLP web services to mine bio-entities and relations from textual sources such as PubMed, and via capabilities already present in Galaxy, integrate them into existing bio-data resources and analysis tools.