14.3. SaaS and Science
The term
science gateways
has become increasingly often used to denote
a system that provides online access to scientific software [
261
]. In general, a
science gateway is a (typically web) portal that allows users to configure and invoke
scientific applications, often on supercomputers, providing a convenient gateway
to otherwise hard-to-access computers and software.
The impact of such systems on science has been considerable. For example, the
MG-RAST metagenomics analysis service
metagenomics.anl.gov
, which provides
online access to services for the analysis of genetic material in environmental
samples [
199
], has more than 22,000 registered users as of 2017, who have collec-
tively uploaded for analysis some 280,00 0 metagenomes containing more than 10
14
base pairs. That is a tremendous amount of science being supported by a single
service! Other s uccess ful systems, such as CIPRES [
201
], which provides access to
phylogenetic reconstruction software; CyberGIS [
185
], for collaborative geospatial
problem solving; and nanoHUB [
172
], which provides access to hundreds of com-
putational simulation codes in nanotechnology, also have thousands of users a nd
correspondingly large impacts on both science and education. A recent survey [
176
]
provides further insights into how and where science gateways are used.
While it is hard to generalize across such a broad spectrum of activities, we can
state that the typical science software service has some but not all of the properties
of SaaS as commonly und erstood. First, from a technology perspective: Most such
services commonly make a single version of a science application available to many
people, and many leverage modern web interface technologies to provide intuitive
interactive interfaces. Some also provide REST APIs and even SDKs to permit
programmatic access. On the other hand, many are less than fully elastic, due to
a need to run on specialized and typically overloaded supercomputers, and few are
architected to leverage the power of modern cloud platforms. Thus, they handle
modest numbers of users well, but may not scale.
From a business model perspective, few science software systems implement
pay-by-use or subscription-based payment schemes. Instead, they typically rely
on research grant support and/or allocations of compute and storage resources on
scientific computing centers. This lack of a business model can be a subject of
concern, because it raises a question about their long-term sustainability (what
happens when grants end?) and also hinders scaling (an allocation of supercomputer
time may be enough to support 10 concurrent users, but what happens when
demand increases to 1000 concurrent users? 10,000?).
We next use two example systems that have each taken a diff erent approach
to science SaaS from both technology and business model perspectives: Globus
Genomics and the Globus service.
302