Records
Links
Author
Huang, T. ; Armstrong, E.M. ; Bourassa, M.A. ; Cram, T.A. ; Elya, J. ; Greguska, F. ; Jacob, J.C. ; Ji, Z. ; Jiang, Y. ; Li, Y. ; Quach, N.T. ; McGibbney, L.J. ; Smith, S.R. ; Wilson, B.D. ; Worley S.J. ; Yang, C.
Title
An Integrated Data Analytics Platform
Type
$loc['typeJournal Article']
Year
2019
Publication
Marine Science
Abbreviated Journal
Mar. Sci.
Volume
6
Issue
Pages
Keywords
big data, Cloud computing, Ocean science, data analysis, Matchup, anomaly detection, open source
Abstract
An Integrated Science Data Analytics Platform is an environment that enables the confluence of resources for scientific investigation. It harmonizes data, tools and computational resources to enable the research community to focus on the investigation rather than spending time on security, data preparation, management, etc. OceanWorks is a NASA technology integration project to establish a cloud-based Integrated Ocean Science Data Analytics Platform for big ocean science at NASA�s Physical Oceanography Distributed Active Archive Center (PO.DAAC) for big ocean science. It focuses on advancement and maturity by bringing together several NASA open-source, big data projects for parallel analytics, anomaly detection, in situ to satellite data matchup, quality-screened data subsetting, search relevancy, and data discovery. Our communities are relying on data available through distributed data centers to conduct their research. In typical investigations, scientists would (1) search for data, (2) evaluate the relevance of that data, (3) download it, and (4) then apply algorithms to identify trends, anomalies, or other attributes of the data. Such a workflow cannot scale if the research involves a massive amount of data or multi-variate measurements. With the upcoming NASA Surface Water and Ocean Topography (SWOT) mission expected to produce over 20PB of observational data during its 3-year nominal mission, the volume of data will challenge all existing Earth Science data archival, distribution and analysis paradigms. This paper discusses how OceanWorks enhances the analysis of physical ocean data where the computation is done on an elastic cloud platform next to the archive to deliver fast, web-accessible services for working with oceanographic measurements.
Address
Corporate Author
Thesis
Publisher
Place of Publication
Editor
Language
Summary Language
Original Title
Series Editor
Series Title
Abbreviated Series Title
Series Volume
Series Issue
Edition
ISSN
ISBN
Medium
Area
Expedition
Conference
Funding
Approved
$loc['no']
Call Number
COAPS @ user @
Serial
1038
Permanent link to this record
Author
Armstrong, E.M. ; Bourassa, M.A. ; Cram, T.A. ; DeBellis, M. ; Elya, J. ; Greguska III, F.R. ; Huang, T. ; Jacob, J.C. ; Ji, Z. ; Jiang, Y. ; Li, Y. ; Quach, N. ; McGibbney, L. ; Smith, S. ; Tsontos, V.M. ; Wilson, B. ; Worley, S.J. ; Yang, C. ; Yam, E.
Title
An Integrated Data Analytics Platform
Type
$loc['typeJournal Article']
Year
2019
Publication
Frontiers in Marine Science
Abbreviated Journal
Front. Mar. Sci.
Volume
6
Issue
Pages
354
Keywords
Abstract
An Integrated Science Data Analytics Platform is an environment that enables the confluence of resources for scientific investigation. It harmonizes data, tools and computational resources to enable the research community to focus on the investigation rather than spending time on security, data preparation, management, etc. OceanWorks is a NASA technology integration project to establish a cloud-based Integrated Ocean Science Data Analytics Platform for big ocean science at NASA’s Physical Oceanography Distributed Active Archive Center (PO.DAAC) for big ocean science. It focuses on advancement and maturity by bringing together several NASA open-source, big data projects for parallel analytics, anomaly detection, in situ to satellite data matchup, quality-screened data subsetting, search relevancy, and data discovery. Our communities are relying on data available through distributed data centers to conduct their research. In typical investigations, scientists would (1) search for data, (2) evaluate the relevance of that data, (3) download it, and (4) then apply algorithms to identify trends, anomalies, or other attributes of the data. Such a workflow cannot scale if the research involves a massive amount of data or multi-variate measurements. With the upcoming NASA Surface Water and Ocean Topography (SWOT) mission expected to produce over 20PB of observational data during its 3-year nominal mission, the volume of data will challenge all existing Earth Science data archival, distribution and analysis paradigms. This paper discusses how OceanWorks enhances the analysis of physical ocean data where the computation is done on an elastic cloud platform next to the archive to deliver fast, web-accessible services for working with oceanographic measurements.
Address
Corporate Author
Thesis
Publisher
Place of Publication
Editor
Language
Summary Language
Original Title
Series Editor
Series Title
Abbreviated Series Title
Series Volume
Series Issue
Edition
ISSN
2296-7745
ISBN
Medium
Area
Expedition
Conference
Funding
Approved
$loc['no']
Call Number
COAPS @ user @
Serial
1042
Permanent link to this record