WP3: Data Analysis
Data Analysis
Data analysis is the core of the SeaBee product generation engine.
- We perform image analysis, build pipelines that structure the data workflow, and evaluate the analysis results.
- We support automated, plug-and-play image analysis using advanced Artificial Intelligence (AI) algorithms.
- We facilitate improvements in image and object analysis for identifying environmental and pollution-relevant variables, so that the analysis can respond to new technologies.
Design & Methods
Drone data is uploaded to the SeaBee data infrastructure, then pre-processed and prepared for storage and analysis. Pre-processing includes:
- orthorectification
- image stitching
- radiometric calibration and quality assessment of the data
- removal of personal/sensitive information and metadata
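The steps listed above can be pictured as an ordered chain of functions applied to each upload. The following is a minimal sketch only, not the SeaBee implementation: the `DroneImage` class, the function names, and the metadata keys in `SENSITIVE_KEYS` are all hypothetical, and the three image-processing steps are placeholders that merely record that they ran. Only the metadata-removal step does real work here.

```python
from dataclasses import dataclass, field

# Hypothetical metadata keys treated as personal/sensitive in this sketch.
SENSITIVE_KEYS = {"operator_name", "operator_email", "device_serial"}


@dataclass
class DroneImage:
    """Stand-in for one uploaded drone image (pixel data omitted)."""
    name: str
    metadata: dict = field(default_factory=dict)
    history: list = field(default_factory=list)  # names of steps applied so far


def orthorectify(img: DroneImage) -> DroneImage:
    # Placeholder: a real step would correct terrain and camera distortion.
    img.history.append("orthorectification")
    return img


def stitch(img: DroneImage) -> DroneImage:
    # Placeholder: a real step would merge overlapping images into a mosaic.
    img.history.append("image_stitching")
    return img


def radiometric_check(img: DroneImage) -> DroneImage:
    # Placeholder: a real step would calibrate and quality-check pixel values.
    img.history.append("radiometric_check")
    return img


def strip_sensitive_metadata(img: DroneImage) -> DroneImage:
    # Keep only metadata entries whose key is not flagged as sensitive.
    img.metadata = {
        key: value
        for key, value in img.metadata.items()
        if key not in SENSITIVE_KEYS
    }
    img.history.append("metadata_removal")
    return img


# Assumed order, mirroring the list in the text.
PIPELINE = [orthorectify, stitch, radiometric_check, strip_sensitive_metadata]


def preprocess(img: DroneImage, steps=PIPELINE) -> DroneImage:
    """Apply each pre-processing step in order and return the result."""
    for step in steps:
        img = step(img)
    return img
```

Keeping each step as a separate function with the same signature means that steps can be swapped or reordered without changing `preprocess`, which fits the plug-and-play goal described earlier.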
Products are generated with state-of-the-art machine learning (ML) and other artificial intelligence (AI) methods for object detection, thematic mapping of environmental data, and creation of visualization products. Examples include marine habitat maps of coastal bays and inlets and automated recognition of marine mammals.
Successful implementation of state-of-the-art ML algorithms depends on graphics processing units (GPUs) and fast working storage. Both will be provided by the UNINETT/Sigma2 high-performance computing infrastructure.
A key part of the infrastructure is a framework that enables users to teach an algorithm to perform a specific task, to create a desired product, or identify objects of interest.
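To illustrate what "teaching" an algorithm from user-provided examples can mean, here is a deliberately simple sketch of supervised learning using a nearest-centroid classifier. This is an assumption chosen for brevity, not the method used in SeaBee, which the text describes only as state-of-the-art ML. The function names, the class labels, and the colour values are hypothetical.

```python
from collections import defaultdict
from math import dist


def train(examples):
    """Compute one mean feature vector (centroid) per label.

    `examples` is an iterable of (features, label) pairs supplied by the user.
    """
    sums = {}
    counts = defaultdict(int)
    for features, label in examples:
        if label not in sums:
            sums[label] = list(features)
        else:
            sums[label] = [s + f for s, f in zip(sums[label], features)]
        counts[label] += 1
    return {
        label: [s / counts[label] for s in total]
        for label, total in sums.items()
    }


def predict(centroids, features):
    """Return the label whose centroid is closest to `features`."""
    return min(centroids, key=lambda label: dist(centroids[label], features))


# Hypothetical user-labelled pixels as (R, G, B) values.
labelled_pixels = [
    ((30, 90, 40), "seagrass"),
    ((40, 110, 50), "seagrass"),
    ((200, 190, 150), "sand"),
    ((210, 200, 160), "sand"),
]

model = train(labelled_pixels)
print(predict(model, (50, 100, 60)))
```

The user supplies only labelled examples and the training step derives the decision rule from them. A production system would replace the centroid model with a far more capable one, but the train-then-predict interface stays the same.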
Further, protocols will be developed for re-analysis of previously recorded and stored data to address new applications and research questions, for instance in seabird and mammal research or in mapping the distribution of plastic debris.
A protocol for pre-processing and data analysis will be developed, including a description of the data format and the level of pre-processing required before data is uploaded to the SeaBee infrastructure. The pre-processing software will be built from existing software, standard methods, and algorithms.
Lead: Norsk Regnesentral
Norsk Regnesentral (Norwegian Computing Center, NR) is a private, independent, non-profit foundation established in 1952. NR carries out contract research and development projects in the areas of information and communication technology and applied statistical modelling.