Abstract
The new generation of large-scale telescopes will produce thousands of data products in hundreds of GBs to multiple TBs, increasing in complexity as they increase in size. Large scale data products will present several challenges that will need to be addressed to enable the science outputs from this data. This thesis investigates new techniques to extract, transform, model and load data from large scale raw data products into efficient data structures for storage and retrieval on an open-source parallel storage and processing platform. These approaches are investigated and demonstrated by exploring three separate use cases working with astronomical data.
Original language | English |
---|---|
Qualification | Doctor of Philosophy |
Awarding Institution |
|
Supervisors/Advisors |
|
Thesis sponsors | |
Award date | 4 Oct 2023 |
DOIs | |
Publication status | Unpublished - 2023 |