Finding the right data is challenging - data spaces hold large amounts of data of various types and formats from diverse sources, and with varying quality
Integrating data is challenging - different sources follow different schemas and taxonomies, they refer to domain-specific entities using non-standardized names, and have different spatial or temporal resolutions
Preparing data for Machine Learning is challenging - for data to be AI-ready, annotation and labelling require domain knowledge and expertise, which has a high cost in terms of time and effort of domain experts
Principal Researcher at Athena Research Center
and STELAR Project Coordinator