From raw data lakes to knowledge lakes
Did you know that the amount of data doubles every 1.5 years?
As new data science techniques are developed, the analyses we run on this data are becoming more complex and require more computation time. This is why we need software that can not only do the analysis, but do it fast.
In his MSc thesis, “Correlation Detective: Efficient multivariate correlation discovery”, for which he received the award for best MSc thesis at TU/e, Koen Minartz developed an algorithm that can find interesting patterns in big data sets up to 500 times faster than existing methods. This will enable experts to discover complex relationships in a data-driven way. The algorithm will be integrated into STELAR for the discovery of multivariate correlations.
You can read the thesis at the following link: https://research.tue.nl/en/studentTheses/correlation-detective