Data Science
In our research, we address various issues pertaining to the broader topic of data science. We are particularly interested in the development of methods for the study of large relational datasets, complex systems analysis, and collections of time-stamped interactions from various disciplines.
The methods highlighted here range from network science tools, data scraping and inference, and disambiguation.
Related Publications
Detecting and Optimising Team Interactions in Software Development
PLOS One - 2024
Empirical Networks are Sparse: Enhancing Multi-Edge Models with Zero-Inflation
arXiv Preprint - 2024
Processing Large-Scale Archival Records: The Case of the Swiss Parliamentary Records
Swiss Political Science Review - 2024
Disentangling the Timescales of a Complex System: A Bayesian Approach to Temporal Network Analysis
ArXiv Preprint - 2024
Reconstructing signed relations from interaction data
Scientific Reports - 2023
Predicting variable-length paths in networked systems using multi-order generative models.
Applied Network Science - 2023
Locating Community Smells in Software Development Processes Using Higher-Order Network Centralities
Social Network Analysis and Mining - 2023
Big Data = Big Insights? Operationalising Brooks' Law in a Massive GitHub Data Set
2022 IEEE/ACM 44th International Conference on Software Engineering (ICSE) - 2022
A network approach to expertise retrieval based on path similarity and credit allocation
Journal of Economic Interaction and Coordination - 2021
gambit - An Open Source Name Disambiguation Tool for Version Control Systems
2021 IEEE/ACM 18th International Conference on Mining Software Repositories (MSR) - 2021
The likelihood-ratio test for multi-edge network models
J. Phys. Complex. 2 035012 - 2021
HYPA: Efficient Detection of Path Anomalies in Time Series Data on Networks
Proceedings of the 2020 SIAM International Conference on Data Mining - 2020
A Gaussian Process-based Self-Organizing Incremental Neural Network
2019 International Joint Conference on Neural Networks (IJCNN) - 2019
Quantifying Triadic Closure in Multi-Edge Social Networks
ACM - 2019
git2net - An Open Source Package to Mine Time-Stamped Collaboration Networks from Large git Repositories
Proceedings of the 16th International Conference on Mining Software Repositories - 2019
From Relational Data to Graphs: Inferring Significant Links Using Generalized Hypergeometric Ensembles
Social Informatics: 9th International Conference, SocInfo 2017, Oxford, UK, September 13-15, 2017, Proceedings, Part II - 2017
Multiplex Network Regression: How do relations drive interactions?
arXiv e-print - 2017