Research interests
 Machine Learning
 Data Mining
 Graph Machine Learning
 Network Analysis
 Complex Systems
Focus on problems
 Data clustering and cluster analysis
 Learning methods for highdimensional data
 Representation and clustering of text documents
 Multimedia data semantics (text, image, video)
 Online learning on data streams
 Diffusion processes and control in networks
 Graph signals
Publications
– Journals
 DeGrootbased opinion formation under a global steering mechanism. I. Conjeaud*, P. LorenzSpreen. A. Kalogeratos*, IEEE Transactions on Computational Social Systems, 2024. [pdf]
AbstractThis paper investigates how interacting agents arrive to a consensus or a polarized state. We study the opinion formation process under the effect of a global steering mechanism (GSM), which aggregates the opiniondriven stochastic agent states at the network level and feeds back to them a form of global information. We also propose a new twolayer agentbased opinion formation model, called GSMDeGroot, that captures the coupled dynamics between agenttoagent local interactions and the GSM’s steering effect. This way, agents are subject to the effects of a DeGrootlike local opinion propagation, as well as to a wide variety of possible aggregated information that can affect their opinions, such as trending news feeds, press coverage, polls, elections, etc. Contrary to the standard DeGroot model, our model allows polarization to emerge by letting agents react to the global information in a stubborn differential way. Moreover, the introduced stochastic agent states produce event stream dynamics that can fit to real event data. We explore numerically the model dynamics to find regimes of qualitatively different behavior. We also challenge our model by fitting it to the dynamics of real topics that attracted the public attention and were recorded on Twitter. Our experiments show that the proposed model holds explanatory power, as it evidently captures real opinion formation dynamics via a relatively small set of interpretable parameters.
 Learning Laplacian Matrix from Graph Signals with Sparse Spectral Representation. P. Humbert, B. Le Bars, L. Oudre, A. Kalogeratos, and N. Vayatis, Journal of Machine Learning Research (JMLR), 2021. [pdf][bib][code] AbstractIn this paper, we consider the problem of learning a graph structure from multivariate signals, known as graph signals. Such signals are multivariate observations carrying measurements corresponding to the nodes of an unknown graph, which we desire to infer. They are assumed to enjoy a sparse representation in the graph spectral domain, a feature which is known to carry information related to the cluster structure of a graph. The signals are also assumed to behave smoothly with respect to the underlying graph structure. For the graph learning problem, we propose a new optimization program to learn the Laplacian of this graph and provide two algorithms to solve it, called IGL3SR and FGL3SR. Based on a 3steps alternating procedure, both algorithms rely on standard minimization methods – such as manifold gradient descent or linear programming – and have lower complexity compared to stateoftheart algorithms. While IGL3SR ensures convergence, FGL3SR acts as a relaxation and is significantly faster since its alternating process relies on multiple closedform solutions. Both algorithms are evaluated on synthetic and real data. They are shown to perform as good or better than their competitors in terms of both numerical performance and scalability. Finally, we present a probabilistic interpretation of the optimization program as a Factor Analysis Model. Keywords: Graph learning, graph Laplacian, nonconvex optimization, graph signal processing, sparse coding, clustering.
 Revealing posturographic profile of patients with Parkinsonian syndromes through a novel hypothesis testing framework based on machine learning. I. Bargiotas, A. Kalogeratos, M. Limnios, P.–P. Vidal, D. Ricard, and N. Vayatis, PLoS ONE, 2021. [pdf] AbstractFalling in Parkinsonian syndromes (PS) is associated with postural instability and consists a common cause of disability among PS patients. Current posturographic practices record the body’s centerofpressure displacement (statokinesigram) while the patient stands on a force platform. Statokinesigrams, after appropriate signal processing, can offer numerous posturographic features, which however challenges the efforts for valid statistics via standard univariate approaches. In this work, we present the tsAUC, a nonparametric multivariate twosample test, which we employ to analyze statokinesigram differences among PS patients that are fallers (PSF) and nonfallers (PSNF). We included 123 PS patients who were classified into PSFor PSNFbased on clinical assessment and underwent simple Romberg Test (eyes open/eyes closed). We analyzed posturographic features using both multiple testing with pvalue adjustment and the tsAUC. While the tsAUCshowed significant difference between groups (pvalue = 0.01), multiple testing did not show any such difference. Interestingly, significant difference between the two groups was found only using the openeyes protocol. PSF showed significantly increased anteroposterior movements as well as increased posturographic area, compared to PSNF. Our study demonstrates the superiority of the tsAUC test compared to standard statistical tools in distinguishing PSF and PSNF in the multidimensional feature space. This result highlights more generally the fact that machine learningbased statistical tests can be seen as a natural extension of classical statistical approaches and should be considered,especially when dealing with multifactorial assessments.
 Suppressing Epidemics in Networks using PriorityPlanning, K. Scaman, A. Kalogeratos, N. Vayatis, IEEE Transactions on Network Science and Engineering, vol. 3, no. 4, pp. 271285, 2016. [pdf][bib] – Prior, 2014 arxiv version [arXiv link]. AbstractIn this paper, we analyze a large class of dynamic resource allocation (DRA) strategies, named priority planning, that aim to suppress SIS epidemics taking place in a network. This is performed by distributing treatments of limited efficiency to its infected nodes, according to a priorityorder precomputed offline. Under this perspective, an efficient DRA strategy for a given network can be designed by learning a proper linear arrangement of its nodes. In our theoretical analysis, we derive upper and lower bounds for the extinction time of the diffusion process that reveal the role of the maxcut of the considered linear arrangement. Accordingly, we highlight that the cutwidth, which is the minimum maxcut of all possible linear arrangements for a network, is a fundamental network property that determines the resource budget required to suppress the epidemic under priority planning. Finally, by making direct use of our theoretical results, we propose a novel and efficient DRA strategy, called maxcut minimization (MCM), which outperforms other competing strategies in our simulations, while offering desirable robustness under various noise profiles.
 Text Document Clustering Using Global Term Context Vectors, A. Kalogeratos and A. Likas, Knowledge and Information Systems (KAIS), vol. 31, no. 3, pp. 455–474, 2012. [pdf][recompiled pdf][bib]. AbstractDespite the advantages of the traditional vector space model (VSM) representation, there are known deficiencies concerning the term independence assumption. The high dimensionality and sparsity of the text feature space and phenomena such as polysemy and synonymy can only be handled if a way is provided to measure term similarity. Many approaches have been proposed that map document vectors onto a new feature space where learning algorithms can achieve better solutions. This paper presents the global term context vectorVSM (GTCVVSM) method for text document representation. It is an extension to VSM that: (i) it captures local contextual information for each term occurrence in the term sequences of documents; (ii) the local contexts for the occurrences of a term are combined to define the global context of that term; (iii) using the global context of all terms a proper semantic matrix is constructed; (iv) this matrix is further used to linearly map traditional VSM (Bag of Words—BOW) document vectors onto a ‘semantically smoothed’ feature space where problems such as text document clustering can be solved more efficiently. We present an experimental study demonstrating the improvement of clustering results when the proposed GTCVVSM representation is used compared with traditional VSMbased approaches.
 Document clustering using synthetic cluster prototypes, A. Kalogeratos and A. Likas, Data and Knowledge Engineering, vol.70, no. 3, pp. 284–306, 2011. [pdf][recompiled pdf][bib]. AbstractThe use of centroids as prototypes for clustering text documents with the kmeans family of methods is not always the best choice for representing text clusters due to the high dimensionality, sparsity, and low quality of text data. Especially for the cases where we seek clusters with small number of objects, the use of centroids may lead to poor solutions near the bad initial conditions. To overcome this problem, we propose the idea of synthetic cluster prototype that is computed by first selecting a subset of cluster objects (instances), then computing the representative of these objects and finally selecting important features. In this spirit, we introduce the MedoidKNN synthetic prototype that favors the representation of the dominant class in a cluster. These synthetic cluster prototypes are incorporated into the generic spherical kmeans procedure leading to a robust clustering method called ksynthetic prototypes (ksp). Comparative experimental evaluation demonstrates the robustness of the approach especially for small datasets and clusters overlapping in many dimensions and its superior performance against traditional and subspace clustering methods.
– Conferences

 Online nonparametric likelihοodratio estimation by Pearsondivergence functional minimization. A. de la Concha, N. Vayatis, and A. Kalogeratos, International Conference on Artificial Intelligence and Statistics (AISTATS), 2024 [pdf]. AbstractQuantifying the difference between two probability density functions, p and q, using available data, is a fundamental problem in Statistics and Machine Learning. A usual approach for addressing this problem is the likelihoodratio estimation (LRE) between p and q, which to our best knowledge has been investigated mainly for the offline case. This paper contributes by introducing a new framework for online nonparametric LRE (OLRE) for the setting where pairs of iid observations (x_{t} ∼ p, x′_{t} ∼ q) are observed over time. The nonparametric nature of our approach has the advantage of being agnostic to the forms of p and q. Moreover, we capitalize on the recent advances in Kernel Methods and functional minimization to develop an estimator that can be efficiently updated online. We provide theoretical guarantees for the performance of the OLRE method along with empirical validation in synthetic experiments.
 Online nonparametric likelihοodratio estimation by Pearsondivergence functional minimization. A. de la Concha, N. Vayatis, and A. Kalogeratos, International Conference on Artificial Intelligence and Statistics (AISTATS), 2024 [pdf]. Abstract
 A framework for pairedsample hypothesis testing for highdimensional data. I. Bargiotas, A. Kalogeratos, and N. Vayatis, IEEE International Conference on Tools with Artificial Intelligence (ICTAI), 2023. [pdf][slides] AbstractThe standard pairedsample testing approach in the multidimensional setting applies multiple univariate tests on the individual features, followed by p value adjustments. Such an approach suffers when the data carry numerous features. A number of studies have shown that classification accuracy can be seen as a proxy for twosample testing. However, neither theoretical foundations nor practical recipes have been proposed so far on how this strategy could be extended to multidimensional pairedsample testing. In this work, we put forward the idea that scoring functions can be produced by the decision rules defined by the perpendicular bisecting hyperplanes of the line segments connecting each pair of instances. Then, the optimal scoring function can be obtained by the pseudomedian of those rules, which we estimate by extending naturally the HodgesLehmann estimator. We accordingly propose a framework of a twostep testing procedure. First, we estimate the bisecting hyperplanes for each pair of instances and an aggregated rule derived through the HodgesLehmann estimator. The paired samples are scored by this aggregated rule to produce a unidimensional representation. Second, we perform a Wilcoxon signedrank test on the obtained representation. Our experiments indicate that our approach has substantial performance gains in testing accuracy compared to the traditional multivariate and multiple testing, while at the same time estimates each feature’s contribution to the final result.
 Opinion formation under global steering with application to social network data analysis, I. Conjeaud, P. LorenzSpreen, and A. Kalogeratos, International Conference on Complex Networks and their Applications. Extended Abstract. 2017. [pdf][slides][link to Book of Abstracts tba]
 To tree or not to tree? Assessing the impact of smoothing the decision boundaries. A. Merida, A. Kalogeratos, and M. Mougeot, 31st International Conference on Artificial Neural Networks (ICANN 2022). [pdf] AbstractWhen analyzing a dataset, it can be useful to assess how smooth the decision boundaries need to be for a model to better fit the data. This paper addresses this question by proposing the quantification of how much should the ‘rigid’ decision boundaries, produced by an algorithm that naturally finds such solutions, be relaxed to obtain a performance improvement. The approach we propose starts with the rigid decision boundaries of a seed Decision Tree (seed DT), which is used to initialize a Neural DT (NDT). The initial boundaries are challenged by relaxing them progressively through training the NDT. During this process, we measure the NDT’s performance and decision agreement to its seed DT. We show how these two measures can help the user in figuring out how expressive his model should be, before exploring it further via model selection. The validity of our approach is demonstrated with experiments on simulated and benchmark datasets.
 Offline detection of changepoints in the mean for stationary graph signals. A. de la Concha, N. Vayatis, and A. Kalogeratos, 24th International Conference on Artificial Intelligence and Statistics (AISTATS 2021). [pdf][appendix][code][poster] AbstractThis paper addresses the problem of segmenting a stream of graph signals: we aim to detect changes in the mean of the multivariate signal defined over the nodes of a known graph. We propose an offline algorithm that relies on the concept of graph signal stationarity and allows the convenient translation of the problem from the original vertex domain to the spectral domain (Graph Fourier Transform), where it is much easier to solve. Although the obtained spectral representation is sparse in real applications, to the best of our knowledge this property has not been much exploited in the existing related literature. Our main contribution is a changepoint detection algorithm that adopts a model selection perspective, which takes into account the sparsity of the spectral representation and determines automatically the number of changepoints. Our detector comes with a proof of a nonasymptotic oracle inequality, numerical experiments demonstrate the validity of our method.
 Multivariate twosample hypothesis testing through AUC maximization for biomedical applications. I. Bargiotas, A. Kalogeratos, M. Limnios, P.P. Vidal, D. Ricard, N. Vayatis, SETN 2020: 11th Hellenic Conference on Artificial Intelligence, pp. 56–59, Sep. 2020. [pdf, appendix] AbstractClinical datasets usually carry numerous features (biomarkers, characteristics, etc.) concerning the examined populations. This fact, although beneficial, challenges the statistical analysis via standard univariate approaches. In the twosample setting, the majority of the clinical studies evaluate their assumptions relying on a variety of available univariate tests, such as the Student’s ttest or MannWhitney Wilcoxon. We developed an easytouseandinterpret nonparametric twosample hypothesis testing framework (tsAUC) particularly using machine learning and the AUC maximization criterion. We test and verify the effectiveness of tsAUC in real data containing posturographic features of Parkinsonian patients (PS) with and without history of falling.
 Learning the piecewise constant graph structure of a varying Ising model. B. Le Bars, P. Humbert, A. Kalogeratos, and N. Vayatis, International Conference on Machine Learning 2020. [pdf] AbstractThis work focuses on the estimation of multiple changepoints in a timevarying Ising model that evolves piecewise constantly. The aim is to identify both the moments at which significant changes occur in the Ising model, as well as the underlying graph structures. For this purpose, we propose to estimate the neighborhood of each node by maximizing a penalized version of its conditional loglikelihood. The objective of the penalization is twofold: it imposes sparsity in the learned graphs and, thanks to a fusedtype penalty, it also enforces them to evolve piecewise constantly. Using few assumptions, we provide two changepoints consistency theorems. Those are the first in the context of unknown number of changepoints detection in timevarying Isingmodel. Finally, experimental results on several synthetic datasets and a realworld dataset demonstrate the performance of our method.
 Optimal Multiple Stopping Rule for WarmStarting Sequential Selection, M. Fekom, N. Vayatis, and A. Kalogeratos, IEEE International Conference on Tools with Artificial Intelligence (ICTAI), 2019. [pdf][arxiv link] AbstractThis work focuses on the estimation of changepoints in a timevarying Ising graphical model (outputs −1 or 1) evolving in a piecewise constant fashion. The occurring changes alter the graphical structure of the model, a structure which we also estimate in the present paper. For this purpose, we propose a new optimization program consisting in the minimization of a penalized negative conditional loglikelihood. The objective of the penalization is twofold: it imposes the learned graphs to be sparse and, thanks to a fusedtype penalty, it enforces them to evolve piecewise constantly. Using few assumptions, we then give a changepoint consistency theorem. Up to our knowledge, we are the first to present such theoretical result in the context of timevarying Ising model. Finally, experimental results on several synthetic examples and a realworld dataset demonstrate the empirical performance of our method.
 Sequential Dynamic Resource Allocation for Epidemic Control, M. Fekom, N. Vayatis, and A. Kalogeratos, IEEE Conference on Decision and Control 2019 (CDC), 2019. [pdf][arxiv link] AbstractTUnder the Dynamic Resource Allocation (DRA) model, an administrator has the mission to allocate dynamically a limited budget of resources to the nodes of a network in order to reduce a diffusion process (DP) (e.g. an epidemic). The standard DRA assumes that the administrator has constantly full information and instantaneous access to the entire network. Towards bringing such strategies closer to reallife constraints, we first present the Restricted DRA model extension where, at each intervention round, the access is restricted to only a fraction of the network nodes, called sample. Then, inspired by sequential selection problems such as the wellknown Secretary Problem, we propose the Sequential DRA (SDRA) model. Our model introduces a sequential aspect in the decision process over the sample of each round, offering a completely new perspective to the dynamic DP control. Finally, we incorporate several sequential selection algorithms to SDRA control strategies and compare their performance in SIS epidemic simulations.
 Learning Laplacian Matrix from Bandlimited Graph Signals, B. Le Bars, P. Humbert, L. Oudre, and A. Kalogeratos, IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP). [pdf] AbstractIn this paper, we present a method for learning an underlying graph topology using observed graph signals as training data. The novelty of our method lies on the combination of two assumptions that are imposed as constraints to the graph learning process: i) the standard assumption used in the literature that signals are smooth with respect to graph structure (i.e. small signal variation at adjacent nodes), with ii) the additional assumption that signals are bandlimited, which implies sparsity in the signals’ representation in the spectral domain. The latter assumption affects the inference of the whole eigenvalue decomposition of the Laplacian matrix and leads to a challenging new optimization problem. The conducted experimental evaluation shows that the proposed algorithm to solve this problem outperforms a reference stateoftheart method that is based only on the smoothness assumption, when compared in the graph learning task on synthetic and real graph signals.
 A Probabilistic Framework to Nodelevel Anomaly Detection in Communication Networks, B. Le Bars and A. Kalogeratos, IEEE International Conference on Computer Communications 2019 (INFOCOM), 29 Apr – 2 May 2019, Paris, France. [pdf][bib]
AbstractIn this paper we consider the task of detecting abnormal communication volume occurring at nodelevel in communication networks. The signal of the communication activity is modeled by means of a clique stream: each occurring communication event is instantaneous, and activates an undirected subgraph spanning over a set of equally participating nodes. We present a probabilistic framework to model and assess the communication volume observed at any single node. Specifically, we employ nonparametric regression to learn the probability that a node takes part in a certain event knowing the set of other nodes that are involved. On the top of that, we present a concentration inequality around the estimated volume of events in which a node could participate, which in turn allows us to build an efficient and interpretable anomaly scoring function. Finally, the superior performance of the proposed approach is empirically demonstrated in realworld sensor network data, as well as using synthetic communication activity that is in accordance with that latter setting.
 Multivariate Hawkes Processes for Largescale Inference, R. Lemonnier, K. Scaman, A. Kalogeratos, 31st AAAI Conference on Artificial Intelligence (AAAI17). [pdf][supplementary][poster] – Prior arxiv version [arXiv link]. Abstract
In this paper, we present a framework for fitting multivariate Hawkes processes for largescale problems both in the number of events in the observed history n and the number of event types d (i.e. dimensions). The proposed LowRank Hawkes Process (SLRHP) framework introduces a lowrank approximation of the kernel matrix that allows to perform the nonparametric learning of the d^2 triggering kernels using at most O(ndr^2) operations, where r is the rank of the approximation (r << d, n). This comes as a major improvement to the existing stateoftheart inference algorithms that are in O(nd^2). Furthermore, the lowrank approximation allows SLRHP to learn representative patterns of interaction between event types, which is usually for the analysis of complex processes in realworld networks.

Partial network immunization in continuoustime information cascades, A. Kalogeratos, K. Scaman, L. Corinzia, and N. Vayatis, International Conference on Complex Networks and their Applications. Extended Abstract. 2017. [pdf][Book of Abstracts]
 Improving Text Stream Clustering using Term Burstiness and Coburstiness, A. Kalogeratos, P. Zagorisios, and A. Likas, Hellenic Conference of Artificial Intelligence (SETN), 2016. [pdf][recompiled][notes][slides][bib]. Abstract
In text streams, documents appear over time and their timestamps can be used to improve typical approaches for text representation and clustering. A way to exploit temporal information is through the detection of bursty terms in such streams, i.e. terms that appear in many documents during short time period. Research efforts so far have shown that utilizing the burst information in the text representation can improve the performance of text clustering algorithms. However, most attempts take into account the bursty terms individually, without investigating the relation between them.
In this work, we take advantage of the fact that most of the important documents of a topic are published during the period in which the `main’ topic terms are bursty. Therefore, we focus on both term burstiness and coburstiness by determining groups of terms that are simultaneously bursty at a time period and also cooccurring in the same documents. Next, the documents that contain cobursty terms from those groups, are considered as important for the respective topics and are used to construct robust synthetic prototypes following an agglomerative process. These prototypes are finally used to initialize in a deterministic fashion the spherical kmeans clustering algorithm. Experimental results validate empirically the quality of the solutions provided by the proposed approach which seems to efficiently overcome the initialization problem of spherical kmeans.
 A Greedy Approach for Dynamic Control of Diffusion Processes in Networks, K. Scaman, A. Kalogeratos, and N. Vayatis, IEEE International Conference on Tools with Artificial Intelligence (ICTAI), pp. 652659, 2015. [pdf][suppl. material][bib] .::::. [code]. Abstract
This paper investigates the control of a diffusion process by utilizing realtime information. More specifically, we allow the network administrator to adjust the allocation of control resources, a set of treatments that increase the recovery rate of infected nodes, according to the evolution of the diffusion process. We first present a novel framework for describing a large class of dynamic control strategies. These strategies rely on sorting the nodes according to a priority score in order to treat more sensitive regions first. Then, we propose the Largest Reduction in Infectious Edges (LRIE) control strategy which is based on a greedy minimization of the cost associated to the undesired diffusion, and has the benefits of being efficient and easy to implement. Our simulations, which were conducted using a software package that we developed and made available to the community, show that the LRIE strategy substantially outperforms its competitors in a wide range of scenarios.
 Dipmeans: an incremental clustering method for estimating the number of clusters, A. Kalogeratos and A. Likas, Proceedings of the 26th Annual Conference on Neural Information Processing Systems (NIPS), 2012. [pdf][poster][bib] .::::. [software]. AbstractLearning the number of clusters is a key problem in data clustering. We present dipmeans, a novel robust incremental method to learn the number of data clusters that can be used as a wrapper around any iterative clustering algorithm of kmeans family. In contrast to many popular methods which make assumptions about the underlying cluster distributions, dipmeans only assumes a fundamental cluster property: each cluster to admit a unimodal distribution. The proposed algorithm considers each cluster member as an individual ‘viewer’ and applies a univariate statistic hypothesis test for unimodality (diptest) on the distribution of distances between the viewer and the cluster members. Important advantages are: i) the unimodality test is applied on univariate distance vectors, ii) it can be directly applied with kernelbased methods, since only the pairwise distances are involved in the computations. Experimental results on artificial and real datasets indicate the effectiveness of our method and its superiority over analogous approaches.
 Movie Segmentation into Scenes and Chapters Using Locally Weighted Bag of Visual Words, V. Chasanis, A. Kalogeratos, and A. Likas, Proceedings of the ACM International Conference on Image and Video Retrieval (CIVR), pp. 8–10, 2009. [pdf][bib]. Abstract
Movies segmentation into semantically correlated units is a quite tedious task due to ”semantic gap”. Lowlevel features do not provide useful information about the semantical correlation between shots and usually fail to detect scenes with constantly dynamic content. In the method we propose herein, local invariant descriptors are used to represent the keyframes of video shots and a visual vocabulary is created from these descriptors resulting to a visual words histogram representation (bag of visual words) for each shot. A key aspect of our method is that, based on an idea from text segmentation, the histograms of visual words corresponding to each shot are further smoothed temporally by taking into account the histograms of neighboring shots. In this way, valuable contextual information is preserved. The final scene and chapter boundaries are determined at the local maxima of the difference of successive smoothed histograms for low and high values of the smoothing parameter respectively. Numerical experiments indicate that our method provides high detection rates while preserving a good tradeoff between recall and precision.

A SignificanceBased Graph Model for Clustering Web Documents, A. Kalogeratos and A. Likas, Hellenic Conference of Artificial Intelligence (SETN), LNAI 3955, pp. 516–519, SpringerVerlag, 2006. [pdf][extended pdf][bib]. Abstract
Traditional document clustering techniques rely on singleterm analysis, such as the widely used Vector Space Model. However, recent approaches have emerged that are based on Graph Models and provide a more detailed description of document properties. In this work we present a novel Significancebased Graph Model for Web documents that introduces a sophisticated graph weighting method, based on significance evaluation of graph elements. We also define an associated similarity measure based on the maximum common subgraph between the graphs of the corresponding web documents. Experimental results on artificial and real document collections using wellknown clustering algorithms indicate the effectiveness of the proposed approach.
– Book chapters
 Information Diffusion and Rumor Spreading, A. Kalogeratos, K. Scaman, L. Corinzia, and N. Vayatis, chapter in the book Cooperative and Graph Signal Processing – Principles and Applications, eds. P.M. Djuric and C. Richard, Elsevier, 2018. [pdf][publisher’s link][bib]
AbstractThis chapter studies information cascades on social networks with a special focus on types of diffusion processes such as rumors and false news. The complex temporal dynamics of information cascades and rapid changes in user interests require flexible mathematical modeling to properly describe the diffusion dynamics. After mentioning the modeling advancements of recent decades, we get to modern models, such as the Information Cascade Model (ICM), that are indeed capable of describing such timedependent user interests and are thus particularly suited to the analysis of information diffusion. We provide a theoretical analysis of ICM, relating the dynamics of the cascade to structural characteristics of the social network, and then use that analysis to design control policies capable of efficiently reducing the undesired diffusion. The presented framework for activity shaping is generic while enjoying a simple convex relaxation. Finally, we present an algorithm for the control of ContinuousTime Independent Cascades which is evaluated and compared against baseline and stateofthe art approaches through diffusion simulations on real and synthetic social networks.
 Algorithmes efficaces pour contenir des processus de contagion sur des réseaux, A. Kalogeratos and K. Scaman, Book: Big Data et politiques publiques dans les transports, , Economica, 2017. [book]
 Mining Clinical Data, A. Kalogeratos, V. Chasanis, G. Rakocevic, A. Likas, Z. Babovic, M. Novakovic, Book: Computational Medicine in Data Mining and Modeling, Eds. G. Rakocevic et al., pp. 134, 2013. [link][pdf][bib]
– Working papers & Preprints

 Collaborative nonparametric twosample testing, A. de la Concha, N. Vayatis, and A. Kalogeratos, preprint, 2024. [pdf] AbstractThis paper addresses the multiple twosample test problem in a graphstructured setting, which is a common scenario in fields such as Spatial Statistics and Neuroscience. Each node v in fixed graph deals with a twosample testing problem between two nodespecific probability density functions (pdfs), pv and qv. The goal is to identify nodes where the null hypothesis pv=qv should be rejected, under the assumption that connected nodes would yield similar test outcomes. We propose the nonparametric collaborative twosample testing (CTST) framework that efficiently leverages the graph structure and minimizes the assumptions over pv and qv. Our methodology integrates elements from fdivergence estimation, Kernel Methods, and Multitask Learning. We use synthetic experiments and a real sensor network detecting seismic activity to demonstrate that CTST outperforms stateoftheart nonparametric statistical tests that apply at each node independently, hence disregard the geometry of the problem.
 Collaborative nonparametric twosample testing, A. de la Concha, N. Vayatis, and A. Kalogeratos, preprint, 2024. [pdf] Abstract
 Stein Boltzmann Sampling: A Variational Approach forGlobal Optimization, G. Serré, A. Kalogeratos, and Nicolas Vayatis, preprint, 2024. [pdf] AbstractIn this paper, we introduce a new flowbased method for global optimization of Lipschitz functions, called Stein Boltzmann Sampling (SBS). Our method samples from the Boltzmann distribution that becomes asymptotically uniform over the set of the minimizers of the function to be optimized. Candidate solutions are sampled via the Stein Variational Gradient Descent algorithm. We prove the asymptotic convergence of our method, introduce two SBS variants, and provide a detailed comparison with several stateoftheart global optimization algorithms on various benchmark functions. The design of our method, the theoretical results, and our experiments, suggest that SBS is particularly wellsuited to be used as a continuation of efficient global optimization methods as it can produce better solutions while making a good use of the budget.
 UniForCE: The unimodality forest method for clustering and estimation of the number of clusters, G. Vardakas, A. Kalogeratos, and A. Likas, preprint, 2023. [pdf][code] AbstractEstimating the number of clusters k while clustering the data is a challenging task. An incorrect cluster assumption indicates that the number of clusters k gets wrongly estimated. Consequently, the model fitting becomes less important. In this work, we focus on the concept of unimodality and propose a flexible cluster definition called locally unimodal cluster. A locally unimodal cluster extends for as long as unimodality is locally preserved across pairs of subclusters of the data. Then, we propose the UniForCE method for locally unimodal clustering. The method starts with an initial overclustering of the data and relies on the unimodality graph that connects subclusters forming unimodal pairs. Such pairs are identified using an appropriate statistical test. UniForCE identifies maximal locally unimodal clusters by computing a spanning forest in the unimodality graph. Experimental results on both real and synthetic datasets illustrate that the proposed methodology is particularly flexible and robust in discovering regular and highly complex cluster shapes. Most importantly, it automatically provides an adequate estimation of the number of clusters.
 Online Centralized Nonparametric Changepoint Detection via Graphbased Likelihoodratio Estimation, A. de la Concha, A. Kalogeratos, and N. Vayatis, preprint, 2023 [pdf].AbstractConsider each node of a graph to be generating a data stream that is synchronized and observed at near realtime. At a changepoint τ, a change occurs at a subset of nodes C, which affects the probability distribution of their associated node streams. In this paper, we propose a novel kernelbased method to both detect τ and localize C, based on the direct estimation of the likelihoodratio between the postchange and the prechange distributions of the node streams. Our main working hypothesis is the smoothness of the likelihoodratio estimates over the graph, i.e. connected nodes are expected to have similar likelihoodratios. The quality of the proposed method is demonstrated on extensive experiments on synthetic scenarios.
 Clustering for directed graphs using parametrized random walk diffusion kernels, H. Sevi, M. Jonckheere, and A. Kalogeratos, preprint, 2022 [pdf] AbstractClustering based on the random walk operator has been proven effective for undirected graphs, but its generalization to directed graphs (digraphs) is much more challenging. Although the random walk operator is welldefined for digraphs, in most cases such graphs are not strongly connected, and hence the associated random walks are not irreducible, which is a crucial property for clustering that exists naturally in the undirected setting. To remedy this, the usual workaround is to either naively symmetrize the adjacency matrix or to replace the natural random walk operator by the teleporting random walk operator, but this can lead to the loss of valuable information carried by edge directionality. In this paper, we introduce a new clustering framework, the Parametrized Random Walk Diffusion Kernel Clustering (PRWDKC), which is suitable for handling both directed and undirected graphs. Our framework is based on the diffusion geometry and the generalized spectral clustering framework. Accordingly, we propose an algorithm that automatically reveals the cluster structure at a given scale, by considering the random walk dynamics associated with a parametrized kernel operator, and by estimating its critical diffusion time. Experiments on KNN graphs constructed from realworld datasets and realworld graphs show that our clustering approach performs well in all tested cases, and outperforms existing approaches in most of them.
 Collaborative likelihoodratio estimation over graphs, A. de la Concha, A. Kalogeratos, and N. Vayatis, preprint, 202224 [pdf]. AbstractAssuming we have i.i.d observations from two unknown probability density functions (pdfs), p and p’, the likelihoodratio estimation (LRE) is an elegant approach to compare the two pdfs just by relying on the available data, and without knowing the pdfs explicitly. In this paper we introduce a graphbased extension of this problem: Suppose each node of a fixed graph has access to observations coming from two unknown nodespecific pdfs, p_i and p’_i; the goal is then to compare the respective and of each node by also integrating information provided by the graph structure. This setting is interesting when the graph conveys some sort of `similarity’ between the nodewise estimation tasks, which suggests that the nodes can collaborate to solve more efficiently their individual tasks, while on the other hand trying to limit the data sharing among them. Our main contribution is a distributed nonparametric framework for graphbased LRE, called GRULSIF, that incorporates in a novel way elements from fdivengence functionals, Kernel methods, and Multitask Learning. Among the several applications of LRE, we choose the twosample hypothesis testing to develop a proof of concept for our graphbased learning framework. Our experiments compare favorably the performance of our approach against stateoftheart nonparametric statistical tests that apply at each node independently, and thus disregard the graph structure.
 Generalized Spectral Clustering for Directed and Undirected Graphs, H. Sevi, M. Jonckeere, and A. Kalogeratos, preprint, 2022. [pdf]. AbstractSpectral clustering is a popular approach for clustering undirected graphs, but its extension to directed graphs (digraphs) is much more challenging. A typical workaround is to naively symmetrize the adjacency matrix of the directed graph, which can however lead to discarding valuable information carried by edge directionality. In this paper, we present a generalized spectral clustering framework that can address both directed and undirected graphs. Our approach is based on the spectral relaxation of a new functional that we introduce as the generalized Dirichlet energy of a graph function, with respect to an arbitrary positive regularizing measure on the graph edges. We also propose a practical parametrization of the regularizing measure constructed from the iterated powers of the natural random walk on the graph. We present theoretical arguments to explain the efficiency of our framework in the challenging setting of unbalanced classes. Experiments using directed KNN graphs constructed from real datasets show that our graph partitioning method performs consistently well in all cases, while outperforming existing approaches in most of them.
 Online nonparametric changepoint detection for heterogeneous data streams observed over graph nodes . A. de la Concha, A. Kalogeratos, and Nicolas Vayatis, arxiv preprint, 2021. [pdf] AbstractConsider a heterogeneous data stream being generated by the nodes of a graph. The data stream is in essence composed by multiple streams, possibly of different nature that depends on each node. At a given moment τ, a changepoint occurs for a subset of nodes C, signifying the change in the probability distribution of their associated streams. In this paper we propose an online nonparametric method to infer τ based on the direct estimation of the likelihoodratio between the postchange and the prechange distribution associated with the data stream of each node. We propose a kernelbased method, under the hypothesis that connected nodes of the graph are expected to have similar likelihoodratio estimates when there is no changepoint. We demonstrate the quality of our method on synthetic experiments and realworld applications.
 Epidemic Models for COVID19 during the First Wave from February to May 2020: a Methodological Review. M. Garin, M. Limnios, A. Nicolaï, I. Bargiotas, O. Boulant, S.E. Chick, A. Dib, T. Evgeniou, M. Fekom, A. Kalogeratos, C. Labourdette, A. Ovchinnikov, R. Porcher, C. Pouchol, and N. Vayatis, arxiv:2109.01450, Sep 2021. [pdf] AbstractWe review epidemiological models for the propagation of the COVID19 pandemic during the early months of the outbreak: from February to May 2020. The aim is to propose a methodological review that highlights the following characteristics: (i) the epidemic propagation models, (ii) the modeling of intervention strategies, (iii) the models and estimation procedures of the epidemic parameters and (iv) the characteristics of the data used. We finally selected 80 articles from open access databases based on criteria such as the theoretical background, the reproducibility, the incorporation of interventions strategies, etc. It mainly resulted to phenomenological, compartmental and individuallevel models. A digital companion including an online sheet, a Kibana interface and a markdown document is proposed. Finally, this work provides an opportunity to witness how the scientific community reacted to this unique situation.
 Efficient streambased MaxMin diversification with minimal failure rate, M. Fekom and A. Kalogeratos, 2020 [pdf] AbstractThe streambased MaxMin diversification problem concerns the task of selecting a limited number of diverse instances from a data stream. The nature of the problem demands immediate and irrevocable decisions. The setwise diversity to be maximized is the minimum distance among any pair of the selected instances. Standard algorithmic approaches for sequential selection disregard the possibility of selection failures, which is the situation where the last instances of the stream are picked by default to prevent having an incomplete selection. This defect can be catastrophic for the MaxMin diversification objective. In this paper we present the Failure Rate Minimization (FRM) algorithm that allows the selection of a set of disparate instances while reducing significantly the probability of having failures. This is achieved by means of both analytical and empirical techniques. FRM is put in comparison with relevant algorithms from the literature through simulations on real datasets, where we demonstrate its efficiency and low time complexity.
 Winning the competition: enhancing countercontagion in SISlike epidemic processes. A. Kalogeratos and S.S. Mannelli, Technical Report, Jun 2020. [pdf] AbstractIn this paper we consider the epidemic competition between two generic diffusion processes, where each competing side is represented by a different state of a stochastic process. For this setting, we present the Generalized Largest Reduction in Infectious Edges (gLRIE) dynamic resource allocation strategy to advantage the preferred state against the other. Motivated by social epidemics, we apply this method to a generic continuoustime SISlike diffusion model where we allow for: i) arbitrary node transition rate functions that describe the dynamics of propagation depending on the network state, and ii) competition between the healthy (positive) and infected (negative) states, which are both diffusive at the same time, yet mutually exclusive on each node. Finally we use simulations to compare empirically the proposed gLRIE against competitive approaches from literature.
 Dynamic Epidemic Control via Sequential Resource Allocation, M. Fekom, N. Vayatis, and A. Kalogeratos, Technical Report, Jun 2020. [pdf] AbstractIn the Dynamic Resource Allocation (DRA) problem, an administrator has to allocate a limited amount of resources to the nodes of a network in order to reduce a diffusion process (DP) (e.g. an epidemic). In this paper we propose a multiround dynamic control framework, which we realize through two derived models: the Restricted and the Sequential DRA (RDRA, SDRA), that allows for restricted information and access to the entire network, contrary to standard fullinformation and fullaccess DRA models. At each intervention round, the administrator has only access — simultaneous for the former, sequential for the latter — to a fraction of the network nodes. This sequential aspect in the decision process offers a completely new perspective to the dynamic DP control, making this work the first to cast the dynamic control problem as a series of sequential selection problems. Through indepth SIS epidemic simulations we compare the performance of our multiround approach with other resource allocation strategies and several sequential selection algorithms on both generated, and realdata networks. The results provide evidence about the efficiency and applicability of the proposed framework for reallife problems.
 The Warmstarting Sequential Selection Problem and its Multiround Extension, M. Fekom, A. Kalogeratos, and N. Vayatis, Technical Report (eprint arxiv:1709.05231), Nov 2019. [pdf][arXiv link] (this updated the older version with title “The MultiRound Sequential Selection“, from Sep 2018.AbstractIn the Sequential Selection Problem (SSP), immediate and irrevocable decisions need to be made as candidates randomly arrive for a job interview. Standard SSP variants, such as the wellknown secretary problem, begin with an empty selection set (coldstart) and perform the selection process once over a single candidate set (singleround). In this paper we address these two limitations. First, we introduce the novel Warmstarting SSP (WSSP) setting which considers at hand a reference set, a set of previously selected items of a given quality, and tries to update optimally that set by (re)assigning each job at most once. We adopt a cutoffbased approach to optimize a rankbased objective function over the final assignment of the jobs. In our technical contribution, we provide analytical results regarding the proposed WSSP setting, we introduce the algorithm Cutoffbased Cost Minimization (CCM) (and the low failuresCCM, which is more robust to high rate of resignations) that adapts to changes in the quality of the reference set thanks to the translation method we propose. Finally, we implement and test CCM in a multiround setting that is particularly interesting for realworld application scenarios.
 A Spectral Method for Activity Shaping in ContinuousTime Information Cascades, K. Scaman, A. Kalogeratos, L. Corinzia, N. Vayatis, Technical Report, Sep 2017. [pdf][arXiv link].AbstractThe Information Cascades Model captures dynamical properties of user activity in a social network. In this work, we develop a novel framework for activity shaping under the ContinuousTime Information Cascades Model which allows the administrator for local control actions by allocating targeted resources that can alter the spread of the process. Our framework employs the optimization of the spectral radius of the Hazard matrix, a quantity that has been shown to drive the maximum influence in a network, while enjoying a simple convex relaxation when used to minimize the influence of the cascade. In addition, usecases such as quarantine and node immunization are discussed to highlight the generality of the proposed activity shaping framework. Finally, we present the NetShape influence minimization method which is compared favorably to baseline and stateoftheart approaches through simulations on real social networks.
Workshops / Meetings (short papers, posters, abstracts with no proc.)
 Collaborative likelihoodratio estimation over graphs, A. de la Concha, A. Kalogeratos, and N. Vayatis, Workshop NeurIPS@Paris 2022 [program], 2324 November 2022. [paper][slides][poster]
 Data inspection via challenging decision boundaries’ rigidity, A. Merida, A. Kalogeratos, and Mathilde Mougeot. Int. Conf. of Computational Statistics (COMPSTAT), 2022.
 Generalized Spectral Clustering for Directed and Undirected Graphs, H. Sevi, M. Jonckeere, and A. Kalogeratos, poster at the Workshop on Recent Advancements on Graph Machine Learning, Sorbonne University, 2022. [paper]
 The Warmstarting Sequential Selection Problem, M. Fekom, N. Vayatis, and A. Kalogeratos, International Workshop in Sequential Methodologies (IWSM), June 2019, State University of New York at Binghamton, US.
 Nodelevel Anomaly Detection in Communication Networks, B. Le Bars and A. Kalogeratos, 3rd Graph Signal Processing Workshop (GSP), 2018. [short paper, no proc.; longer version will become soon available] AbstractIn this paper we consider the task of the detection of abnormal communication volume, occurring at nodelevel in a communication network. We model the communication by means of a dynamic graph with edges related to an occurring communication event appearing instantaneously. We propose a statistical model for the communication volume recorded at a single node, in order to detect anomalies. Our approach is evaluated in realworld communication data.
 Optimizing group selection over multiple sequential selection rounds, M. Fekom, A. Kalogeratos, and N. Vayatis, Poster at the Workshop on MultiArmed Bandits and Learning Algorithms, 2018.
 On behavioral epidemics and dynamic control, S. Sarao, A. Kalogeratos, K. Scaman, and N. Vayatis, Poster at the 42nd Middle European Cooperation in Statistical Physics (MECO), Feb 2017, Lyon, France.
 Dynamic control of social diffusions using extensions of the SIS model, A. Kalogeratos, S. Sarao, K. Scaman, and N. Vayatis, to appear as a full oral presentation in 2016 Conference on Complex Systems in Amsterdam (no proceedings). Abstract
Diffusion processes model propagation phenomena on complex networks, such as epidemics, information diffusion, and viral marketing. In many situations, it is critical to suppress an undesired diffusion process by means of dynamic resource allocation, where one need to decide targeted actions by taking into account the evolving infection state of the network.
In the context of continuoustime SIS, and with provided full information regarding the nodes’ state, we consider the scenario where a budget of treatment resources of limited efficiency is available at each time for distribution to infected nodes. Recent results on this particular problem include the Priority Planning approach which computes a linear ordering of the nodes with minimal maxcut, and the optimal greedy approach called Largest Reduction of Infectious Edges (LRIE). The latter is a simple, yet efficient, strategy that computes an intuitive priority score for the infected nodes which combines the notion of node virality (possibility to infect other nodes) and vulnerability (possibility to get reinfected after recovery).
In this work we show that the principle of the LRIE score holds for a wide range of SISlike modeling scenarios. More specifically, we propose the Generalized LRIE (gLRIE) strategy and study the dynamic diffusion control by introducing a twofold extension to SIS which can model important aspects of social diffusion (e.g. behaviors or habits). The first considers nonlinear functions of infection rates with saturation. On the top that, our second extension considers competition in the sense that the two node states, the infected and the healthy, are both diffusive, though each node can only be in only one of them at a time. In this case, our control strategy has to align with the healthy diffusion to help it to win the competition. Finally, simulations on largescale real and synthetic networks show the efficiency of gLIE.
 Learning to Suppress SIS Epidemics in Networks, A. Kalogeratos, K. Scaman, and N. Vayatis, NIPS 2015 Networks in the Social and Information Sciences workshop (NIPS 2015), 2015. [workshop link][pdf][poster][bib]. Abstract
In this paper, we introduce the class of priority planning strategies for suppressing SIS epidemics taking place in a network. This is performed via dynamic allocation of treatment resources with limited efficiency to the infected nodes, according to a precomputed priorityorder. Using recent theoretical results that highlight the role of the maxcut of a node ordering and the extinction time of an epidemic, we propose a simple and efficient strategy called MaxCut Minimization (MCM) that outperforms competing stateoftheart strategies in simulated epidemic scenarios.
 Dynamic Treatment Allocation for Epidemic Control in Arbitrary Networks, K. Scaman, A. Kalogeratos, and N. Vayatis, WSDM 2014 Diffusion in Networks and Cascade Analytics (DiffNet) Workshop, 2014. [workshop link][pdf][slides][poster][bib]. Abstract
While static epidemic control, e.g. using vaccination, has been extensively studied for various network types, controlling epidemics dynamically remains an open issue. In this work, we first propose a general model formulation for the dynamic treatment allocation problem for the SusceptibleInfectedSusceptible diffusion model. Then, we investigate dynamic control strategies and further propose the novel Largest Reduction in Infectious Edges (LRIE) heuristic that gives priority to the treatment of nodes that have both a high dissemination rate of the infection to many healthy nodes, and low reinfection probability after recovery. Experiments on random and a realworld network show that the dynamic problem is significantly different from vaccination, since the latter strategies can lead to disastrous results, and that the proposed heuristic is an effective strategy under various initial infection conditions.
Invited Talks

At the 10th International Workshop on Applied Probability (IWAP) 2023 at Thessaloniki, Greece. The talk was about the Warmstarting selection process and a multiround extention.
 At the French German Summer School on Transfer Learning, ENSParisSaclay, June 46 2018.
 Invited seminar talk on “Epidemics, Competition and Resource Management” was given in Paris at a Working Group on Machine Learning and Big Data of the French Ministry of Social Affairs and Health (organizer: Magali Beffy). The group is formed by scientists and researchers working on various organizations/institutions (Ministry of Health, INSEE, IRDES, Dauphine) – 25/1/2017.
 Dynamic suppression of epidemics on networks, at the Complex Networks research group of LIP6, Paris 6 (Jussieu), France, 13/6/2016.
 Epidemics in the new socioeconomic era: challenges and applications, at the Department of Computer Science and Engineering, University of Ioannina, Greece, 25/5/2016.
 Suppressing epidemics on arbitrary networks using treatment resources of limited efficiency, at INRA Research Center at JouyenJosas, Paris area, 4/3/2016.
 Invited Talk – Efficient algorithms for the suppression of diffusion processes on networks with application in epidemiology and marketing, summit: “Big data and public policies for the transportation” organized by the General Direction of Infrastructures, Transportation, and the General Commissioner on Durable Development of the Ministry of Ecology, Durable Development and Energy, along with ENS Cachan and PSESchool of Economics of Paris (15/10/2015) [agenda][presentation].
PhD Thesis
Received on April 2013, [page]
 Knowledge Extraction Methods for Document Collections, Department of Computer Science, University of Ioannina [pdf][bib]
 Presentation (April 17, 2013) [pdf]
Records
DBPL record: [link] — Google scholar: [link]
Copyright notice: This page provides material to ensure timely dissemination of academic work and may slightly differ to some extent from the final published versions. In some cases, additional recompiled pdf versions of the papers are provided above, with the material as published after correcting very few minor typos. Copyright and all rights therein are retained by authors or by other copyright holders.