Eeshaan Jain | projects

Efficient Matroid-constraint-based Submodular Maximization
Post-hoc Out of Distribution Detection
Estimation of Epidemic State using Graph Neural Networks

Efficient Matroid-constraint-based Submodular Maximization

Report: [Click here]
Code: [Click here]
Summary: Maximization of monotone submodular functions subject to cardinality constraints has been a topic of interest with various greedy algorithms, with many of these algorithms achieving a \(\left(1 - \frac{1}{e}\right)\) approximation on the optimal solution (OPT) at high computational speeds. However, these variants of classical greedy do not work well on matroid constraints, and only provide a \(\frac{1}{2}\) approximation of OPT. To overcome this, CONTINUOUS-GREEDY was introduced which gives a \(\left(1-\frac{1}{e} -\epsilon\right)\) approximation of OPT but it has a \(O(n^8)\) running time. In spite of that, the algorithm is useful and generalizable. An improvement to the algorithm is ACCELERATED-CONTINUOUS-GREEDY introduced which has \(\widetilde{O}(n^2)\) running time and is much more efficient than the previous version. The project creates support for matroid-based optimization implementation of these two optimization algorithms along with four matroid-constraint-based submodular functions in SUBMODLIB an efficient and scalable Python library for submodular optimization in Python with a C++ optimization engine. We improve the ACCELERATED-CONTINUOUS-GREEDY procedure further, and demonstrate performance on the SAP, GAP and SWP problems.

Post-hoc Out of Distribution Detection

Report: [Click here]
Slides: [Click here]
Code: [Click here]
Summary: We focus on OOD detection, in a classification setting, after a classifier has already been trained (i.e., a post-hoc/post-training setting). Many popular baselines propose score functions to detect OOD data. These score functions should assign low scores to OOD data and high scores to ID data. We introduce a Dirichlet-based scoring function which outperforms other scoring functions, is computationally cheap, can generalize across different OOD settings and has a theoretical backing. In cases when the performance is not good enough, we provide ways to further improve the separability between the ID and OOD data without using a large and diverse auxiliary OOD dataset (like 80 Million Tiny Images and ImageNet-22K) for fine-tuning, unlike many other popular approaches. We further demonstrate that tuned margins don’t always improve the result over different non-parametric energy-based loss functions.

Estimation of Epidemic State using Graph Neural Networks

Report: [Click here]
Code: [Click here]
Summary: Epidemic forecasting provides an opportunity to predict geographic disease spread as well as case counts to better inform public health interventions when outbreaks occur. In this project, we firstly explore the work already done in the domain of epidemics, using both classical machine learning and deep learning. Further, we demonstrate the spread of epidemics using the famous SIR model on random graphs generated by the Erd˝os–Rényi-Gilbert, Watts-Strogatz and Barabási–Albert models. Finally, we construct a graph neural network followed by a fully connected network to predict the category (safe/infected/recovered) a person might be in during an epidemic, while monitoring only a fraction of the nodes – which aligns with the practicality of epidemic monitoring.

Table of Contents

Efficient Matroid-constraint-based Submodular Maximization

Post-hoc Out of Distribution Detection

Estimation of Epidemic State using Graph Neural Networks