Yeganeh
Alimohammadi
Assistant Professor of Data Sciences & Operations
USC Marshall School of Business
I design principled methods for learning and decision-making in complex systems where data is incomplete, costly to collect, or shaped by strategic behavior. Drawing on tools from applied probability, optimization, and algorithm design, my work builds robust, data-efficient algorithms for reliable inference and intervention.
Previously: UC President's Postdoctoral Fellow at Berkeley, Ph.D. in Operations Research at Stanford, research fellow at the Simons Institute, B.Sc. from Sharif University of Technology.
If you've ever stumbled over my name (Yeganeh /jeɡɒnɛ/), click here.
It's pronounced "Yeah Gone Eh" — say it swiftly, letting the second syllable blend in. To break it down: start with "Yeah!" like you just got something right, follow with "gone" like something mysteriously disappeared, and end with "eh" like you're asking a thoughtful question. Thanks for taking the time to get it right.
— RECENT NEWS
— RESEARCH
Working papers
abstract
Abstract coming soon.
abstract
Abstract coming soon.
Online social platforms increasingly rely on crowd-sourced systems to label misleading content at scale, but these systems must both aggregate users’ evaluations and decide whose evaluations to trust. To address the latter, many platforms audit users by rewarding agreement with the final aggregate outcome — a design we term consensus-based auditing. We analyze the consequences of this design in X’s Community Notes, which in September 2022 adopted consensus-based auditing that ties users’ eligibility for participation to agreement with the eventual platform outcome. We find evidence of strategic conformity: minority contributors’ evaluations drift toward the majority and their participation share falls on controversial topics, where independent signals matter most. We formalize this mechanism in a behavioral model in which contributors trade off private beliefs against anticipated penalties for disagreement.
The Mallows model is a widely-used probabilistic framework for learning from ranking data, with applications ranging from recommendation systems and voting to aligning language models with human preferences. Under this model, observed rankings are noisy perturbations of a central ranking σ, with likelihood decaying exponentially in distance from σ. Existing methods mainly focus on fixed distances (such as Kendall’s τ distance), with no principled approach to learning the distance metric directly from data. In practice, however, rankings naturally vary by context. We focus on the L_α Mallows model and present an efficient Maximum Likelihood Estimation algorithm that jointly estimates the central ranking σ, the dispersion parameter β, and the distance parameter α.
Small-world networks, known for high local clustering and short path lengths, are a fundamental structure in many real-world systems, including social, biological, and technological networks. We apply the theory of (marked) local convergence (also known as Benjamini–Schramm convergence) to derive the limiting behavior of the local structures for two commonly studied small-world network models: the Watts–Strogatz and the Kleinberg models. Establishing local convergence enables us to show that key network measures, such as clustering coefficient, PageRank, greedy maximal independent set, number of spanning trees, and tree entropy, converge as network size increases, with their limits determined by the graph’s local structure. We further observe a critical change in the behavior of the limit exactly when the parameter governing long-range connections in the Kleinberg model crosses the threshold where decentralized search remains efficient, offering a new perspective on why decentralized algorithms fail in certain regimes.
Refereed journal publications
abstract
We study how to predict the trajectory of an SIR epidemic on a network when the full network is unobservable. We design a local algorithm that, given a randomly chosen target node, explores at most k of its neighbors by simulating the epidemic process backward in time, and infers when the target would have been infected. Averaging over q random target nodes yields an estimator for the global fractions S(t), I(t), R(t). Under a tightness condition on the network sequence, we prove that the sample size required for ε accuracy is a constant independent of network size — both k and q remain bounded as the population grows. The result extends to random network models under a stability condition on local neighborhood distributions, and to weighted sampling. We validate the predictions on Copenhagen contact data and on a large-scale mobility network of San Francisco.
Abstract coming soon.
abstract
This paper makes three contributions to estimating the number of perfect matchings in bipartite graphs. First, we prove that the popular sequential importance sampling algorithm works in polynomial time for dense bipartite graphs: our algorithm gives a (1±ε)-approximation for the number of perfect matchings of a λ-dense bipartite graph, using O(n^((1−2λ)/λ) ε^(−2)) samples, where a λ-dense bipartite graph has all degrees greater than (λ+1/2)n. Second, practical applications of the algorithm require many calls to matching algorithms; a novel preprocessing step is provided which yields significant improvements. Third, three applications are given: counting Latin squares, a practical way of computing the greedy algorithm for a card-guessing game with feedback, and stochastic block models.
abstract
Abstract coming soon.
Refereed conference proceedings
We propose a theoretical framework for training Graph Neural Networks (GNNs) on large input graphs via training on small, fixed-size sampled subgraphs. This framework is applicable to a wide range of models, including popular sampling-based GNNs such as GraphSAGE and FastGCN. Leveraging the theory of graph local limits, we prove that, under mild assumptions, parameters learned from training sampling-based GNNs on small samples of a large input graph are within an ε-neighborhood of the outcome of training the same architecture on the whole graph. We derive bounds on the number of samples, the size of the graph, and the training steps required as a function of ε. We empirically illustrate our results on a node classification task on large citation graphs, observing that sampling-based GNNs trained on local subgraphs 12× smaller than the original achieve comparable performance to those trained on the input graph.
Auto-bidding has recently become a popular feature in ad auctions: advertisers provide high-level constraints and goals to an automated agent, which optimizes their auction bids on their behalf. We examine the effect of different auction rules on the incentives of advertisers to truthfully report their constraints to the auto-bidder intermediaries. More precisely, we study whether canonical auctions such as first-price (FPA) and second-price (SPA) are auto-bidding incentive compatible (AIC) — that is, whether an advertiser can gain by misreporting their constraints to the autobidder.
We study a simple model of epidemics where an infected node transmits the infection to its neighbors independently with probability p — also known as the independent cascade or SIR model with fixed recovery time. The size of an outbreak in this model is closely related to that of the giant connected component in edge percolation. Even though existing models capture the effects of degree inhomogeneity and the role of super-spreaders, they only consider graphs that are locally tree-like, i.e., have few or no short cycles. We give an algorithm that returns a (1−ε) approximation of the probability and final size of an outbreak by accessing a constant-size neighborhood of a constant number of nodes chosen uniformly at random — without requiring local tree-likeness.
We study dynamic matching in a spatial setting. Drivers are distributed at random on some interval. Riders arrive in some (possibly adversarial) order at randomly drawn points. The platform observes the location of the drivers and can match newly arrived riders immediately, or can wait for more riders to arrive; unmatched riders incur a waiting cost c per period. We quantify the value of slightly increasing supply: when there are (1+ε) drivers per rider (for any ε > 0), the cost of matching returned by a simple greedy algorithm — which pairs each arriving rider to the closest available driver — is provably better than that of the ex-post optimal matching with exactly one driver per rider. No level of sophistication in the matching algorithm and no amount of data to predict times and locations of future demand can beat a naive greedy algorithm with a small excess supply.
We show fully polynomial time randomized approximation schemes (FPRAS) for counting matchings of a given size, or more generally sampling/counting monomer–dimer systems in planar, not-necessarily-bipartite graphs. While perfect matchings on planar graphs can be counted exactly in polynomial time, counting non-perfect matchings was shown by Jerrum to be #P-hard, who also raised the question of whether efficient approximate counting is possible. We answer this affirmatively by showing that the multi-site Glauber dynamics on the set of monomers in a monomer–dimer system always mixes rapidly, and that this dynamics can be implemented efficiently on downward-closed families of graphs where counting perfect matchings is tractable. As further applications, we show how to sample efficiently from partition-constrained strongly Rayleigh distributions and nonsymmetric determinantal point processes.
— TEACHING
USC Marshall (instructor)
Core course for the MS in Business Analytics. Class size 190.
Stanford (teaching assistant)
— SERVICE
Conference program committees
Journal reviewing
Annals of Probability · Annals of Applied Probability · Operations Research · Management Science · Review of Economic Studies · Mathematics of Operations Research · Computational Statistics and Data Analysis · Computational and Applied Mathematics
Seminar organization
Students & mentoring
— HONORS & CONTACT
Selected honors
Contact
Download CV · Google Scholar · GitHub · LinkedIn
Email: yalimoha@usc.edu