I am a Professor in the ECE department of The University of Texas at Austin. I received a PhD in EECS from The Massachusetts Institute of Technology, in the Laboratory for Information and Decision Systems (LIDS), and an AB in Mathematics from Harvard University. I received the NSF CAREER award in 2011, and I am an IEEE Fellow.
My current research interests center on decision-making in large-scale complex systems, with a focus on learning and computation. Specifically, I am interested in robust and adaptable optimization, high-dimensional statistics and machine learning, and applications to large-scale networks, including social, wireless, transportation, and energy networks. I have also worked on applications of machine learning and optimization to computer-aided design.
Jiacheng Zhuo: Algorithm Developer at Hudson River Trading.
The ten most recent publications, newest first…
Rout, Litu, Yujia Chen, Nataniel Ruiz, Constantine Caramanis, Sanjay Shakkottai, and Wen-Sheng Chu. “Semantic Image Inversion and Editing Using Rectified Stochastic Differential Equations.” Proceedings of the International Conference on Learning Representations (ICLR), 2025.
@article{rout2024semantic,
title = {Semantic image inversion and editing using rectified stochastic differential equations},
author = {Rout, Litu and Chen, Yujia and Ruiz, Nataniel and Caramanis, Constantine and Shakkottai, Sanjay and Chu, Wen-Sheng},
journal = {Proceedings of the International Conference on Learning Representations (ICLR)},
arxiv = {https://arxiv.org/pdf/2410.10792},
year = {2025}
}
Generative models transform random noise into images, while their inversion aims to reconstruct structured noise for recovery and editing. This paper addresses two key tasks: (i) inversion and (ii) editing of real images using stochastic equivalents of rectified flow models (e.g., Flux). While Diffusion Models (DMs) dominate the field of generative modeling for images, their inversion suffers from faithfulness and editability challenges due to nonlinear drift and diffusion. Existing DM inversion methods require costly training of additional parameters or test-time optimization of latent variables. Rectified Flows (RFs) offer a promising alternative to DMs, yet their inversion remains underexplored.
We propose RF inversion using dynamic optimal control derived via a linear quadratic regulator, and prove that the resulting vector field is equivalent to a rectified stochastic differential equation.
We further extend our framework to design a stochastic sampler for Flux.
Our method achieves state-of-the-art performance in zero-shot inversion and editing, surpassing prior works in stroke-to-image synthesis and semantic image editing, with large-scale human evaluations confirming user preference. See our project page https://rf-inversion.github.io/ for code and demo.
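To make the rectified-flow setup concrete, the following is a minimal sketch of plain Euler integration of a rectified-flow ODE and its naive reversal for inversion; it is not the paper's stochastically controlled sampler, and velocity_model and num_steps are illustrative placeholders.

import torch

# Minimal sketch (not the paper's controlled sampler): Euler integration of a
# rectified-flow ODE dx/dt = v(x, t). Generation integrates from noise (t = 0)
# toward data (t = 1); naive inversion integrates the same field backwards.
def rf_generate(velocity_model, noise, num_steps=50):
    x, dt = noise, 1.0 / num_steps
    for i in range(num_steps):
        t = torch.full((x.shape[0],), i * dt, device=x.device)
        x = x + dt * velocity_model(x, t)   # Euler step along the flow
    return x

def rf_invert(velocity_model, image, num_steps=50):
    x, dt = image, 1.0 / num_steps
    for i in range(num_steps, 0, -1):
        t = torch.full((x.shape[0],), i * dt, device=x.device)
        x = x - dt * velocity_model(x, t)   # Euler step against the flow
    return x                                # approximate structured noise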
Raoof, Negin, Litu Rout, Giannis Daras, Sujay Sanghavi, Constantine Caramanis, Sanjay Shakkottai, and Alex Dimakis. “Infilling Score: A Pretraining Data Detection Algorithm for Large Language Models.” Proceedings of the International Conference on Learning Representations (ICLR), 2025.
@article{raoofinfilling,
title = {Infilling Score: A Pretraining Data Detection Algorithm for Large Language Models},
author = {Raoof, Negin and Rout, Litu and Daras, Giannis and Sanghavi, Sujay and Caramanis, Constantine and Shakkottai, Sanjay and Dimakis, Alex},
journal = {Proceedings of the International Conference on Learning Representations (ICLR)},
year = {2025},
group = {proceedings},
arxiv = {https://openreview.net/pdf?id=9QPH1YQCMn}
}
In pretraining data detection, the goal is to detect whether a given sentence is in the dataset used for training a Large Language Model (LLM). Recent methods (such as Min-K% and Min-K%++) reveal that most training corpora are likely contaminated with both sensitive content and evaluation benchmarks, leading to inflated test set performance. These methods sometimes fail to detect samples from the pretraining data, primarily because they depend on statistics composed of causal token likelihoods. We introduce Infilling Score, a new test-statistic based on non-causal token likelihoods. Infilling Score can be computed for autoregressive models without re-training using Bayes rule. A naive application of Bayes rule scales linearly with the vocabulary size. However, we propose a ratio test-statistic whose computation is invariant to vocabulary size. Empirically, our method achieves a significant accuracy gain over state-of-the-art methods, including Min-K% and Min-K%++, on the WikiMIA benchmark across seven models with different parameter sizes. Further, we achieve higher AUC compared to reference-free methods on the challenging MIMIR benchmark. Finally, we create a benchmark dataset consisting of recent data sources published after the release of Llama-3; this benchmark provides a statistical baseline to indicate potential corpora used for Llama-3 training.
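As a sketch of the Bayes-rule step referenced above (the notation here is illustrative, not the paper's): for a token x_i with left context x_{<i} and right context x_{>i},

p(x_i \mid x_{<i}, x_{>i}) = \frac{p(x_i \mid x_{<i})\, p(x_{>i} \mid x_{<i}, x_i)}{\sum_{v \in \mathcal{V}} p(v \mid x_{<i})\, p(x_{>i} \mid x_{<i}, v)}.

Every factor is a causal likelihood that an autoregressive model already provides, but the normalizer sums over the vocabulary \mathcal{V}, which is the linear vocabulary-size cost that the proposed ratio statistic avoids.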
Rout, Litu, Yujia Chen, Nataniel Ruiz, Abhishek Kumar, Constantine Caramanis, Sanjay Shakkottai, and Wen-Sheng Chu. “RB-Modulation: Training-Free Personalization of Diffusion Models Using Stochastic Optimal Control.” Proceedings of the International Conference on Learning Representations (ICLR), 2025.
@article{rout2024rb,
title = {RB-Modulation: Training-Free Personalization of Diffusion Models using Stochastic Optimal Control},
author = {Rout, Litu and Chen, Yujia and Ruiz, Nataniel and Kumar, Abhishek and Caramanis, Constantine and Shakkottai, Sanjay and Chu, Wen-Sheng},
journal = {Proceedings of the International Conference on Learning Representations (ICLR)},
year = {2025},
group = {proceedings},
arxiv = {https://arxiv.org/pdf/2405.17401}
}
We propose Reference-Based Modulation (RB-Modulation), a new plug-and-play solution for training-free personalization of diffusion models. Existing training-free approaches exhibit difficulties in (a) style extraction from reference images in the absence of additional style or content text descriptions, (b) unwanted content leakage from reference style images, and (c) effective composition of style and content.
RB-Modulation is built on a novel stochastic optimal controller where a style descriptor encodes the desired attributes through a terminal cost. The resulting drift not only overcomes the difficulties above, but also ensures high fidelity to the reference style and adheres to the given text prompt. We also introduce a cross-attention-based feature aggregation scheme that allows RB-Modulation to decouple content and style from the reference image. With theoretical justification and empirical evidence, our test-time optimization framework demonstrates precise extraction and control of content and style in a training-free manner. Further, our method allows a seamless composition of content and style, which marks a departure from the dependency on external adapters or ControlNets. See project page (https://rb-modulation.github.io/) for code and further details.
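The stochastic optimal control formulation above can be pictured in generic form (notation mine, not necessarily the paper's exact controller) as

\min_{u} \; \mathbb{E}\left[ \int_0^T \tfrac{1}{2}\|u(X_t, t)\|^2 \, dt + \psi(X_T) \right] \quad \text{subject to} \quad dX_t = \big(f(X_t, t) + u(X_t, t)\big)\, dt + \sigma_t \, dW_t,

where f is the uncontrolled (pretrained) drift and the terminal cost \psi penalizes the discrepancy between a style descriptor of the generated sample X_T and that of the reference image.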
Rout, Litu, Yujia Chen, Abhishek Kumar, Constantine Caramanis, Sanjay Shakkottai, and Wen-Sheng Chu. “Beyond First-Order Tweedie: Solving Inverse Problems Using Latent Diffusion.” Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR), 2024.
@article{rout2024secondorder,
title = {Beyond First-Order Tweedie: Solving Inverse Problems using Latent Diffusion},
author = {Rout, Litu and Chen, Yujia and Kumar, Abhishek and Caramanis, Constantine and Shakkottai, Sanjay and Chu, Wen-Sheng},
journal = {Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR)},
year = {2024},
group = {proceedings},
arxiv = {https://arxiv.org/pdf/2312.00852.pdf}
}
Sampling from the posterior distribution poses a major computational challenge in solving inverse problems using latent diffusion models. Common methods rely on Tweedie’s first-order moments, which are known to induce a quality-limiting bias. Existing second-order approximations are impractical due to prohibitive computational costs, making standard reverse diffusion processes intractable for posterior sampling. This paper introduces the Second-order Tweedie sampler from Surrogate Loss (STSL), a novel sampler that matches the efficiency of first-order Tweedie while keeping the reverse process tractable under a second-order approximation. Our theoretical results reveal that the second-order approximation is lower bounded by our surrogate loss, which requires only O(1) compute using the trace of the Hessian; using this lower bound, we derive a new drift term that makes the reverse process tractable. Our method surpasses SoTA solvers PSLD and P2L, achieving 4X and 8X reductions in neural function evaluations, respectively, while notably enhancing sampling quality on the FFHQ, ImageNet, and COCO benchmarks. In addition, we show that STSL extends to text-guided image editing and addresses the residual distortions that leading text-guided image editing methods exhibit on corrupted images. To the best of our knowledge, this is the first work to offer an efficient second-order approximation for solving inverse problems with latent diffusion and for editing real-world images with corruptions.
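For reference, the first-order Tweedie estimate that the abstract contrasts with, stated for a variance-preserving diffusion with x_t = \sqrt{\bar\alpha_t}\, x_0 + \sqrt{1-\bar\alpha_t}\, \epsilon (notation mine), is

\mathbb{E}[x_0 \mid x_t] = \frac{x_t + (1 - \bar\alpha_t)\, \nabla_{x_t} \log p_t(x_t)}{\sqrt{\bar\alpha_t}}.

Second-order corrections additionally involve the Hessian \nabla^2_{x_t} \log p_t(x_t); the surrogate loss above uses only its trace, which is what keeps the extra per-step compute at O(1).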
Zhuo, Jiacheng, Jeongyeol Kwon, Nhat Ho, and Constantine Caramanis. “On the Computational and Statistical Complexity of over-Parameterized Matrix Sensing.” Journal of Machine Learning Research, 2024.
@article{zhuoSlowMatrix2024,
title = {On the computational and statistical complexity of over-parameterized matrix sensing},
author = {Zhuo, Jiacheng and Kwon, Jeongyeol and Ho, Nhat and Caramanis, Constantine},
journal = {Journal of Machine Learning Research},
year = {2024},
group = {journal},
arxiv = {https://arxiv.org/pdf/2102.02756.pdf}
}
We consider solving the low-rank matrix sensing problem with the Factorized Gradient Descent (FGD) method when the true rank is unknown and over-specified, which we refer to as over-parameterized matrix sensing.
If the ground truth signal X^* ∈ R^{d×d} is of rank r, but we try to recover it using FF^⊤ where F ∈ R^{d×k} and k > r, the existing statistical analysis falls short, due to a flat local curvature of the loss function around the global maxima. By decomposing the factorized matrix F into separate column spaces to capture the effect of the extra ranks, we show that ||F_t F_t^⊤ − X^*||_F^2 converges to a statistical error of order Õ(k d σ^2 / n) after Õ((σ_r / σ) √(n/d)) iterations, where F_t is the output of FGD after t iterations, σ^2 is the variance of the observation noise, σ_r is the r-th largest eigenvalue of X^*, and n is the number of samples. Our results, therefore, offer a comprehensive picture of the statistical and computational complexity of FGD for the over-parameterized matrix sensing problem.
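As a concrete (and purely illustrative) picture of the FGD iteration analyzed here, a minimal numpy sketch; the step size and iteration count are arbitrary rather than the values prescribed by the paper's theory.

import numpy as np

def fgd_matrix_sensing(A, y, d, k, step=0.01, iters=500):
    """Factorized gradient descent for measurements y_i = <A_i, X*> + noise,
    fitting F F^T with an over-specified rank k.  A: (n, d, d), y: (n,)."""
    F = 0.01 * np.random.randn(d, k)                   # small random initialization
    for _ in range(iters):
        X = F @ F.T
        residuals = np.einsum('nij,ij->n', A, X) - y   # <A_i, F F^T> - y_i
        G = np.einsum('n,nij->ij', residuals, A) / len(y)
        G = (G + G.T) / 2                              # symmetrized gradient in X
        F = F - step * (2 * G @ F)                     # chain rule: dL/dF = (dL/dX + dL/dX^T) F
    return F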
Atsidakou, Alexia, Constantine Caramanis, Evangelia Gergatsouli, Orestis Papadigenopoulos, and Christos Tzamos. “Contextual Pandora’s Box.” Association for the Advancement of Artificial Intelligence (AAAI), 2024.
@article{atsidakou24apandora,
title = {Contextual Pandora's Box},
author = {Atsidakou, Alexia and Caramanis, Constantine and Gergatsouli, Evangelia and Papadigenopoulos, Orestis and Tzamos, Christos},
journal = {Association for the Advancement of Artificial Intelligence (AAAI)},
year = {2024},
group = {proceedings},
arxiv = {https://arxiv.org/pdf/2205.13114.pdf}
}
Pandora’s Box is a fundamental stochastic optimization problem, where the decision-maker must find a good alternative while minimizing the search cost of exploring the value of each alternative. In the original formulation, it is assumed that accurate priors are given for the values of all the alternatives, while
recent work studies the online variant of Pandora’s Box where priors are originally unknown.
In this work, we extend Pandora’s Box to the online setting, while incorporating context.
At every round, we are presented with a number of alternatives each having a context, an exploration cost and an unknown value drawn from an unknown prior distribution that may change at every round. Our main result is a no-regret algorithm that performs comparably well to the optimal algorithm which knows all prior distributions exactly. Our algorithm works even in the bandit setting where the algorithm never learns the values of the alternatives that were not explored.
The key technique that enables our result is a novel modification of the realizability condition in contextual bandits, which connects a context to the reservation value of the corresponding distribution rather than to its mean.
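In Weitzman's classical formulation, the reservation value of an alternative with exploration cost c and value distribution V is the threshold σ solving E[max(V − σ, 0)] = c. A minimal Monte-Carlo/bisection sketch (function name, bounds, and tolerance are illustrative):

import numpy as np

def reservation_value(samples, cost, tol=1e-6):
    """Bisection for sigma solving E[max(V - sigma, 0)] = cost, with the
    expectation approximated by an empirical sample of V."""
    samples = np.asarray(samples, dtype=float)
    lo, hi = samples.min() - 1.0, samples.max()
    while hi - lo > tol:
        mid = (lo + hi) / 2
        surplus = np.maximum(samples - mid, 0).mean()
        if surplus > cost:
            lo = mid   # expected surplus still exceeds the cost: raise the threshold
        else:
            hi = mid
    return (lo + hi) / 2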
Tsikouras, Nikos, Constantine Caramanis, and Christos Tzamos. “Optimization Can Learn Johnson Lindenstrauss Embeddings.” Advances in Neural Information Processing Systems (NeurIPS), 2024.
@article{tsikouras2024optimization,
title = {Optimization Can Learn Johnson Lindenstrauss Embeddings},
author = {Tsikouras, Nikos and Caramanis, Constantine and Tzamos, Christos},
journal = {Advances in Neural Information Processing Systems (NeurIPS)},
group = {proceedings},
arxiv = {https://arxiv.org/pdf/2412.07242},
year = {2024}
}
Embeddings play a pivotal role across various disciplines, offering compact representations of complex data structures. Randomized methods like Johnson-Lindenstrauss (JL) provide state-of-the-art and essentially unimprovable theoretical guarantees for achieving such representations. These guarantees are worst-case and in particular, neither the analysis, nor the algorithm, takes into account any potential structural information of the data. The natural question is: must we randomize? Could we instead use an optimization-based approach, working directly with the data? A first answer is no: as we show, the distance-preserving objective of JL has a non-convex landscape over the space of projection matrices, with many bad stationary points. But this is not the final answer.
We present a novel method motivated by diffusion models, that circumvents this fundamental challenge: rather than performing optimization directly over the space of projection matrices, we use optimization over the larger space of random solution samplers, gradually reducing the variance of the sampler. We show that by moving through this larger space, our objective converges to a deterministic (zero variance) solution, avoiding bad stationary points.
This method can also be seen as an optimization-based derandomization approach and is an idea and method that we believe can be applied to many other problems.
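For context, here is a minimal numpy sketch of the pairwise-distortion quantity that a JL embedding controls, together with the classical randomized Gaussian construction that serves as the baseline; it does not implement the paper's optimization over random solution samplers.

import numpy as np

def jl_distortion(X, P):
    """Worst-case pairwise squared-distance distortion of x -> P x.
    X: (n, d) data points, P: (m, d) projection matrix."""
    n = X.shape[0]
    worst = 0.0
    for i in range(n):
        for j in range(i + 1, n):
            diff = X[i] - X[j]
            orig = np.dot(diff, diff)
            proj = np.dot(P @ diff, P @ diff)
            worst = max(worst, abs(proj / orig - 1.0))
    return worst

# Classical randomized baseline: a scaled Gaussian matrix with
# m = O(log(n) / eps^2) rows achieves distortion eps, independently of the data.
d, m, n = 256, 32, 100
X = np.random.randn(n, d)
P = np.random.randn(m, d) / np.sqrt(m)
print(jl_distortion(X, P))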
Kwon, Jeongyeol, Yonathan Efroni, Shie Mannor, and Constantine Caramanis. “RL in Latent MDPs Is Tractable: Online Guarantees via Off-Policy Evaluation.” Advances in Neural Information Processing Systems (NeurIPS), 2024.
@article{kwon24lmdp,
title = {RL in Latent MDPs is Tractable: Online Guarantees via Off-Policy Evaluation},
author = {Kwon, Jeongyeol and Efroni, Yonathan and Mannor, Shie and Caramanis, Constantine},
journal = {Advances in Neural Information Processing Systems (NeurIPS)},
year = {2024},
group = {proceedings},
arxiv = {https://arxiv.org/pdf/2406.01389}
}
In many real-world decision problems there is partially observed, hidden or latent information that remains fixed throughout an interaction. Such decision problems can be modeled as Latent Markov Decision Processes (LMDPs), where a latent variable is selected at the beginning of an interaction and is not disclosed to the agent. In the last decade, there has been significant progress in solving LMDPs under different structural assumptions. However, for general LMDPs, there is no known learning algorithm that provably matches the existing lower bound (Kwon et al., 2021). We introduce the first sample-efficient algorithm for LMDPs without any additional structural assumptions. Our result builds on a new perspective on the role of off-policy evaluation guarantees and coverage coefficients in LMDPs, a perspective that has been overlooked in the context of exploration in partially observed environments. Specifically, we establish a novel off-policy evaluation lemma and introduce a new coverage coefficient for LMDPs. Then, we show how these can be used to derive near-optimal guarantees for an optimistic exploration algorithm. These results, we believe, can be valuable for a wide range of interactive learning problems beyond LMDPs, and especially for partially observed environments.
———. “Prospective Side Information for Latent MDPs.” In International Conference on Machine Learning (ICML). PMLR, 2024.
@inproceedings{kwon2024prospective,
title = {Prospective side information for latent {MDP}s},
author = {Kwon, Jeongyeol and Efroni, Yonathan and Mannor, Shie and Caramanis, Constantine},
booktitle = {International Conference on Machine Learning (ICML)},
pages = {},
year = {2024},
organization = {PMLR},
group = {proceedings},
arxiv = {https://arxiv.org/pdf/2310.07596}
}
In many interactive decision-making settings, there is latent and unobserved information that remains fixed. Consider, for example, a dialogue system, where complete information about a user, such as the user’s preferences, is not given. In such an environment, the latent information remains fixed throughout each episode, since the identity of the user does not change during an interaction. This type of environment can be modeled as a Latent Markov Decision Process (LMDP), a special instance of Partially Observed Markov Decision Processes (POMDPs). Previous work established exponential lower bounds in the number of latent contexts for the LMDP class. This raises the question: under which natural assumptions can a near-optimal policy of an LMDP be learned efficiently? In this work, we study the class of LMDPs with prospective side information, in which an agent receives additional, weakly revealing, information on the latent context at the beginning of each episode. We show that, surprisingly, this problem is not captured by contemporary settings and algorithms designed for partially observed environments. We then establish that any sample-efficient algorithm must suffer at least Ω(K^{2/3}) regret, as opposed to the standard Ω(\sqrt{K}) lower bound, and design an algorithm with a matching upper bound.
Rout, Litu, Advait Parulekar, Constantine Caramanis, and Sanjay Shakkottai. “A Theoretical Justification for Image Inpainting Using Denoising Diffusion Probabilistic Models.” Preprint, 2023.
@article{rout23theoretical,
title = {A Theoretical Justification for Image Inpainting using Denoising Diffusion Probabilistic Models},
author = {Rout, Litu and Parulekar, Advait and Caramanis, Constantine and Shakkottai, Sanjay},
journal = {preprint},
year = {2023},
group = {misc},
arxiv = {https://arxiv.org/pdf/2302.01217.pdf}
}
We provide a theoretical justification for sample recovery using diffusion-based image inpainting in a linear model setting. While most inpainting algorithms require retraining with each new mask, we prove that diffusion-based inpainting generalizes well to unseen masks without retraining. We analyze a recently proposed and popular diffusion-based inpainting algorithm called RePaint, and show that it has a bias due to misalignment that hampers sample recovery even in a two-state diffusion process. Motivated by our analysis, we propose a modified RePaint algorithm, which we call RePaint^+, that provably recovers the underlying true sample and enjoys a linear rate of convergence. It achieves this by rectifying the misalignment error present in the drift and dispersion of the reverse process. To the best of our knowledge, this is the first linear convergence result for a diffusion-based image inpainting algorithm.
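As a schematic of the masked composition at the heart of RePaint-style inpainting (purely illustrative: reverse_step, alpha_bar, and the tensor shapes are placeholders, and this is the baseline update rather than the corrected RePaint^+ step):

import torch

def repaint_step(x_t, t, x0_known, mask, reverse_step, alpha_bar):
    """One RePaint-style reverse step; mask == 1 marks known pixels of x0_known."""
    # Known region: diffuse the reference image forward to noise level t-1.
    noise = torch.randn_like(x0_known)
    x_known = alpha_bar[t - 1].sqrt() * x0_known + (1 - alpha_bar[t - 1]).sqrt() * noise
    # Unknown region: one ordinary reverse (denoising) step from x_t.
    x_unknown = reverse_step(x_t, t)
    # Compose the two branches; the misalignment between them is the source of
    # the bias that the paper's analysis identifies.
    return mask * x_known + (1 - mask) * x_unknown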