•  66
    Privacy-Preserving Sequential Decision Systems for Regulated Personalization
    with Narender Bitla, Naga Surya Pasupuleti, and Sumit Saha
    Regulated Personalization Systems Forum. 2026.
    Regulated personalization systems must decide what information, offer, limit, explanation, or next action to present while preserving privacy and controlling downstream harm. Conventional sequential recommenders optimize engagement over behavior traces, but regulated settings also require privacy budgets, evidence constraints, risk limits, and auditable tool use. This paper proposes Privacy-Preserving Sequential Decision Systems (PPSDS), a synthetic architecture for regulated personalization tha…Read more
  •  89
    Wide-area grid monitoring increasingly depends on high-rate phasor measurement unit (PMU) streams, cloud-native telemetry fabrics, predictive calibration models, and operator-facing decision support. These capabilities are usually deployed as separate systems: PMU analytics detect events, maintenance models estimate instrument risk, forecasting models project future load or stability margins, and schedulers route workloads across edge and cloud resources. This paper proposes an Edge-to-Cloud PMU…Read more
  •  89
    Governed Knowledge Base Integrity Workshop
    with Narender Bitla, Murali Shankar Dulam, and Naga Surya Pasupuleti
    Governed Knowledge Base Integrity Workshop. 2026.
    Enterprise knowledge bases encode more than facts. They also encode policy preferences, operational assumptions, persuasive phrases, reviewer norms, and routing defaults that shape downstream retrieval-augmented generation and recommendation behavior. This paper proposes Ideological Drift Detection for Governed Knowledge Bases (IDD-GKB), a synthetic architecture for detecting when recurring policy narratives, compliance phrases, documentation claims, and retrieved evidence gradually change meani…Read more
  •  69
    Enterprise compliance documentation is often written after systems have already changed, which leaves policy narratives, architecture diagrams, runbooks, and audit evidence out of sync with executable behavior. This paper proposes Retrieval-Grounded Documentation Agents (RGDA), a synthetic architecture in which bounded documentation agents maintain compliance text by combining abstract-syntax-tree-aware reflexion, hybrid semantic-relational evidence packs, distributed retrieval-augmented generat…Read more
  •  79
    Wide-area power-grid observability depends on phasor measurement unit (PMU) streams, calibration-aware instrumentation, and fast operator workflows, but the analytic stack that joins these elements is often split between stream processors, model-serving systems, privacy filters, and manual maintenance queues. This paper proposes Self-Governing Grid Intelligence (SGI), a synthetic cyber-physical stream-processing architecture in which bounded agents coordinate PMU telemetry, calibration-risk tran…Read more
  •  91
    GPU-Accelerated Parallel Stochastic Gradient Descent
    with Prithvi Krishna Gattamaneni
    Gpus: Architecture and Programming. 2016.
    This paper studies a simplified instance of the design question raised by GPU A-SGD: how should stochastic optimization be organized so that device-level parallelism improves throughput without destabilizing convergence? Motivated by the asynchronous multi-GPU training framework of Paine et al., we examine a controlled linear-regression setting in which workers process disjoint data shards, compute stochastic gradient updates in parallel, and combine local models through either plain averaging o…Read more
  •  68
    This paper studies automatic attribute extraction and sentiment assignment from product reviews. The task is to recover product aspects from free-form customer reviews and label them as positive, negative, or irrelevant. The modeling progression runs from two lightweight heuristics to two discriminative sequence models: a noun-driven bag-of-words baseline, a nearest-adjective sentiment heuristic, a Maximum Entropy Markov Model (MEMM), and a Conditional Random Field (CRF). The empirical pattern i…Read more
  •  68
    Regulated analytics teams increasingly run data pipelines, retrieval systems, and machine learning workloads across multiple cloud providers, but the operational control plane remains split across pipeline schedulers, API gateways, privacy tooling, and audit systems. This paper proposes Policy-Verified Agentic DataOps (PVA-DataOps), a synthetic reference architecture in which bounded agents propose operational actions while a policy verifier, evidence layer, and contract-mediated gateway decide …Read more
  •  61
    Forecasting when reviewers will agree or disagree is useful for peer-review management because high-disagreement submissions often require additional discussion, stronger area-chair intervention, or more careful reviewer assignment. Prior computational work has mostly focused on predicting final decisions or review scores from papers and completed reviews, while a separate legal machine learning line introduced citation-propagated phrase scoring to model agreement through "memes" that spread ove…Read more
  •  87
    Predicting disagreement on appellate panels is a useful problem for empirical legal studies because dissent is rare, institutionally important, and often tied to persistent differences in judicial style and precedent use. Prior English-language work has shown that structured case features, seating patterns, and citation-derived phrase statistics can help predict vote alignment in the U.S. Courts of Appeals, while recent legal NLP work has introduced domain-specific language models and stronger l…Read more
  •  74
    Semi-supervised image classification seeks to improve predictive accuracy by combining a small labeled set with a larger unlabeled pool. Currently, deep semi-supervised learning has produced strong benchmark results through entropy-based objectives, pseudo-labeling, self-ensembling, and perturbation consistency, yet a practical tension remains: methods that use every unlabeled example can absorb harmful targets, while simple pseudo-labeling can be unstable when model confidence is miscalibrated.…Read more
  •  56
    Machine Learning for Long-Term Forecasting from Historical Data
    with Prithvi Krishna Gattamaneni and Nachiappan Chockalingam
    Foundations of Machine Learning. 2017.
    This paper presents a two-stage forecasting framework for future carbon dioxide (CO2) emissions using historical country-level indicators. The method first extrapolates the future trajectories of the input indicators and then predicts future CO2 emissions with nonlinear regressors trained on the most recent segment of the time series. Two design choices define the approach: recency-weighted training, which increases the influence of modern observations, and a boundary-consistency rule that selec…Read more