PhD Candidate · CS · Kansas State University

Talha Zaidi

I am a Ph.D. Candidate in Computer Science at Kansas State University, under the supervision of Prof. Arslan Munir, working as a Graduate Research Assistant at the Intelligent Systems, Computer Architecture, Analytics and Security Laboratory ( ISCAAS Lab). Earlier, I completed my M.S. in Biomedical Engineering at Istanbul Medipol University, Turkey, and my B.S. in Mechatronics and Control Engineering at UET Lahore.

Research Interests My primary research focuses on reinforcement learning and generative modeling, building agents that learn reliable, long-horizon behavior from imperfect data through latent skill learning, planning under partial observability, and diffusion-based policy refinement. I am also interested in connecting these ideas with vision-language models and LLM-guided planning for embodied agents, alongside applications in Robotics, Health, and Cyber-Physical Systems — reflecting a broader commitment to principled AI with real-world impact.

10+Publications
3.9GPA
3Grant Agencies
2026PhD Expected
Talha Zaidi
01

News

Recent Updates

Selected updates
  • April 2026: GRAZE is accepted at CVsports @ CVPR 2026 on grounded and motion-aware video understanding.
  • Mar 2026: Selected as Graduate Student of the Month at Kansas State University for research and scholarly contributions.
  • Mar 2026: Received a Best Presentation Award at APEC 2026 for work on resilient coordination and security of grid-forming inverter networks.
  • Feb 2026: CRISP is submitted to the IROS 2026 Main Track on long-horizon offline planning and context-robust skill inference.
  • Jan 2026: GRALP is submitted to the IJCAI 2026 AI and Robotics Track.
  • 2025–2026: The research portfolio is expanding toward LLMs, vision-language and embodied agents, robotics, cyber-physical systems, and health AI.
02

Multidisciplinary Research

The center remains fixed on Generative AI + Reinforcement Learning, while the surrounding domains show where these core methods are applied and extended. The goal is not to present unrelated topics, but to show a connected research program spanning robotics, cyber-physical systems, LLM and VLM agents, health AI, autonomous systems, and resilient infrastructure through shared ideas in planning, representation learning, robustness, and decision-making.

🛰 NASA
Spacecraft Autonomy
AI Trajectory Optimization
⚡ Dept. of Energy
Smart Grid Security
Resilient Infrastructure
🔬 TUBITAK
Brain-Machine Interface
Neuroprosthetics Control
03

Research & Publications

GRAZE
CVsports @ CVPR 2026 Accepted
GRAZE: Grounded Refinement and Motion-Aware Zero-Shot Event Recognition

Zero-shot sports event recognition framework combining grounded visual refinement with motion-aware features for fine-grained video understanding.

GRALP
IJCAI 2026- AI and Robotics Track Submitted
GRALP: Generative Representation for Action Refinement and Latent Planning in Offline RL

Latent-skill offline RL framework for contact-rich robotic manipulation. Improves long-horizon planning while preserving behavior support and low-level execution quality across D4RL, Adroit, and RoboSuite benchmarks.

CRISP
IROS 2026- Main Track Submitted
CRISP: Context-Robust Inpainting for Long-Horizon Skill Planning under Partial Observability

Masked latent-skill inference for robust long-horizon planning under missing or degraded context. Focuses on partial observability in offline reinforcement learning and context-robust skill sequencing.

SA3C low-thrust trajectory optimization
IEEE Transaction on Aerospace and Electronic Systems 2025 🛰 NASA Funded
Autonomous Planning of Low-Thrust Geocentric and Cislunar Spacecraft Trajectories Using Reinforcement Learning

Developed the SA3C algorithm with an attention mechanism to improve sample efficiency and decision quality for low-thrust spacecraft trajectory optimization in geocentric and cislunar missions.

Automated trajectory planning with cascaded DRL
IEEE Magazine on Aerospace and Electronic Systems 2025
Automated Trajectory Planning: A Cascaded Deep Reinforcement Learning Approach for Low-Thrust Spacecraft Orbit-Raising

Developed a novel Cascaded Deep Reinforcement Learning (CDRL) approach to optimize low-thrust spacecraft trajectory planning, significantly improving time-efficient orbit transfers in complex multi-body environments for transfers to GEO and NRHO.

Smart grid resilience
APEC 2026 Conference Paper
An Integrated AI-Based Approach for Virtual-Impedance Scheduling and Cyber-Attack Mitigation in Smart-Grid Environments

Introduced a resilient neural coordination framework for grid-forming inverter networks that maintains stability and coordination under cyberattacks in smart-grid environments.

Low-thrust orbit-raising
AIAA SCITECH 2024
Machine Learning Assisted Low-Thrust Orbit-Raising: A Comparative Assessment of Sequential Algorithm and Deep RL

Developed a machine-learning-assisted method for optimizing low-thrust orbit-raising trajectories, integrating a sequential algorithm with a neural network-based high-level planner and benchmarking it against deep reinforcement learning approaches for geostationary and halo-orbit missions.

Cascaded deep RL low-thrust orbit transfer
IEEE Access 2023
Cascaded Deep Reinforcement Learning-Based Multi-Revolution Low-Thrust Spacecraft Orbit-Transfer

Developed a cascaded DRL model for optimizing long-duration, low-thrust spacecraft transfers from GTO to GEO. Guided by a gradient-aided reward function, the method significantly reduces transfer time and improves spacecraft autonomy in complex multi-revolution transfers.

Knowledge Distillation
Springer Nature 2025 Submitted
Improving Adversarial Robustness Through Adaptive Learning-Driven Multi-Teacher Knowledge Distillation

Adaptive multi-teacher knowledge distillation framework aimed at improving adversarial robustness beyond standard single-teacher or conventional training approaches.

Mode-guided feature augmentation for domain generalization Mode-guided feature augmentation for domain generalization
BMVC 2021
Mode-Guided Feature Augmentation for Domain Generalization

Proposed a simple and efficient domain generalization approach that augments source domains by exploring dominant modes of variation in the feature space, improving generalization to unseen domains across standard DG benchmarks.

Earthquake LANL
IEEE SIU 2020
Learned vs. Hand-Crafted Features for Deep Learning Based Aperiodic Laboratory Earthquake Time-Prediction

Developed and compared machine learning models for laboratory earthquake prediction using LANL data, where CNN-LSTM models improved time-to-failure prediction over hand-crafted approaches.

Neuroprosthetics BMI Neuroprosthetics setup
Journal of Neuroscience Methods 🔬 TUBITAK Funded
A Behavioral Paradigm for Cortical Control of a Robotic Actuator by Freely Moving Rats in a One-Dimensional Two-Target Reaching Task

Demonstrated trajectory-based neuroprosthetic control in rodents using primary motor cortex activity, providing a cost-effective platform for studying brain-machine interfaces and neural control.

04

Experience

Graduate Research AssistantISCAAS Lab · Kansas State UniversityAug 2021 — Present
  • Developed GRALP — a latent-skill offline RL framework for contact-rich robotic manipulation; ~8% higher avg. performance on D4RL, Adroit, RoboSuite. Related work includes submissions to IJCAI 2026 and IROS 2026, alongside accepted work at CVsports @ CVPR 2026.
  • Led NASA-funded SA3C project: attention-based RL agent for low-thrust spacecraft trajectory optimization, reducing transfer time by 10% over strong baselines. Published in IEEE Transactions on Aerospace and Electronic Systems (2025).
  • Engineered AI-driven security controller for smart grids (DOE-funded): high attack-detection rates, low false positives on inverter-level anomalies, faster post-disturbance recovery in MATLAB/Simulink + PyTorch simulations.
  • Authoring research proposals to secure federal and industry funding for robust RL and autonomous systems research.
🛰 NASA⚡ DOE
Graduate Research AssistantNeuroprosthetics Group · Istanbul Medipol UniversityOct 2018 — Dec 2020
  • Engineered a cortically-driven robotic arm control system using primary motor cortex signals from freely moving rats. First trajectory-based neuroprosthetic control in rodents, >78% accuracy. Published in Journal of Neuroscience Methods.
  • Built CNN-LSTM models for aperiodic earthquake time-prediction using LANL data, significantly outperforming traditional signal-processing baselines. Published at IEEE SIU 2020.
🔬 TUBITAK
Field EngineerTetraPakApr 2016 — Jun 2018
  • Technical support and troubleshooting for complex automated industrial systems, building deep familiarity with real-world control and automation constraints.
05

Skills

Reinforcement Learning
Offline RLOnline RLActor-CriticConservative Q-LearningLatent Skill LearningPolicy RefinementRLHFMulti-Agent RL
Large Language Models
Fine-Tuning (LoRA / QLoRA)RAG PipelinesPrompt EngineeringInstruction TuningLLM-as-PlannerTool-Augmented LLMsAlignment & SafetyRLHF for LLMs
Generative & Representation
Diffusion ModelsLatent DiffusionVAEsTransformersSequence ModelingDomain GeneralizationMasked Modeling
Robotics & Autonomy
MuJoCoRoboSuiteD4RL / AdroitTrajectory OptimizationModel Predictive ControlValue-Guided PlanningBrain-Machine Interface
Programming & Tools
PyTorchPythonC++LinuxCUDA / GPU TrainingMATLAB / SimulinkHuggingFaceWeights & BiasesOpenAI GymDocker
06

Awards & Service

🏆
Best Presentation Award — APEC 2026IEEE Applied Power Electronics Conference & Exposition, 2026
Graduate Student of the Month — March 2026Kansas State University, Dept. of Computer Science
🎓
Funded PhD Scholarship & Research AssistantshipKansas State University, 2021 — Present
🎓
Funded Master's Scholarship & Research AssistantshipIstanbul Medipol University, 2018 — 2020
Invited Peer Reviewer
IEEE Trans. Cloud ComputingIEEE AccessIEEE Jouranl. STARSWiley Engineering Reports
Leadership & Service
President · KSU CS Graduate Students Assoc. (2023–24)Graduate Mentor · Prospective PhD StudentsPresident · Mechatronics Club UET Lahore (2013–14)