AI Researcher · PhD Candidate · CS · Kansas State University

Talha Zaidi

I am an AI researcher and Ph.D. Candidate in Computer Science at Kansas State University, advised by Prof. Arslan Munir in the ISCAAS Lab. I am expected to graduate in August 2026. Earlier, I completed my M.S. in Biomedical Engineering at Istanbul Medipol University, Turkey, and my B.S. in Mechatronics and Control Engineering at UET Lahore.

Research Interests My research develops reinforcement learning, generative modeling, and foundation-model adaptation methods for reliable decision-making in complex real-world systems. I focus on sequential decision modeling, policy refinement, and latent representation learning for offline/online RL, long-horizon planning, and reasoning under uncertainty, with applications in embodied AI, robotics, autonomous systems, and cyber-physical intelligence.

10+Publications
3.9GPA
3Grant Agencies
2026Aug PhD Graduate
Talha Zaidi
01

News

Recent Updates

Selected updates
  • June 2026: Selected to attend the 2026 IEEE Summer School on Telerobotics and Cyborg Technologies at Rochester Institute of Technology (RIT).
  • May 2026: Our recent RL robotics paper GRALP is accepted at 35th International Joint Conference on Artificial Intelligence (IJCAI) 2026.
  • April 2026: GRAZE is accepted at CVsports @ CVPR 2026 on grounded and motion-aware video understanding.
  • Mar 2026: Selected as Graduate Student of the Month at Kansas State University for research and scholarly contributions.
  • Mar 2026: Received a Best Presentation Award at APEC 2026 for work on resilient coordination and security of grid-forming inverter networks.
  • Feb 2026: CRISP is submitted to the IROS 2026 Main Track on long-horizon offline planning and context-robust skill inference.
  • 2025–2026: The research portfolio is expanding towards robotics, vision-language-action models, and cyber-physical systems. .
02

Multidisciplinary Research

The center remains fixed on Generative AI + Reinforcement Learning, while the surrounding domains show where these core methods are applied and extended. The goal is not to present unrelated topics, but to show a connected research program spanning robotics, cyber-physical systems, LLM and VLM agents, health AI, autonomous systems, and resilient infrastructure through shared ideas in planning, representation learning, robustness, and decision-making.

🛰 NASA
Spacecraft Autonomy
AI Trajectory Optimization
⚡ Dept. of Energy
Smart Grid Security
Resilient Infrastructure
🔬 TUBITAK
Brain-Machine Interface
Neuroprosthetics Control
03

Research & Publications

GRALP
International Joint Conference on Artificial Intelligence (IJCAI) 2026- AI and Robotics Track Accepted
GRALP: Generative Representation for Action Refinement and Latent Planning in Offline RL

Latent-skill offline RL framework for contact-rich robotic manipulation. Improves long-horizon planning while preserving behavior support and low-level execution quality across D4RL, Adroit, and RoboSuite benchmarks.

GRAZE
CVsports @ CVPR 2026 Accepted
GRAZE: Grounded Refinement and Motion-Aware Zero-Shot Event Recognition

Zero-shot sports event recognition framework combining grounded visual refinement with motion-aware features for fine-grained video understanding.

CRISP
IROS 2026- Main Track Submitted
CRISP: Context-Robust Inpainting for Long-Horizon Skill Planning under Partial Observability

Masked latent-skill inference for robust long-horizon planning under missing or degraded context. Focuses on partial observability in offline reinforcement learning and context-robust skill sequencing.

SA3C low-thrust trajectory optimization
IEEE Transaction on Aerospace and Electronic Systems 2025 🛰 NASA Funded
Autonomous Planning of Low-Thrust Geocentric and Cislunar Spacecraft Trajectories Using Reinforcement Learning

Developed the SA3C algorithm with an attention mechanism to improve sample efficiency and decision quality for low-thrust spacecraft trajectory optimization in geocentric and cislunar missions.

Automated trajectory planning with cascaded DRL
IEEE Magazine on Aerospace and Electronic Systems 2025
Automated Trajectory Planning: A Cascaded Deep Reinforcement Learning Approach for Low-Thrust Spacecraft Orbit-Raising

Developed a novel Cascaded Deep Reinforcement Learning (CDRL) approach to optimize low-thrust spacecraft trajectory planning, significantly improving time-efficient orbit transfers in complex multi-body environments for transfers to GEO and NRHO.

Smart grid resilience
APEC 2026 Conference Paper
An Integrated AI-Based Approach for Virtual-Impedance Scheduling and Cyber-Attack Mitigation in Smart-Grid Environments

Introduced a resilient neural coordination framework for grid-forming inverter networks that maintains stability and coordination under cyberattacks in smart-grid environments.

Low-thrust orbit-raising
AIAA SCITECH 2024
Machine Learning Assisted Low-Thrust Orbit-Raising: A Comparative Assessment of Sequential Algorithm and Deep RL

Developed a machine-learning-assisted method for optimizing low-thrust orbit-raising trajectories, integrating a sequential algorithm with a neural network-based high-level planner and benchmarking it against deep reinforcement learning approaches for geostationary and halo-orbit missions.

Cascaded deep RL low-thrust orbit transfer
IEEE Access 2023
Cascaded Deep Reinforcement Learning-Based Multi-Revolution Low-Thrust Spacecraft Orbit-Transfer

Developed a cascaded DRL model for optimizing long-duration, low-thrust spacecraft transfers from GTO to GEO. Guided by a gradient-aided reward function, the method significantly reduces transfer time and improves spacecraft autonomy in complex multi-revolution transfers.

Knowledge Distillation
Springer Nature 2025 Submitted
Improving Adversarial Robustness Through Adaptive Learning-Driven Multi-Teacher Knowledge Distillation

Adaptive multi-teacher knowledge distillation framework aimed at improving adversarial robustness beyond standard single-teacher or conventional training approaches.

Mode-guided feature augmentation for domain generalization Mode-guided feature augmentation for domain generalization
BMVC 2021
Mode-Guided Feature Augmentation for Domain Generalization

Proposed a simple and efficient domain generalization approach that augments source domains by exploring dominant modes of variation in the feature space, improving generalization to unseen domains across standard DG benchmarks.

Earthquake LANL
IEEE SIU 2020
Learned vs. Hand-Crafted Features for Deep Learning Based Aperiodic Laboratory Earthquake Time-Prediction

Developed and compared machine learning models for laboratory earthquake prediction using LANL data, where CNN-LSTM models improved time-to-failure prediction over hand-crafted approaches.

Neuroprosthetics BMI Neuroprosthetics setup
Journal of Neuroscience Methods 🔬 TUBITAK Funded
A Behavioral Paradigm for Cortical Control of a Robotic Actuator by Freely Moving Rats in a One-Dimensional Two-Target Reaching Task

Demonstrated trajectory-based neuroprosthetic control in rodents using primary motor cortex activity, providing a cost-effective platform for studying brain-machine interfaces and neural control.

04

Experience

Graduate Research AssistantISCAAS Lab · Kansas State UniversityAug 2021 — Present
  • Developed GRALP — a latent-skill offline RL framework for contact-rich robotic manipulation; ~8% higher avg. performance on D4RL, Adroit, RoboSuite. Related work includes submissions to IJCAI 2026 and IROS 2026, alongside accepted work at CVsports @ CVPR 2026.
  • Led NASA-funded SA3C project: attention-based RL agent for low-thrust spacecraft trajectory optimization, reducing transfer time by 10% over strong baselines. Published in IEEE Transactions on Aerospace and Electronic Systems (2025).
  • Engineered AI-driven security controller for smart grids (DOE-funded): high attack-detection rates, low false positives on inverter-level anomalies, faster post-disturbance recovery in MATLAB/Simulink + PyTorch simulations.
  • Authoring research proposals to secure federal and industry funding for robust RL and autonomous systems research.
🛰 NASA⚡ DOE
Graduate Research AssistantNeuroprosthetics Group · Istanbul Medipol UniversityOct 2018 — Dec 2020
  • Engineered a cortically-driven robotic arm control system using primary motor cortex signals from freely moving rats. First trajectory-based neuroprosthetic control in rodents, >78% accuracy. Published in Journal of Neuroscience Methods.
  • Built CNN-LSTM models for aperiodic earthquake time-prediction using LANL data, significantly outperforming traditional signal-processing baselines. Published at IEEE SIU 2020.
🔬 TUBITAK
Field EngineerTetraPakApr 2016 — Jun 2018
  • Technical support and troubleshooting for complex automated industrial systems, building deep familiarity with real-world control and automation constraints.
05

Skills

RL & Decision-Making
Offline RL Online RL Imitation Learning Policy Optimization Policy Refinement Long-Horizon Planning Latent Skill Learning Actor-Critic Methods
Foundation Models & Adaptation
Vision-Language Models VLA Models Robot Foundation Models CLIP LoRA / PEFT Preference-Based Fine-Tuning RLHF / GRPO
Generative & Representation Learning
Diffusion Models VAEs Latent Representations Sequence Modeling World Models Masked Modeling Domain Generalization
Robotics & Embodied AI
Robot Learning Contact-Rich Manipulation Dexterous Manipulation MuJoCo RoboSuite D4RL / Adroit Gazebo ROS 2
Programming & ML Tools
Python PyTorch C++ Linux CUDA / GPU Training Hugging Face / Transformers Scalable ML Experiments MATLAB / Simulink Weights & Biases Docker
06

Awards & Service

🏆
Best Presentation Award — APEC 2026IEEE Applied Power Electronics Conference & Exposition, 2026
Graduate Student of the Month — March 2026Kansas State University, Dept. of Computer Science
🎓
Funded PhD Scholarship & Research AssistantshipKansas State University, 2021 — Present
🎓
Funded Master's Scholarship & Research AssistantshipIstanbul Medipol University, 2018 — 2020
Invited Peer Reviewer
IEEE Trans. Cloud ComputingIEEE Trans. Aersopace & Elecronics systemsIEEE AccessIEEE Jouranl. STARSWiley Engineering Reports
Leadership & Service
President · KSU CS Graduate Students Assoc. (2023–24)Graduate Mentor · Prospective PhD StudentsPresident · Mechatronics Club UET Lahore (2013–14)