AI Researcher · PhD Candidate · CS · Kansas State University

Talha Zaidi

I am an AI researcher and Ph.D. Candidate in Computer Science at Kansas State University, advised by Prof. Arslan Munir in the ISCAAS Lab. I am expected to graduate in August 2026. Earlier, I completed my M.S. in Biomedical Engineering at Istanbul Medipol University, Turkey, and my B.S. in Mechatronics and Control Engineering at UET Lahore.

Research Interests My research develops reinforcement learning, generative modeling, and foundation-model adaptation methods for reliable decision-making in complex real-world systems. I focus on sequential decision modeling, policy refinement, and latent representation learning for offline/online RL, long-horizon planning, and reasoning under uncertainty, with applications in embodied AI, robotics, autonomous systems, and cyber-physical intelligence.

Email Me Google Scholar GitHub LinkedIn CV ↓

10+Publications

3.9GPA

3Grant Agencies

2026Aug PhD Graduate

News

Recent Updates

Selected updates

•June 2026: Selected to attend the 2026 IEEE Summer School on Telerobotics and Cyborg Technologies at Rochester Institute of Technology (RIT).
•May 2026: Our recent RL robotics paper GRALP is accepted at 35th International Joint Conference on Artificial Intelligence (IJCAI) 2026.
•April 2026: is accepted at CVsports @ CVPR 2026 on grounded and motion-aware video understanding.
•Mar 2026: Selected as Graduate Student of the Month at Kansas State University for research and scholarly contributions.
•Mar 2026: Received a Best Presentation Award at APEC 2026 for work on resilient coordination and security of grid-forming inverter networks.
•Feb 2026: CRISP is submitted to the IROS 2026 Main Track on long-horizon offline planning and context-robust skill inference.
•2025–2026: The research portfolio is expanding towards robotics, vision-language-action models, and cyber-physical systems. .

Multidisciplinary Research

The center remains fixed on Generative AI + Reinforcement Learning, while the surrounding domains show where these core methods are applied and extended. The goal is not to present unrelated topics, but to show a connected research program spanning robotics, cyber-physical systems, LLM and VLM agents, health AI, autonomous systems, and resilient infrastructure through shared ideas in planning, representation learning, robustness, and decision-making.

🛰 NASA

Spacecraft Autonomy
AI Trajectory Optimization

⚡ Dept. of Energy

Smart Grid Security
Resilient Infrastructure

🔬 TUBITAK

Brain-Machine Interface
Neuroprosthetics Control

Research & Publications

International Joint Conference on Artificial Intelligence (IJCAI) 2026- AI and Robotics Track Accepted

GRALP: Generative Representation for Action Refinement and Latent Planning in Offline RL

Latent-skill offline RL framework for contact-rich robotic manipulation. Improves long-horizon planning while preserving behavior support and low-level execution quality across D4RL, Adroit, and RoboSuite benchmarks.

↗ Paper ↗ Code

CVsports @ CVPR 2026 Accepted

GRAZE: Grounded Refinement and Motion-Aware Zero-Shot Event Recognition

Zero-shot sports event recognition framework combining grounded visual refinement with motion-aware features for fine-grained video understanding.

↗ Paper ↗ Code

IROS 2026- Main Track Submitted

CRISP: Context-Robust Inpainting for Long-Horizon Skill Planning under Partial Observability

Masked latent-skill inference for robust long-horizon planning under missing or degraded context. Focuses on partial observability in offline reinforcement learning and context-robust skill sequencing.

↗ Paper ↗ Code

IEEE Transaction on Aerospace and Electronic Systems 2025 🛰 NASA Funded

Autonomous Planning of Low-Thrust Geocentric and Cislunar Spacecraft Trajectories Using Reinforcement Learning

Developed the SA3C algorithm with an attention mechanism to improve sample efficiency and decision quality for low-thrust spacecraft trajectory optimization in geocentric and cislunar missions.

↗ Paper ↗ Code

Automated trajectory planning with cascaded DRL

IEEE Magazine on Aerospace and Electronic Systems 2025

Automated Trajectory Planning: A Cascaded Deep Reinforcement Learning Approach for Low-Thrust Spacecraft Orbit-Raising

Developed a novel Cascaded Deep Reinforcement Learning (CDRL) approach to optimize low-thrust spacecraft trajectory planning, significantly improving time-efficient orbit transfers in complex multi-body environments for transfers to GEO and NRHO.

↗ Paper ↗ Code

APEC 2026 Conference Paper

An Integrated AI-Based Approach for Virtual-Impedance Scheduling and Cyber-Attack Mitigation in Smart-Grid Environments

Introduced a resilient neural coordination framework for grid-forming inverter networks that maintains stability and coordination under cyberattacks in smart-grid environments.

↗ Paper ↗ Code

AIAA SCITECH 2024

Machine Learning Assisted Low-Thrust Orbit-Raising: A Comparative Assessment of Sequential Algorithm and Deep RL

Developed a machine-learning-assisted method for optimizing low-thrust orbit-raising trajectories, integrating a sequential algorithm with a neural network-based high-level planner and benchmarking it against deep reinforcement learning approaches for geostationary and halo-orbit missions.

↗ Paper ↗ Code

Cascaded deep RL low-thrust orbit transfer

IEEE Access 2023

Cascaded Deep Reinforcement Learning-Based Multi-Revolution Low-Thrust Spacecraft Orbit-Transfer

Developed a cascaded DRL model for optimizing long-duration, low-thrust spacecraft transfers from GTO to GEO. Guided by a gradient-aided reward function, the method significantly reduces transfer time and improves spacecraft autonomy in complex multi-revolution transfers.

↗ Paper ↗ Code

Springer Nature 2025 Submitted

Improving Adversarial Robustness Through Adaptive Learning-Driven Multi-Teacher Knowledge Distillation

Adaptive multi-teacher knowledge distillation framework aimed at improving adversarial robustness beyond standard single-teacher or conventional training approaches.

↗ Paper ↗ Code

Mode-guided feature augmentation for domain generalization

BMVC 2021

Mode-Guided Feature Augmentation for Domain Generalization

Proposed a simple and efficient domain generalization approach that augments source domains by exploring dominant modes of variation in the feature space, improving generalization to unseen domains across standard DG benchmarks.

↗ Paper ↗ Code

IEEE SIU 2020

Learned vs. Hand-Crafted Features for Deep Learning Based Aperiodic Laboratory Earthquake Time-Prediction

Developed and compared machine learning models for laboratory earthquake prediction using LANL data, where CNN-LSTM models improved time-to-failure prediction over hand-crafted approaches.

↗ Paper ↗ Code

Journal of Neuroscience Methods 🔬 TUBITAK Funded

A Behavioral Paradigm for Cortical Control of a Robotic Actuator by Freely Moving Rats in a One-Dimensional Two-Target Reaching Task

Demonstrated trajectory-based neuroprosthetic control in rodents using primary motor cortex activity, providing a cost-effective platform for studying brain-machine interfaces and neural control.

↗ Paper ↗ Code

Experience

Graduate Research AssistantISCAAS Lab · Kansas State UniversityAug 2021 — Present

Developed GRALP — a latent-skill offline RL framework for contact-rich robotic manipulation; ~8% higher avg. performance on D4RL, Adroit, RoboSuite. Related work includes submissions to IJCAI 2026 and IROS 2026, alongside accepted work at CVsports @ CVPR 2026.
Led NASA-funded SA3C project: attention-based RL agent for low-thrust spacecraft trajectory optimization, reducing transfer time by 10% over strong baselines. Published in IEEE Transactions on Aerospace and Electronic Systems (2025).
Engineered AI-driven security controller for smart grids (DOE-funded): high attack-detection rates, low false positives on inverter-level anomalies, faster post-disturbance recovery in MATLAB/Simulink + PyTorch simulations.
Authoring research proposals to secure federal and industry funding for robust RL and autonomous systems research.

🛰 NASA⚡ DOE

Graduate Research AssistantNeuroprosthetics Group · Istanbul Medipol UniversityOct 2018 — Dec 2020

Engineered a cortically-driven robotic arm control system using primary motor cortex signals from freely moving rats. First trajectory-based neuroprosthetic control in rodents, >78% accuracy. Published in Journal of Neuroscience Methods.
Built CNN-LSTM models for aperiodic earthquake time-prediction using LANL data, significantly outperforming traditional signal-processing baselines. Published at IEEE SIU 2020.

🔬 TUBITAK

Field EngineerTetraPakApr 2016 — Jun 2018

Technical support and troubleshooting for complex automated industrial systems, building deep familiarity with real-world control and automation constraints.

Skills

RL & Decision-Making

Offline RL Online RL Imitation Learning Policy Optimization Policy Refinement Long-Horizon Planning Latent Skill Learning Actor-Critic Methods

Foundation Models & Adaptation

Vision-Language Models VLA Models Robot Foundation Models CLIP LoRA / PEFT Preference-Based Fine-Tuning RLHF / GRPO

Generative & Representation Learning

Diffusion Models VAEs Latent Representations Sequence Modeling World Models Masked Modeling Domain Generalization

Robotics & Embodied AI

Robot Learning Contact-Rich Manipulation Dexterous Manipulation MuJoCo RoboSuite D4RL / Adroit Gazebo ROS 2

Programming & ML Tools

Python PyTorch C++ Linux CUDA / GPU Training Hugging Face / Transformers Scalable ML Experiments MATLAB / Simulink Weights & Biases Docker

Awards & Service

🏆

Best Presentation Award — APEC 2026IEEE Applied Power Electronics Conference & Exposition, 2026

⭐

Graduate Student of the Month — March 2026Kansas State University, Dept. of Computer Science

🎓

Funded PhD Scholarship & Research AssistantshipKansas State University, 2021 — Present

🎓

Funded Master's Scholarship & Research AssistantshipIstanbul Medipol University, 2018 — 2020

Invited Peer Reviewer

IEEE Trans. Cloud ComputingIEEE Trans. Aersopace & Elecronics systemsIEEE AccessIEEE Jouranl. STARSWiley Engineering Reports

Leadership & Service

President · KSU CS Graduate Students Assoc. (2023–24)Graduate Mentor · Prospective PhD StudentsPresident · Mechatronics Club UET Lahore (2013–14)