Closed-Loop Co-Design of Motors, Motions, and Feedback Control for Robotic Manipulators

Jue-Te Lin; Zehui Lu; Yebin Wang

doi:10.53941/jaia.2026.100008

Abstract

The co-design paradigm offers substantial advantages for hardware and control system design by addressing multidisciplinary challenges within a unified framework. Established co-design frameworks for robot manipulators have predominantly focused on two components, motor/arm design and trajectory optimization. These frameworks do not adequately account for real-world disturbances and model uncertainties, and therefore yield suboptimal designs and closed-loop system performance. This paper proposes a closed-loop co-design (CLCD) framework that jointly determines motors, motions, and a feedback controller, where the controller comprises a reinforcement learning (RL)-based compensator and a classic proportional-derivative (PD) controller for trajectory tracking. Simulations validate (1) the effectiveness of the proposed CLCD framework in attenuating the sim-to-real gap, (2) the viability of incorporating an RL-based controller into the CLCD framework for flexible and efficient synthesis of control policies, and (3) the scalability of the CLCD framework, demonstrated by performing co-design for 12 and 120 tasks.
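
As a rough illustration of the controller structure described in the abstract, the sketch below adds a learned residual term to a proportional-derivative tracking law. It is a minimal sketch under stated assumptions, not the authors' implementation: the function name `pd_with_compensator`, the per-joint gain vectors `kp` and `kd`, the observation layout passed to the compensator, and the additive combination of the two terms are all hypothetical choices made for illustration.

```python
import numpy as np


def pd_with_compensator(q, dq, q_ref, dq_ref, kp, kd, compensator=None):
    """Return joint torques from a PD tracking law plus an optional residual.

    All names and the additive structure are illustrative assumptions.
    q, dq         : measured joint positions and velocities
    q_ref, dq_ref : reference positions and velocities from the planned motion
    kp, kd        : per-joint proportional and derivative gains
    compensator   : optional callable mapping an observation vector to a
                    torque correction (a stand-in for a trained RL policy)
    """
    e = q_ref - q            # position tracking error
    de = dq_ref - dq         # velocity tracking error
    tau_pd = kp * e + kd * de
    if compensator is None:
        return tau_pd
    # Hypothetical observation layout: state followed by tracking errors.
    obs = np.concatenate([q, dq, e, de])
    return tau_pd + compensator(obs)
```

With the compensator omitted, the function reduces to plain PD tracking, which makes it straightforward to compare closed-loop behaviour with and without the learned term in simulation.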


Scilight Press
