Sharif Digital Repository / Sharif University of Technology / Search result

A graph-theoretic approach toward autonomous skill acquisition in reinforcement learning

, Article Evolving Systems ; Volume 9, Issue 3 , 2018 , Pages 227-244 ; 18686478 (ISSN) Kazemitabar, S. J ; Taghizadeh, N ; Beigy, H ; Sharif University of Technology

Springer Verlag 2018

Abstract

Hierarchical reinforcement learning facilitates learning in large and complex domains by exploiting subtasks and creating hierarchical structures using these subtasks. Subtasks are usually defined through finding subgoals of the problem. Providing mechanisms for autonomous subgoal discovery and skill acquisition is a challenging issue in reinforcement learning. Among the proposed algorithms, a few of them are successful both in performance and also efficiency in terms of the running time of the algorithm. In this paper, we study four methods for subgoal discovery which are based on graph partitioning. The idea behind the methods proposed in this paper is that if we partition the transition...

Automatic discovery of subgoals in reinforcement learning using strongly connected components

, Article 15th International Conference on Neuro-Information Processing, ICONIP 2008, Auckland, 25 November 2008 through 28 November 2008 ; Volume 5506 LNCS, Issue PART 1 , 2009 , Pages 829-834 ; 03029743 (ISSN); 3642024890 (ISBN); 9783642024894 (ISBN) Kazemitabar, J ; Beigy, H ; Asia Pacific Neural Network Assembly (APNNA); International Neural Network Society (INNS); IEEE Computational Intelligence Society; Japanese Neural Network Society (JNNS); European Neural Network Society (ENNS) ; Sharif University of Technology

2009

Abstract

The hierarchical structure of real-world problems has resulted in a focus on hierarchical frameworks in the reinforcement learning paradigm. Preparing mechanisms for automatic discovery of macro-actions has mainly concentrated on subgoal discovery methods. Among the proposed algorithms, those based on graph partitioning have achieved precise results. However, few methods have been shown to be successful both in performance and also efficiency in terms of time complexity of the algorithm. In this paper, we present a SCC-based subgoal discovery algorithm; a graph theoretic approach for automatic detection of subgoals in linear time. Meanwhile a parameter tuning method is proposed to find the...

Stabilizing control structures: An optimization framework

, Article IEEE Transactions on Automatic Control ; 2021 ; 00189286 (ISSN) Mosalli, H ; Babazadeh, M ; Sharif University of Technology

Institute of Electrical and Electronics Engineers Inc 2021

Abstract

This paper presents a new optimization-based approach to determine the class of stabilizing control structures with the necessary set of feedback links for interconnected systems. The proposed approach relies on a graph theoretic interpretation and its equivalence in terms of binary linear programs (BLP). To carry out the primary goal, first, the stabilizability of an LTI system under the decentralized control structure is presented in terms of a BLP. Next, two graph-based criteria are proposed to characterize stabilizing control structures with the required feedback links. Finally, all possible stabilizing control structures with the necessary feedback links are derived via solving a set of...

Stabilizing control structures: An optimization framework

, Article IEEE Transactions on Automatic Control ; Volume 67, Issue 7 , 2022 , Pages 3738-3745 ; 00189286 (ISSN) Mosalli, H ; Babazadeh, M ; Sharif University of Technology

Institute of Electrical and Electronics Engineers Inc 2022

Abstract

This article presents a new optimization-based approach to determine the class of stabilizing control structures with the necessary set of feedback links for interconnected systems. The proposed approach relies on a graph-theoretic interpretation and its equivalence in terms of binary linear programs (BLPs). To carry out the primary goal, first, the stabilizability of a linear time-invariant (LTI) system under the decentralized control structure is presented in terms of a BLP. Next, two graph-based criteria are proposed to characterize stabilizing control structures with the required feedback links. Finally, all possible stabilizing control structures with the necessary feedback links are...

Sensor selection cost optimisation for tracking structurally cyclic systems: a P-order solution

, Article International Journal of Systems Science ; Volume 48, Issue 11 , 2017 , Pages 2440-2450 ; 00207721 (ISSN) Doostmohammadian, M ; Zarrabi, H ; Rabiee, H. R ; Sharif University of Technology

Abstract

Measurements and sensing implementations impose certain cost in sensor networks. The sensor selection cost optimisation is the problem of minimising the sensing cost of monitoring a physical (or cyber-physical) system. Consider a given set of sensors tracking states of a dynamical system for estimation purposes. For each sensor assume different costs to measure different (realisable) states. The idea is to assign sensors to measure states such that the global cost is minimised. The number and selection of sensor measurements need to ensure the observability to track the dynamic state of the system with bounded estimation error. The main question we address is how to select the state...