Search for: optimal-policies

    Efficient, Fair, and QoS-Aware policies for wirelessly powered communication networks

    Article ; IEEE Transactions on Communications ; Volume 68, Issue 9, 2020, Pages 5892-5907 ; Rezaei, R ; Omidvar, N ; Movahednasab, M ; Pakravan, M. R ; Sun, S ; Guan, Y. L ; Sharif University of Technology
    Institute of Electrical and Electronics Engineers Inc  2020
    Abstract
    In this paper, we propose efficient wireless power transfer (WPT) policies for various practical scenarios in wirelessly powered communication networks (WPCNs). First, we consider WPT from an energy access point (E-AP) to multiple energy receivers (E-Rs). We formulate the problem of maximizing the total average received power of the E-Rs subject to power constraints of the E-AP, which is a non-convex stochastic optimization problem. Using eigenvalue decomposition techniques, we derive a closed-form expression for the optimal policy, which requires the distribution of the channel state information (CSI) in the network. We then propose a near-optimal policy that does not require this knowledge... 

    Developing an Efficient Solution Method Based on Dynamic Programming with Considering Nested and Cancelation in Revenue Management

    M.Sc. Thesis ; Sharif University of Technology ; Aliahmadi, Fatemeh (Author) ; Modarres, Mohammad (Supervisor)
    Abstract
    In this thesis, airplane capacity is treated as a perishable asset, and revenue management models are employed to allocate capacity among different customer classes. The thesis contains two main revenue management models. In the first, single-leg capacity allocation with overbooking, cancellation, no-show, and sell-up is studied. The main contribution of this model is considering sell-up or nesting together with the other assumptions. This problem is formulated as a dynamic programming model. The optimal policy and the optimal seat allocation in the case of overbooking are also determined. Contrary to what is commonly assumed, it is shown that the income from a lower fare class is more than that from accepting a higher class. In this...

    Efficient Cooperative Power and Data Transmission in Wireless Communication Networks

    M.Sc. Thesis ; Sharif University of Technology ; Haghifam, Mahdi (Author) ; Nasiri Kenari, Masoumeh (Supervisor) ; Ashtiani, Farid (Supervisor)
    Abstract
    Relay-assisted communication is one of the promising techniques that have been proposed for wireless networks. The main idea of a relay network is to improve data transmission efficiency by using intermediate relay nodes which support data transmission from a source to a destination. From another perspective, energy consumption has recently become an important consideration for design of wireless communication networks. The shrinking size and increasing density of next-generation wireless devices imply reduced battery capacities, indicating that designing energy efficient networks is of paramount importance. In this regard, recent emphasis on green communications has generated great... 

    Dynamic Pricing of Charter Flight Tickets with Learning

    M.Sc. Thesis ; Sharif University of Technology ; Mehrdar, Atabak (Author) ; Modarres, Mohammad (Supervisor)
    Abstract
    In this thesis, an approach is developed to obtain an optimal pricing policy for chartered flights. To do so, a model within the framework of dynamic programming is presented and its main structure is analyzed. Since the dimension of this model is very large in real-world cases, a solution method is developed using the “Q-learning” technique, an appropriate approach in approximate dynamic programming and reinforcement learning. The analysis is carried out under two different assumptions regarding demand, namely “linear-deterministic” and probabilistic demand for transition probabilities. An exact solution for the deterministic demand case is developed. Furthermore, for...

    Inverse Reinforcement Learning with Gaussian Processes

    M.Sc. Thesis ; Sharif University of Technology ; Habibi, Beheshteh (Author) ; Sharifi Tabar, Mohsen (Supervisor)
    Abstract
    Inverse reinforcement learning (IRL) is one of the machine learning frameworks based on learning from humans; that is, instead of producing a decision process that maximizes a predefined reward function, it seeks to find the reward function based on the observed behavior of an agent. The biggest motivation for IRL is that determining a reward function for a problem is usually very difficult. We consider IRL in Markov decision processes; that is, the problem of extracting a reward function under the assumption that the optimal behavior is known. IRL could be useful for apprenticeship learning to obtain skilled behavior, and for optimizing a reward function by a natural system. We first determine a set...

    Divided POMDP method for complex menu problems in spoken dialogue systems

    Article ; 2010 IEEE Workshop on Spoken Language Technology, SLT 2010 - Proceedings, 12 December 2010 through 15 December 2010 ; 2010, Pages 484-489 ; 9781424479030 (ISBN) ; Habibi, M ; Rahbar, S ; Sameti, H ; The Institute of Electrical and Electronics Engineers (IEEE); IEEE Signal Processing Society ; Sharif University of Technology
    2010
    Abstract
    In this paper, a problem in spoken dialogue systems, namely the menu problem, is introduced and solved by a POMDP model. To overcome the large size of the menu problem, a new method for achieving an optimal policy, called the divided POMDP method, is introduced. Conditions for the problem to be solved by the proposed method are specified, and the problem properties resulting in the given conditions are presented. The proposed method is evaluated using a typical menu problem with different menu sizes, and it is shown that this method is superior to conventional methods such as FRTDP for the problems it is capable of solving. Moreover, it converges faster to an optimal policy.

    Delay-optimal static relaying policy in a slotted aloha wireless network

    Article ; 2018 Iran Workshop on Communication and Information Theory, IWCIT 2018, 25 April 2018 through 26 April 2018 ; 2018, Pages 1-6 ; 9781538641491 (ISBN) ; Vaezi, K ; Ashtiani, F ; Sharif University of Technology
    Institute of Electrical and Electronics Engineers Inc  2018
    Abstract
    We consider a slotted Aloha-based wireless communication network comprising a source and an active relay, both transmitting to a common destination. The relay node is able to relay the unsuccessfully transmitted packets of the source node, if it can detect them. We find, in analytical closed form, the optimal static relaying policy that minimizes the average transmission delay (ATD) of source packets, while the ATD of the relay's own packets is constrained. A static relaying policy is determined by the acceptance and relaying probabilities of the relay node. According to the acceptance probability, the relay node decides whether or not to accept an unsuccessfully transmitted packet from the source, while the...

    Determining Optimal Policy for Production and Inventory Control of Deteriorating Products in Supply Chains Under Uncertain Conditions

    Ph.D. Dissertation ; Sharif University of Technology ; Sazvar, Zeinab (Author) ; Akbari Jokar, Mohammad Reza (Supervisor)
    Abstract
    Nowadays, increasing difficulties in production and procurement conditions, such as competition among manufacturing firms, product variety, rapidly changing customer tastes, shortening product lifetimes, and the complexity of demand forecasting, on the one hand, and the key role of consumers and deteriorating products in countries' revenue on the other, make replenishment policies for deteriorating products both challenging and interesting for researchers and corporate managers. In this dissertation, several novel models of replenishment policies for deteriorating products under uncertainty are proposed with the help of single- or multi-objective mathematical models. One important...

    Analysis and Enhancement of Information Transmission in a Cognitive Radio Device-to-Device Network with Full-Duplex Communication Capability

    M.Sc. Thesis ; Sharif University of Technology ; Yousefi, Mohammad (Author) ; Ashtiani, Farid (Supervisor) ; Mirmohseni, Mahtab (Supervisor)
    Abstract
    Increasing demand for bandwidth in wireless communication systems has led to excessive congestion in the frequency spectrum. Since radio spectrum is an expensive and scarce resource, its efficient use is an important challenge. These limitations, as well as the increasing number of communication service users, necessitate the use of newer technologies that work efficiently with numerous users. Device-to-Device (D2D) communications, full-duplex capability, and cognitive radio systems have been introduced for this reason. In previous technologies, full-duplex transmission was not implemented because of excessive self-interference (SI). However, today's technology allows self-interference to be attenuated up to 100...

    Opportunistic RF Energy Harvesting in Cognitive Radio Networks

    M.Sc. Thesis ; Sharif University of Technology ; Miri, Zoheir (Author) ; Nasiri-kenari, Masoumeh (Supervisor) ; Ashtiani, Farid (Supervisor)
    Abstract
    Energy efficiency plays a key role in designing future wireless networks (5G mobile networks). Moreover, spectrum efficiency is another critical issue in designing wireless networks. Cognitive radios can improve spectrum efficiency. On the other hand, radio frequency (RF) energy harvesting has emerged as a promising technique to supply energy for wireless networks and thereby increase their energy efficiency. In this thesis, we propose a new technique for RF-powered CRNs. To this end, we consider a cognitive radio network comprising a primary user and a secondary user. The primary user uses a typical frequency band to transmit data on a time-slot basis, and both the...

    A queueing system with inventory and mixed exponentially distributed lead times

    Article ; International Journal of Advanced Manufacturing Technology ; Volume 53, Issue 9-12, August 2011, Pages 1231-1237 ; 02683768 (ISSN) ; Saffari, M ; Haji, R ; Hassanzadeh, F ; Sharif University of Technology
    2011
    Abstract
    We consider M/M/1/∞ systems with inventory in which completing each service in the queueing system requires an on-hand inventory. A continuous review (r, Q) policy is considered for the inventory system, and lead times are assumed to be mixed exponentially distributed. During stockout, arriving demands are rejected from the queue and become lost (lost sale situation). We derive the product-form stationary distribution of the joint queue length and on-hand inventory. The resulting distribution is employed to compute performance measures which can be used to derive the optimal policy. The optimal order size for a predetermined reorder policy is determined first and, finally, the optimal reorder point and...

    Dynamic-programming-based failure-tolerant control for satellite with thrusters in 6-DOF motion

    Article ; Advances in Space Research ; Volume 65, Issue 12, 2020, Pages 2857-2877 ; Taheri, A ; Assadian, N ; Sharif University of Technology
    Elsevier Ltd  2020
    Abstract
    In this paper, a dynamic-programming approach to the coupled translational and rotational control of thruster-driven spacecraft is studied. To reduce the complexity of the problem, dynamic-programming-based optimal policies are calculated using decoupled position and attitude dynamics with generalized forces and torques as controls. A quadratic-programming-based control allocation is then used to map the controls to actuator commands. To control the spacecraft in the event of thruster failure, both the dynamic programming policies and control allocation are reconfigured to cope with the losses in controls. The control allocation parameters are adjusted dynamically to ensure the satellite... 

    Delay-Optimal cooperation policy in a slotted aloha full-duplex wireless network: static approach

    Article ; IEEE Systems Journal ; Volume 14, Issue 2, 2020, Pages 2257-2268 ; Vaezi, K ; Ashtiani, F ; Sharif University of Technology
    Institute of Electrical and Electronics Engineers Inc  2020
    Abstract
    We consider a cooperative wireless communication network comprising two full-duplex (FD) nodes transmitting to a common destination based on the slotted Aloha protocol. Each node has exogenous arrivals and may also relay some of the unsuccessfully transmitted packets of the other node. In this article, we find the optimal static policies of the nodes that minimize the sum of the average transmission delays, while the average transmission delay of each node is constrained. The static policy of each node specifies the probability of accepting an unsuccessfully transmitted packet of the other node and how the node prioritizes its transmissions. We show that in the optimal policies, just the node...

    Multi-agent machine learning in self-organizing systems

    Article ; Information Sciences ; Volume 581, 2021, Pages 194-214 ; 00200255 (ISSN) ; Hejazi, E ; Sharif University of Technology
    Elsevier Inc  2021
    Abstract
    This paper develops a novel insight and procedure that includes a variety of algorithms for finding the best solution in a structured multi-agent system with internal communications and a global purpose. In other words, it finds the optimal communication structure among agents and the optimal policy in this structure. First, a unique reinforcement learning algorithm is proposed to find the optimal policy of each agent in a fixed structure with non-linear function approximators like artificial neural networks (ANN) and with eligibility traces. Secondly, a mechanism is presented to perform self-organization based on the information of the learned policy. Finally, an algorithm that can discover... 

    Designing an optimum acceptance sampling plan using bayesian inferences and a stochastic dynamic programming approach

    Article ; Scientia Iranica ; Volume 16, Issue 1 E, 2009, Pages 19-25 ; 10263098 (ISSN) ; Akhavan Niaki, T ; Fallah Nezhad, M. S ; Sharif University of Technology
    2009
    Abstract
    In this paper, we use both stochastic dynamic programming and Bayesian inference concepts to design an optimum-acceptance-sampling-plan policy in quality control environments. To determine the optimum policy, we employ a combination of costs and risk functions in the objective function. Unlike previous studies, accepting or rejecting a batch is directly included in the action space of the proposed dynamic programming model. Using the posterior probability of the batch being in state p (the probability of non-conforming products), we first formulate the problem as a stochastic dynamic programming model. Then, we derive some properties for the optimal value of the objective function, which...

    Optimal policy of energy innovation in developing countries: Development of solar PV in Iran

    Article ; Energy Policy ; Volume 37, Issue 3, 2009, Pages 1116-1127 ; 03014215 (ISSN) ; Shafiei, E ; Saboohi, Y ; Ghofrani, M.B ; Sharif University of Technology
    2009
    Abstract
    The purpose of this study is to apply managerial economics and methods of decision analysis to study the optimal pattern of innovation activities for the development of new energy technologies in developing countries. For this purpose, a model of energy research and development (R&D) planning is developed and then linked to a bottom-up energy-systems model. The set of interlinked models provides a comprehensive analytical tool for the assessment of energy technologies and innovation planning, taking into account the specific conditions of developing countries. An energy-system model is used as a tool for the assessment and prioritization of new energy technologies. Based on the results of the...