Markov Decision Process Value Iteration and Linear Programming

Cargado por

gulshan kumar

0% encontró este documento útil (0 votos)

86 vistas4 páginas

Markov Decision Process

Título original

AI MDP Assignment

Derechos de autor

Formatos disponibles

PDF, TXT o lea en línea desde Scribd

Compartir este documento

Compartir o incrustar documentos

Opciones para compartir

¿Le pareció útil este documento?

¿Este contenido es inapropiado?

Denunciar este documento

Markov Decision Process

Copyright:

Formatos disponibles

Descargue como PDF, TXT o lea en línea desde Scribd

Marcar por contenido inapropiado

0% encontró este documento útil (0 votos)

86 vistas4 páginas

Markov Decision Process Value Iteration and Linear Programming

Cargado por

gulshan kumar

Markov Decision Process

Copyright:

Formatos disponibles

Descargue como PDF, TXT o lea en línea desde Scribd

Marcar por contenido inapropiado

Saltar a página

Está en la página 1de 4

Buscar dentro del documento

ASSIGNMENT II

MARKOV DECISION PROCESS

You will be given a grid world based MDP parameterized as below
Input: First line of the input consists of 2 space separated integers N M the dimensions
of the grid world.
N lines follow, each having M real numbers specifying the rewards for getting into that
state
Next line consists of 2 space separated integers E W, number of end states and number
of walls
Next E lines follow - each having 2 integers - the coordinates of end states.
Next W lines follow - each having 2 integers - the coordinates of walls.
Next line has 2 integers specifying the coordinates of the start state.
Next line has 1 integer specifying the unit step reward.

The following input corresponds to the MDP below:

44
-x 0 0 0
0 0 0 0
x/10 0 0 0
0 -x/10 0 x
21
00
33
12
30
-x / 5
+X / 10

START -X / 10

Problem Specifications
● Rows are numbered 0,1,2,3 from top to bottom and columns are numbered 0,1,2,3
from left to right. Eg. Start cell is (3,0)
● The cell (3,3) is the positive(green) sink while (0,0) is the negative(red) sink
● The blue cells are blocked (assume them as walls)
● The borders of the grid are also walls
Agent can go North, South, East or West
Action from a state results in
○ Movement in intended direction with probability 0.8
○ Movement in directions perpendicular to the intended direction with 0.1 probability
each (0.8 + 0.1 + 0.1 = 1). Eg. If action is North, then actual movement will be to North
with 0.8 prob, to East with 0.1 prob, & to West with 0.1 prob.
●If an action results in reaching a cell with wall, agent will remain in the same cell
● No action needs to be performed at terminal states
NOTE:
You will be given numeric inputs for PART 1 of the assignment.
For Part 2 and 3 of the assignment replace X with your team number.

Problem Statement
The assignment consists of three parts:
Part A: Given any MDP of the aforementioned form - write a program that performs
Value Iteration(VI) on that MDP and gives final utility of each state as output. For
termination condition consider 1% change or less within tolerance.
Output format: if the grid is n * m , output should consist of n lines - having m space
separated values each upto 3 decimal precision.
For ex: if grid is 2 * 3
Output: 5.300 3.200 7.200
1.200 5.900 6.100
Part B: Perform the VI algorithm on the above given specific MDP [here x will be your
team number]. Now, carefully analyse how the policy changes over the iterations.
Observe the iterations in which there is a change in the optimal action to be taken in any
state and why the policy changed in that specific way.
You have to submit a handwritten / pdf note, highlighting the iterations in which the
policy changes and then explain the reason for policy changing in that specific way.
Part C: Model the above problem(specific MDP) using LP shown below

Q1. Model the parameters r,A and α

Q2. Use the excel LP solver to compute the x values and the expected utilities for this
MDP
Please verify that the expected utility obtained is equivalent to the one obtained using
the VI algorithm. The VI value and LP value can differ at max by delta*(1.2)

Deliverables
You are supposed to submit :
1. A working code for general MDP solving as specified in part A. 20 marks
2. Handwritten / pdf as specified in part B. 20 marks
3. Excel file or .ods (LibreOffice) file, showing the LP solved. 20 marks
Upload the above files in a zipped folder with the name
Assignment2_teamNumber.zip and submit the hard copy in class.
Deadline
● The zip folder needs to be uploaded on moodle on or before March 12, 11.55 pm.
Note: You need to load the solver add-in in Excel if not already installed. It comes
loaded with LibreOffice packages by default.

También podría gustarte

Mathcad and Matlab Instruction Guide: by Breneman Whitfield (gth782h)
Documento26 páginas
Mathcad and Matlab Instruction Guide: by Breneman Whitfield (gth782h)
Puspal Ghosh
Aún no hay calificaciones
Skip
Documento22 páginas
Skip
Anonymous 0MQ3zR
Aún no hay calificaciones
Excel Solver Tutorial: Solving LP, IP & TSP Problems
Documento26 páginas
Excel Solver Tutorial: Solving LP, IP & TSP Problems
parth
Aún no hay calificaciones
7.2. Obtención de Modelo Matemático
Documento12 páginas
7.2. Obtención de Modelo Matemático
matias
Aún no hay calificaciones
Assignment 3
Documento5 páginas
Assignment 3
ritik12041998
Aún no hay calificaciones
Efficient Implementation of 16-Bit Multiplier-Accumulator Using Radix-2 Modified Booth Algorithm and SPST Adder Using Verilog
Documento12 páginas
Efficient Implementation of 16-Bit Multiplier-Accumulator Using Radix-2 Modified Booth Algorithm and SPST Adder Using Verilog
Anonymous e4UpOQEP
Aún no hay calificaciones
Question Bank
Documento26 páginas
Question Bank
Arun Choudhary
Aún no hay calificaciones
Condition Numbers and Pivoting in Linear Systems
Documento4 páginas
Condition Numbers and Pivoting in Linear Systems
aleyhaider
Aún no hay calificaciones
Lab 01
Documento1 página
Lab 01
Veneiro
Aún no hay calificaciones
Matlab Basics Tutorial: Electrical and Electronic Engineering & Electrical and Communication Engineering Students
Documento23 páginas
Matlab Basics Tutorial: Electrical and Electronic Engineering & Electrical and Communication Engineering Students
sushantnirwan
Aún no hay calificaciones
Lab02 Worksheet
Documento4 páginas
Lab02 Worksheet
Spin Fotonio
Aún no hay calificaciones
2 To 1 Mux
Documento19 páginas
2 To 1 Mux
TelmoDiez
100% (1)
AI Test1 Key
Documento14 páginas
AI Test1 Key
Lakshmi Ramu
Aún no hay calificaciones
Convolutional Neural Networks (LeNet) - DeepLearning 0.1 Documentation
Documento12 páginas
Convolutional Neural Networks (LeNet) - DeepLearning 0.1 Documentation
Sumit Patel
Aún no hay calificaciones
SVM Example in R
Documento4 páginas
SVM Example in R
John Alexis Cabolis
Aún no hay calificaciones
Building The Up
Documento30 páginas
Building The Up
abhiii86
Aún no hay calificaciones
Embedded Systems Interfacing Speed Control
Documento7 páginas
Embedded Systems Interfacing Speed Control
Andy Meyer
Aún no hay calificaciones
MATLAB Matrices and DC Circuits Analysis
Documento5 páginas
MATLAB Matrices and DC Circuits Analysis
Ahmad Ash Sharkawi
Aún no hay calificaciones
A Scenario 4
Documento28 páginas
A Scenario 4
Vasi Acm
Aún no hay calificaciones
Matlogix Question 2012
Documento12 páginas
Matlogix Question 2012
Aman04509
Aún no hay calificaciones
Computer Graphis
Documento7 páginas
Computer Graphis
Albin Ps Plappallil
Aún no hay calificaciones
ECEN 4233 – Goldschmidt Algorithm
Documento13 páginas
ECEN 4233 – Goldschmidt Algorithm
Cale Spratt
100% (1)
Practical Exercise Epipolar Geometry
Documento3 páginas
Practical Exercise Epipolar Geometry
dansileshi
Aún no hay calificaciones
CG Lecture3 V
Documento19 páginas
CG Lecture3 V
Zeenah khalid ŔJ
Aún no hay calificaciones
Chem 142 Experiment #1: Atomic Emission: Deadline: Due in Your TA's 3rd-Floor Mailbox 72 Hours After Your Lab Ends
Documento8 páginas
Chem 142 Experiment #1: Atomic Emission: Deadline: Due in Your TA's 3rd-Floor Mailbox 72 Hours After Your Lab Ends
api-301880160
Aún no hay calificaciones
FSM Prob
Documento3 páginas
FSM Prob
Souhardya Mondal
Aún no hay calificaciones
Output Primitives (Rasterization)
Documento30 páginas
Output Primitives (Rasterization)
Suada Bőw Wéěžý
Aún no hay calificaciones
Numerical Solution of PDE: Comsol Multiphysics: Example 1, The Poisson Equation On An Ellipse
Documento4 páginas
Numerical Solution of PDE: Comsol Multiphysics: Example 1, The Poisson Equation On An Ellipse
Peanut d. Destroyer
Aún no hay calificaciones
Ass2 2022 2
Documento3 páginas
Ass2 2022 2
saumya11599
Aún no hay calificaciones
EENG 2611 HW Lab1 PDF
Documento6 páginas
EENG 2611 HW Lab1 PDF
Pipe Ogunbanwo
Aún no hay calificaciones
Me3360 HW1-SP2013
Documento3 páginas
Me3360 HW1-SP2013
elm391229
Aún no hay calificaciones
Appendix A. Verilog Examples: A.1 Combinational Logic Structures
Documento14 páginas
Appendix A. Verilog Examples: A.1 Combinational Logic Structures
Trenton Lindholm
Aún no hay calificaciones
Efficient VLSI Implementation of Modulo Addition and Multiplication
Documento10 páginas
Efficient VLSI Implementation of Modulo Addition and Multiplication
vka_prince
Aún no hay calificaciones
ENG3104 Assignment3
Documento8 páginas
ENG3104 Assignment3
Nayim Mohammad
Aún no hay calificaciones
Green Energy
Documento13 páginas
Green Energy
Alin Òó
Aún no hay calificaciones
Configure MTL Receiver and Transmitter Settings
Documento3 páginas
Configure MTL Receiver and Transmitter Settings
JHANSI
Aún no hay calificaciones
Error Correcting Codes Guide for Single Bit Correction
Documento17 páginas
Error Correcting Codes Guide for Single Bit Correction
Lê Thanh Hùng
Aún no hay calificaciones
Bode Plot
Documento5 páginas
Bode Plot
Alexandro Andra Pizzaro
Aún no hay calificaciones
Mem351lab MatlabSimulink
Documento13 páginas
Mem351lab MatlabSimulink
ebrehe
Aún no hay calificaciones
MATLAB
Documento47 páginas
MATLAB
Maryam Kausar
Aún no hay calificaciones
Parallel Implementation of A 4 X 4-Bit Multiplier
Documento4 páginas
Parallel Implementation of A 4 X 4-Bit Multiplier
Mathew George
Aún no hay calificaciones
Digital Assignment - I Computational Fluid Dynamics: Course Code-Mee4006
Documento19 páginas
Digital Assignment - I Computational Fluid Dynamics: Course Code-Mee4006
SIDDHANT KUMAR 17BEM0015
Aún no hay calificaciones
Operations Research
Documento58 páginas
Operations Research
shailja Singh
Aún no hay calificaciones
Lab #2 Signal Ploting and Matrix Operatons in Matlab
Documento17 páginas
Lab #2 Signal Ploting and Matrix Operatons in Matlab
Nihal Ahmad
Aún no hay calificaciones
Computer Task 2
Documento4 páginas
Computer Task 2
Ivan Er
Aún no hay calificaciones
Data Link Layer Notes
Documento14 páginas
Data Link Layer Notes
Alexz Alee
Aún no hay calificaciones
ADS Tutorial 2
Documento4 páginas
ADS Tutorial 2
Minh Vu
Aún no hay calificaciones
Combinational Circuit Analysis and Design
Documento63 páginas
Combinational Circuit Analysis and Design
魏延任
Aún no hay calificaciones
Split The Non-Key Columns To Separate Tables With Key Column in Both
Documento25 páginas
Split The Non-Key Columns To Separate Tables With Key Column in Both
r.m.ram234
Aún no hay calificaciones
PCM Codigo
Documento14 páginas
PCM Codigo
Leux Guillen
Aún no hay calificaciones
Floorplan Design of VLSI
Documento18 páginas
Floorplan Design of VLSI
Atul Prakash Dwivedi
Aún no hay calificaciones
Design A Full Adder by Using Two Half Adders. (7M, May-19)
Documento10 páginas
Design A Full Adder by Using Two Half Adders. (7M, May-19)
Sri Silpa Padmanabhuni
Aún no hay calificaciones
Control Systems Group Project 2
Documento3 páginas
Control Systems Group Project 2
Yu-Yun Chang
Aún no hay calificaciones
Free Body Diagram and Newton's Law
Documento4 páginas
Free Body Diagram and Newton's Law
seeknana2833
Aún no hay calificaciones
Applications of The Wavelet Transform in Image Processing: 1 Prelimiaries
Documento16 páginas
Applications of The Wavelet Transform in Image Processing: 1 Prelimiaries
Dundi Nath
Aún no hay calificaciones
Fundamental Math
De Everand
Fundamental Math
Russell Pead
Aún no hay calificaciones
Integer Optimization and its Computation in Emergency Management
De Everand
Integer Optimization and its Computation in Emergency Management
Zhengtian Wu
Aún no hay calificaciones
A Brief Introduction to MATLAB: Taken From the Book "MATLAB for Beginners: A Gentle Approach"
De Everand
A Brief Introduction to MATLAB: Taken From the Book "MATLAB for Beginners: A Gentle Approach"
Peter Kattan
Calificación: 2.5 de 5 estrellas
2.5/5 (2)
Matrices with MATLAB (Taken from "MATLAB for Beginners: A Gentle Approach")
De Everand
Matrices with MATLAB (Taken from "MATLAB for Beginners: A Gentle Approach")
Peter Kattan
Calificación: 3 de 5 estrellas
3/5 (4)
Microelectronic Systems 1 Checkbook: The Checkbook Series
De Everand
Microelectronic Systems 1 Checkbook: The Checkbook Series
R.E. Vears
Aún no hay calificaciones
Westford University College readies flagship campus with new programs
Documento20 páginas
Westford University College readies flagship campus with new programs
Saju Janardhanan
Aún no hay calificaciones
CELTA Pre-Interview Grammar, Vocabulary and Pronunciation Exercises
Documento4 páginas
CELTA Pre-Interview Grammar, Vocabulary and Pronunciation Exercises
MichelJorge
100% (2)
Exploratory Data Analysis: M. Srinath
Documento19 páginas
Exploratory Data Analysis: M. Srinath
roma
Aún no hay calificaciones
Explore Spanish Lesson Plan - Animals
Documento8 páginas
Explore Spanish Lesson Plan - Animals
api-257582917
Aún no hay calificaciones
IBM Credit Corp BPR Process
Documento8 páginas
IBM Credit Corp BPR Process
Anubhav Puri
Aún no hay calificaciones
Komposit UHMWPE Sebagai Alternatif Bantalan Rel Kereta Api: Abel Evan, Alia Kristika, Farid Mulia Latief
Documento11 páginas
Komposit UHMWPE Sebagai Alternatif Bantalan Rel Kereta Api: Abel Evan, Alia Kristika, Farid Mulia Latief
Alia Kristika
Aún no hay calificaciones
2022 - J - Chir - Nastase Managementul Neoplaziilor Pancreatice Papilare
Documento8 páginas
2022 - J - Chir - Nastase Managementul Neoplaziilor Pancreatice Papilare
corina
Aún no hay calificaciones
What Are Your Observations or Generalizations On How Text/ and or Images Are Presented?
Documento2 páginas
What Are Your Observations or Generalizations On How Text/ and or Images Are Presented?
Darlene Panisa
Aún no hay calificaciones
Ubc 2015 May Sharpe Jillian
Documento65 páginas
Ubc 2015 May Sharpe Jillian
herzog
Aún no hay calificaciones
Land-Use Planning
Documento15 páginas
Land-Use Planning
Ciara Mary
Aún no hay calificaciones
Valentine Gifting - Accessories Edition
Documento25 páginas
Valentine Gifting - Accessories Edition
Priyanath Paul
Aún no hay calificaciones
High Yield Pics For STEP 2 CK
Documento24 páginas
High Yield Pics For STEP 2 CK
Kinan Alhalabi
96% (28)
Johnson 1999
Documento20 páginas
Johnson 1999
Linh Hoàng Phương
Aún no hay calificaciones
Automated Crime Reporting System
Documento101 páginas
Automated Crime Reporting System
Deepak Kumar
60% (10)
Mpce 24
Documento39 páginas
Mpce 24
Sachin Mehla
0% (1)
Otis Brochure Gen2life 191001-BELGIUM Small
Documento20 páginas
Otis Brochure Gen2life 191001-BELGIUM Small
veersainik
Aún no hay calificaciones
EDUC 5 - Questionaires
Documento7 páginas
EDUC 5 - Questionaires
William Ranara
Aún no hay calificaciones
2021.01.28 - Price Variation of Steel Items - SAIL Ex-Works Prices of Steel - RB-Civil
Documento2 páginas
2021.01.28 - Price Variation of Steel Items - SAIL Ex-Works Prices of Steel - RB-Civil
Saugata Halder
Aún no hay calificaciones
EA Flora 1
Documento3 páginas
EA Flora 1
A. Magno
Aún no hay calificaciones
Hmdu - English
Documento20 páginas
Hmdu - English
Abdulaziz Seiko
Aún no hay calificaciones
Mohammad R. Mestarihi: About Me Objective
Documento1 página
Mohammad R. Mestarihi: About Me Objective
Mhmd Mstt
Aún no hay calificaciones
DOW™ HDPE 05962B: High Density Polyethylene Resin
Documento3 páginas
DOW™ HDPE 05962B: High Density Polyethylene Resin
Fredo NL
Aún no hay calificaciones
Footprints 080311 For All Basic Ics
Documento18 páginas
Footprints 080311 For All Basic Ics
Amit Pujar
Aún no hay calificaciones
Book 2 - Test 1
Documento2 páginas
Book 2 - Test 1
Đức Long
Aún no hay calificaciones
MAN 2 Model Medan Introduction to School Environment Report
Documento45 páginas
MAN 2 Model Medan Introduction to School Environment Report
dinda
Aún no hay calificaciones
Predictive Analytics: QM901.1x Prof U Dinesh Kumar, IIMB
Documento36 páginas
Predictive Analytics: QM901.1x Prof U Dinesh Kumar, IIMB
Venkata Nelluri Pmp
Aún no hay calificaciones
Event Report
Documento2 páginas
Event Report
akshitdaharwal997
Aún no hay calificaciones
Pahang JUJ 2012 SPM Chemistry
Documento285 páginas
Pahang JUJ 2012 SPM Chemistry
JeyShida
100% (1)
The Golden Harvest
Documento3 páginas
The Golden Harvest
Mark Angelo Diaz
Aún no hay calificaciones
Philosophy of Disciple Making Paper
Documento5 páginas
Philosophy of Disciple Making Paper
api-665038631
Aún no hay calificaciones