|
CS 598, Section EA Decision Making Under Uncertainty University of Illinois, Urbana-Champaign Spring 2007 Tentative Course Syllabus |
| Approximate Schedule | (updated 1/31/2007) |
| Session | Date | Topic | Readings | Sample Application Reading |
Assignment |
|---|---|---|---|---|---|
| |
|
Modeling Knowledge in Scrabble | Signup for paper presentations begins | ||
| |
|
Introduction to Decision Making and Uncertainty |
[Russell & Norvig '03] Ch. 13
|
||
| |
|
Bayesian Networks: Representation |
[Russell & Norvig '03] Ch. 14
|
||
| |
|
Bayesian Networks: Representation |
[Russell & Norvig '03] Ch. 14
|
||
| |
|
Bayesian Networks: Inference |
[Russell & Norvig '03] Ch. 14
|
||
| |
|
Bayesian Networks: Inference |
[Russell & Norvig '03] Ch. 14
|
||
| |
|
Bayesian Networks: Inference Complexity | |||
| |
|
Treewidth Algorithms | |||
| |
|
Classes CANCELED | |
||
| |
|
Hidden Markov Models (HMMs) and Dynamic Bayesian Networks (DBNs) |
[Russell & Norvig '03] Ch. 15
|
||
| |
|
Inference in DBNs |
[Russell & Norvig '03] Ch. 15
|
||
| |
|
Markov Decision Processes (MDPs) |
[Russell & Norvig '03] Ch. 17
|
||
| |
|
Partially Observable MDPs (POMDPs) |
[Littman; Brown U. thesis 1996] ch. 6-7, Slides lec.10 (using slides by Craig Boutilier)
|
||
| |
|
Partially Observable MDPs (POMDPs) |
[Littman; Brown U. thesis 1996] ch. 6-7, Slides lec.10 (using slides by Craig Boutilier)
|
Signup for paper presentations ends; Proposal 1 due; | |
| |
|
Partially Observable MDPs (POMDPs) |
[Littman; Brown U. thesis 1996] ch. 6-7, Slides lec.10 (using slides by Craig Boutilier)
|
| |
| |
|
POMDPs | |
||
| |
|
POMDPs | |
||
| |
|
POMDPs | |
|
|
| |
|
MOVED TO APRIL 2 | |||
| |
|
Paper Presentation: Reward Shaping in MDPs | [Ng, Harada, & Russell; ICML '99] | Abdullah Muzahid; Extended proposal due | |
| |
|
Paper Presentation: The challenge of poker | Aniruddh Nath | ||
| |
|
Paper Presentation: Stochastic Local Search for POMDP Controllers | Mark Sammons | ||
| |
|
Projects mid-semester review | |
5 min. presentations in class | |
| |
|
Paper Presentation: A probabilistic Approach to Solving Crossword Puzzles |
[Littman, Keim, & Shazeer, AIJ 134 (1-2), 2002]
|
Quang X. Do | |
| |
|
Paper Presentation: Planning with Sensing |
[Bonet & Geffner; AIPS 2000]
|
Yu Ru (Robin) | |
| |
|
Paper Presentation: Hierarchical Reinforcement Learning |
[Dietterich; ICML 1998]
|
Jehanzeb Abbas | |
| |
|
Paper Presentation: OPEN | |
TBA TBA |
|
| |
|
Paper Presentation: Planning with Sensing | Lih-Ching Chou | ||
| |
|
Paper Presentation: OPEN | |
||
| |
|
Posters session (5pm-7pm) | |
final project submission |
| Papers for Presentations |
| Number | Topic (PO = Partially Observable; PL = Planning; ST = Stochastic; SM = Semantics; RL = Reinforcement Learning) | Paper/s | Sample Application Reading |
Presenter |
|---|---|---|---|---|
| |
POMDPs (RL;ST) | TBA TBA |
||
| |
Exploration (ST) | TBA TBA |
||
| |
Utility elicitation and POMDPs (PL;ST) | TBA TBA |
||
| |
POMDPs models customers | TBA TBA |
||
| |
POMDP approximation | TBA TBA |
||
| |
POMDP approximation | TBA TBA |
||
| |
POMDP approximation | |||
| |
First-Order MDPs | |||
| |
Factored MDPs | |||
| |
Monitoring POMDPs | |||
| |
MDPs | |||
| |
MDPs | |||
| |
MDPs (survey) | |||
| |
Programmable RL agents | |||
| |
Inverse Reinforcement Learning | |||
| |
Function Approximation in MDPs | |||
| |
Reward Shaping in MDPs | |||
| |
Factored MDPs | |||
| |
POMDPs as DBNs | |||
| |
Relational MDPs | |||
| |
Approximating POMDPs | |||
| |
POMDPs | |||
| |
Approximating POMDPs | |||
| |
Approximating POMDPs | |||
| |
Approximating POMDPs | |||
| |
Planning with Nondeterminism and Sensing | |||
| |
Planning with Sensing | |||
| |
Planning with Sensing | |||
| |
Planning with Sensing: unifying view | |||
| |
Risk-sensitive planning | |||
| |
Poker Playing | |||
| |
Solving Crossword Puzzles | |||
| |
k-arm Bandit Problems | |||
| |
Developmental Reinforcement Learning |
| Possible Projects |
| Number | Topic | Presenter |
|---|---|---|
| |
Automatic Adventure-game player | |
| |
First-Order POMDPs | |
| |
Exact Factored MDPs | |
| |
Position Estimation for an autonomous car | |
| |
Stochastic Filtering via Logical Filtering | |
| |
Localization and Mapping with many world features | |
| |
Poker Playing | |
| |
Bridge Player | |
| |
First-Order Factored Planning | |
| |
Adventure-Game Exploration using Commonsense knowledge | |
| |
Approximate Deterministic POMDPs via Logical Filtering | |
| |
Relational Probabilistic Resolution in Dynamic Bayesian Networks | |
| |
POMDPs approximated via DBNs | |
| |
Robot control using a POMDP | |
| |
First-Order Reinforcement Learning | |
| |
Reshaping rewards using commonsense knowledge | |
| |
Partially observable LSA-based robot control architecture | |
| |
Reinforcement learning in deterministic domains | |
| |
POMDP model of a stock in the stock market | |
| |
Comparison of Approximation techniques for probabilistic inference | |
| |
LSA-based control system for a robotic arm | |
| |
Reward shaping for POMDPs |
| Bibliography |
| Key | Authors | Title |
|---|---|---|
| [McCarthy '58] | John McCarthy | Programs with Common Sense |
| [Amir & Maynard-Reid '99] | Eyal Amir and Pedrito Maynard-Reid II | Logic-Based Subsumption Architecture |
| [Russell & Norvig '03] | Stuart Russell and Peter Norvig | Artificial Intelligence, a Modern Approach |
| [Genesereth & Nilsson '87] | Michael R. Genesereth and Nils J. Nilsson | Logical Foundations for Artificial Intelligence |
| [McFarland '93] | Michael C. McFarland | Formal verification of sequential hardware: a tutorial |
| [Biere etal. '99] | A. Biere, A. Cimatti, E.M. Clarke, M. Fujita, Y. Zhu | Symbolic model checking using SAT procedures instead of BDDs |
| [Barrett '03] | Clark Barrett | Logic in Computer Science, NYU, Fall 2003 |
| [OpenGalen '03] | OpenGALEN, by Kermanog | GALEN common reference model, version 1.02, and software |
| [Baader & Nutt '03] | Franz Baader & Werner Nutt | Basic Description Logics, Ch.2 in the Description Logic Handbook |
| [Franconi '03] | Enrico Franconi | Natural Language Processing, Ch.15 in the Description Logic Handbook |
| [Gadsil, Koller, & Striegnitz '01] | M. Gabsdil, A. Koller, and K. Striegnitz | Building a Text Adventure on Description Logic |
| [Horrocks '02] | Ian Horrocks | DAML+OIL: a Description Logic for the Semantic Web |
| [Amir & McIlraith '03] | Eyal Amir and Sheila McIlraith | Partition-Based Reasoning for First-Order and Propositional Theories |
| [MacCartney etal. '03] | B. MacCartney, S. McIlraith, E. Amir, & T. Uribe | Practical Partition-Based Theorem Proving for Large Knowledge Bases |
| [Amir & Engelhardt '03] | Eyal Amir and Barbara Engelhardt | Factored Planning |
| [Rish & Dechter '00] | Irina Rish and Rina Dechter | Resolution versus Search: Two Strategies for SAT |
| [Doucet etal '00] | Arnaud Doucet, Nando de Freitas, Kevin Murphy, Stuart Russell | Rao-Blackwellised Particle Filtering for Dynamic Bayesian Networks, Tutorial, and slides |
| [Doucet etal '00b] | Arnaud Doucet, Simon Godsill, and Christophe Andrieu | On sequential Monte-Carlo sampling methods for Bayesian filtering |
| [Coates '04] | Mark Coates | Distributed particle filters for sensor networks |
| [Fox etal '01] | Dieter Fox, Sebastian Thrun, Wolfram Burgard, Frank Dellaert | Particle Filters for Mobile Robot Localization |
| [Moskewicz etal '01] | M. W. Moskewicz, C. F. Madigan, Y. Zhao, L. Zhang, and S. Malik | Chaff: engineering an efficient SAT solver |
| [Ginsberg and McAllester '94] | M. Ginsberg and D. McAllester | GSAT and Dynamic Backtracking |
| [Selman, Mitchell, and Levesque '97] | B. Selman, D. Mitchell, and H. Levesque | Generating hard satisfiability problems |
| [MacKay '98] | David MacKay | Introduction to Monte Carlo methods |
| [Crick & Pfeffer '03] | Christopher Crick and Avi Pfeffer | Loopy belief propagation as a basis for communication in sensor networks |
| [Yedidia etal. '03] | J.S. Yedidia, W.T. Freeman and Y. Weiss | Bethe free energy, Kikuchi approximations and belief propagation algorithms |
| [McEliece, MacKay & Cheng '98] | R.J. McEliece, D. J. C. MacKay, and J. F. Cheng | Turbo decoding as an instance of Pearl's `belief propagation |
| [Poole '03] | David Poole | First-order probabilistic inference |
| [Braunstein etal '03] | A. Braunstein, M. Mezard, R. Zecchina | Survey propagation: an algorithm for satisfiability |
| [Amir & Russell '03] | E. Amir and S. Russell | Logical Filtering |
| [Jaakola '00] | T. Jaakola | Tutorial on variational approximation methods (slides) |
| [El-Hay & Friedman '01] | T. El-Hay and N. Friedman | Incorporating Expressive Graphical Models in Variational Approximations: Chain-Graphs and Hidden Variables |
| [Tu & Zhu '02] | Zhuowen Tu and Song-Chun Zhu | Image Segmentation by Data-Driven Markov Chain Monte Carlo |
| [Wei & Altman '97] | L. Wei and R. B. Altman | An Automated System for Generating Comparative Disease Profiles and Making Diagnoses |
| [Segal etal. '03] | E. Segal and R. Yelensky and D. Koller | Genome-wide Discovery of Transcriptional Modules from DNA Sequence and Gene Expression |
| [Baumgartner '00] | Peter Baumgartner | FDPLL - A First-Order Davis-Putnam-Logeman-Loveland Procedure |
| [Khan etal. '03] | Z. Khan, T. Balch, and F. Dellaert | An MCMC-based Particle Filter for Tracking Multiple Interacting Targets |
| [Marthi etal. '03] | B. Marthi, H. Pasula, and S. Russell | Decayed MCMC Filtering |
| [Pfeffer '00] | A. Pfeffer | Probabilistic Reasoning for Complex Systems |
| [Pasula & Russell '00] | H. Pasula and S. Russell | Approximate Inference For First-Order Probabilistic Languages |
| [Pasula etal. '02] | H. Pasula, B. Marthi, B. Milch, S. Russell, I. Shpitser | Identity Uncertainty and Citation Matching |
| [del Val '99] | A. del Val | A New Method for Consequence Finding and Compilation for Restricted Languages |
| [de Kleer '92] | J. de Kleer | An Improved Incremental Algorithm for Generating Prime Implicates |
| [Slagle etal. '70] | J.R. Slagle, C.-L. Chang and R. Lee | A new algorithm for generating prime implicants |
| [Groote & Zantema '00] | J.F. Groote and H. Zantema | Resolution and binary decision diagrams cannot simulate each other polynomially |
| [Groote & Tveretina '03] | J.F. Groote and O. Tveretina | Binary decision diagrams for first-order predicate logic |
| [Sanghai, Domingo, & Weld '03] | S. Sanghai, P. Domingo, and D. Weld | Dynamic Probabilistic Relational Models |
| [Rabiner '89] | L.R. Rabiner | A tutorial on hidden Markov models and selected applications in speech recognition |
| [Lerner & Parr '01] | U. Lerner and R. Parr | Inference in Hybrid Networks: Theoretical Limits and Practical Algorithms |
| Key | Authors | Title |
| Key | Authors | Title |
| Key | Authors | Title |
| Key | Authors | Title |
| Key | Authors | Title |
| Key | Authors | Title |
| Viewing PostScript and PDF |
Depending on the computer you are using, you may be able to download a PostScript viewer or PDF viewer for it if you don't already have one.
| Comments to Eyal Amir |