Project
Papers:
Minimax Probability Machine:
- G.
Lanckriet, L. El Ghaoui, C. Bhattacharyya, & M. I. Jordan. Minimax Probability Machine. Accepted for
14th Conference on Neural Information Processing Systems, Vancouver,
Canada, December 3-8, 2001.
SVM Value Function and Policy Gradient RL:
- Kakade,
S (2001). A natural policy gradient. NIPS 2002 [ps.gz]
- Dietterich,
T. G. and Wang, X. (to appear). Batch value function approximation via
support vectors. Accepted for publication in Dietterich, T. G.,
Becker, S., Ghahramani, Z. (Eds.) Advances in Neural Information
Processing Systems 14. Cambridge, MA: MIT Press. Postscript preprint.
- Baird,
L. C., & Moore, A. W. (1999). "Gradient descent for
general reinforcement learning". Advances in Neural
Information Processing Systems 11.
[7 pages: ps.gz | pdf | rtf | HTML]
Reinforcement Learning and Robotics (Soccer):
- G. Z. Grudic and L. H. Ungar.
Exploiting Multiple Secondary Reinforcers in Policy
Gradient Reinforcement Learning, Seventeenth International Joint
Conference on Artificial Intelligence (IJCAI 01), August 4th - 10th, 2001,
Seattle, Washington, USA (postscript)(pdf)
Active Learning for Probabilistic Classification and
Clustering in Relational Data:
- Probabilistic
Clustering in Relational Data, B. Taskar, E. Segal, and D. Koller. Seventeenth
International Joint Conference on Artificial Intelligence, Seattle,
Washington, August 2001, pages 870--876. =
- Active
Learning for Structure in Bayesian Networks, S. Tong and D. Koller. Seventeenth
International Joint Conference on Artificial
- Rich
Probabilistic Models for Gene Expression, E. Segal, B. Taskar, A.
Gasch, N. Friedman, and D. Koller. Ninth International Conference on
Intelligent Systems for Molecular Biology (ISMB), Copenhagen, Denmark,
June 2001, pages 243--252.