![]() |
|
My CV (PDF).
My primary research has been in the area of predictive state models for stochastic dynamical systems. Predictive state models use predictions about future outcomes as their state, rather than distributions over latent variables, as state is commonly represented in traditional models. These models have the advantage that observational data provides samples of the values of state, whereas the latent variables of traditional models are never observed. The original paper on PSRs established that a PSR can represent any system that a POMDP can and, moreover, can do it using no more space than a POMDP.
More specifically, my current research is on the predictive linear Gaussian model (PLG), which models dynamical systems with continuous actions and observations. It has equivalent representational power to the linear dynamical system (on which the widely used Kalman filter is based). I have designed a consistent parameter estimation algorithm for this model, and am exploring improvements to the model itself and to estimating its parameters.
My past projects have included Autominder and a sparse sampling planner at HRL.
The Autominder project is directed by Martha Pollack. Autominder is a cognitive orthotic designed to help people with mild cognitive impairment carry out routine tasks while limiting users' dependence on it. My adviser (Satinder Singh) and I worked with Professor Pollack to study the effectiveness of using reinforcement learning to improve Autominder's user interaction.
At HRL Laboratories, I worked on a sensor resource management problem in which a sensor with several modes must trade off among qualities like accuracy, focus area, and energy cost to maximize an informational measure. I adapted Kearns, Mansour and Ng's sparse sampling algorithm to this problem, which has a mixture of continuous and discrete elements.
Jump to: Conference Papers Workshop Papers