[1a]Reinforcement Learning 1 - Domain Representation
571An Object-Oriented Representation for Efficient Reinforcement Learning. Carlos Diuk, Andre Cohen, and Michael Littman
544Hierarchical Model-Based Reinforcement Learning: R-max + MAXQ. Nicholas Jong and Peter Stone
682On the Hardness of Finding Symmetries in Markov Decision Processes. Shravan Narayanamurthy and Balaraman Ravindran
197Efficiently Learning Linear-Linear Exponential Family Predictive Representations of State. David Wingate and Satinder Singh
[2a]Reinforcement Learning 2 - Value Representation
341Online Kernel Selection for Bayesian Reinforcement Learning. Joseph Reisinger, Peter Stone, and Risto Miikkulainen
125A Worst-Case Comparison Between Temporal Difference and Residual Gradient with Linear Function Approximation. Lihong Li
581An Analysis of Linear Models, Linear Value-Function Approximation, and Feature Selection for Reinforcement Learning. Ronald Parr, Lihong Li, Gavin Taylor, Christopher Painter-Wakefield, and Michael Littman
429A Semi-parametric Statistical Approach to Model-free Policy Evaluation. Tsuyoshi Ueno, Motoaki Kawanabe, Takeshi Mori, Shin-Ichi Maeda, and Shin Ishii
[4a]Reinforcement Learning 3
259Non-Parametric Policy Gradients: A Unified Treatment of Propositional and Relational Domains. Kristian Kersting and Kurt Driessens
488Space-indexed Dynamic Programming: Learning to Follow Trajectories. J. Zico Kolter, Adam Coates, Andrew Ng, Yi Gu, and Charles DuHadway
335Privacy-Preserving Reinforcement Learning. Jun Sakuma, Shigenobu Kobayashi, and Rebecca Wright
645Apprenticeship Learning Using Linear Programming. Umar Syed, Michael Bowling, and Robert Schapire
257Learning All Optimal Policies with Multiple Criteria. Leon Barrett and Srinivas Narayanan
[5a]Reinforcement Learning 4 - Active Learning
290Active Reinforcement Learning. Arkady Epshteyn, Adam Vogel, and Gerald DeJong
627Knows What It Knows: A Framework For Self-Aware Learning. Lihong Li, Michael Littman, and Thomas Walsh
487Reinforcement Learning with Limited Reinforcement: Using Bayes Risk for Active Learning in POMDPs. Finale Doshi, Joelle Pineau, and Nicholas Roy
490The Many Faces of Optimism: a Unifying Approach. Istvan Szita and Andras Lorincz
479Transfer of Samples in Batch Reinforcement Learning. Alessandro Lazaric, Marcello Restelli, and Andrea Bonarini
[6a]Reinforcement Learning 5
452Learning for Control from Multiple Demonstrations. Adam Coates, Pieter Abbeel, and Andrew Ng
580Reinforcement Learning in the Presence of Rare Events. Jordan Frank, Shie Mannor, and Doina Precup
317On-line Discovery of Temporal-Difference Networks. Takaki Makino and Toshihisa Takagi
111Preconditioned Temporal Difference Learning. Hengshuai Yao and Zhi-Qiang Liu
[7a]Reinforcement Learning 6
458Automatic Discovery and Transfer of MAXQ Hierarchies. Neville Mehta, Soumya Ray, Prasad Tadepalli, and Thomas Dietterich
652An Analysis of Reinforcement Learning with Function Approximation. Francisco Melo, Sean Meyn, and Isabel Ribeiro
519Exploration Scavenging. John Langford, Alexander Strehl, and Jennifer Wortman
564Sample-Based Learning and Search with Permanent and Transient Memories. David Silver, Richard Sutton, and Martin Mueller
[8a]Transfer Learning and Games
412Learning to Learn Implicit Queries from Gaze Patterns. Kai Puolamäki, Antti Ajanki, and Samuel Kaski
520Multi-Task Learning for HIV Therapy Screening. Steffen Bickel, Jasmina Bogojeska, Thomas Lengauer, and Tobias Scheffer
229Manifold Alignment using Procrustes Analysis. Chang Wang and Sridhar Mahadevan
542No-Regret Learning in Convex Games. Geoffrey J. Gordon, Amy Greenwald, and Casey Marks
655Strategy Evaluation in Extensive Games with Importance Sampling. Michael Bowling, Michael Johanson, Neil Burch, and Duane Szafron
[1b]Kernels
377Tailoring Density Estimation via Reproducing Kernel Moment Matching. Le Song, Xinhua Zhang, Alex Smola, Arthur Gretton, and Bernhard Schoelkopf
277Nonextensive Entropic Kernels. Andre F. T. Martins, Mario A. T. Figueiredo, Pedro M. Q. Aguiar, Noah A. Smith, and Eric P. Xing
216Nu-Support Vector Machine as Conditional Value-at-Risk Minimization. Akiko Takeda and Masashi Sugiyama
643A Generalization of Haussler's Convolution Kernel - Mapping Kernel. Kilho Shin and Tetsuji Kuboyama
[2b]Active Learning and Experimental design
448Optimizing Estimated Loss Reduction for Active Sampling in Rank Learning. Pinar Donmez and Jaime Carbonell
324Hierarchical sampling for active learning. Sanjoy Dasgupta and Daniel Hsu
437Active Kernel Learning. Steven C.H. Hoi and Rong Jin
687Actively Learning Level-Sets of Composite Functions. Brent Bryan and Jeff Schneider
[4b]Kernel - Including Kernel Learning
158Localized Multiple Kernel Learning. Mehmet Gonen and Ethem Alpaydin
665Composite Kernel Learning. Marie Szafranski, Yves Grandvalet, and Alain Rakotomamonjy
531Training SVM with Indefinite Kernels. Jianhui Chen and Jieping Ye
449Robust Matching and Recognition using Context-Dependent Kernels. Hichem Sahbi, Jean-Yves Audibert, Jaonary Rabarisoa, and Renaud Keriven
641An RKHS for Multi-View Learning and Manifold Co-Regularization. Vikas Sindhwani and David Rosenberg
[5b]Gaussian Processes
151Fast Gaussian Process Methods for Point Process Intensity Estimation. John Cunningham, Krishna Shenoy, and Maneesh Sahani
241Gaussian Process Product Models for Nonparametric Nonstationarity. Ryan Adams and Oliver Stegle
599Sparse Multiscale Gaussian Process Regression. Christian Walder, Kwang In Kim, and Bernhard Schoelkopf
371Topologically-Constrained Latent Variable Models. Raquel Urtasun, David Fleet, Andreas Geiger, Jovan Popovic, Trevor Darrell, and Neil Lawrence
399Bi-Level Path Following for Cross Validated Solution of Kernel Quantile Regression. Saharon Rosset
[6b]Boosting
258Random Classification Noise Defeats All Convex Potential Boosters. Philip M. Long and Rocco A. Servedio
676ManifoldBoost: Stagewise Function Approximation for Fully-, Semi- and Un-supervised Learning. Nicolas Loeff, David Forsyth, and Deepak Ramachandran
331Boosting with Incomplete Information. Gholamreza Haffari, Yang Wang, Shaojun Wang, Greg Mori, and Feng Jiao
362Maximum Likelihood Rule Ensembles. Wojciech Kotlowski, Krzysztof Dembczynski, and Roman Slowinski
[7b]Online learning
367Rank Minimization via Online Learning. Raghu Meka, Prateek Jain, Constantine Caramanis, and Inderjit Dhillon
322Confidence-Weighted Linear Classification. Mark Dredze, Koby Crammer, and Fernando Pereira
355The Projectron: a Bounded Kernel-Based Perceptron. Francesco Orabona, Joseph Keshet, and Barbara Caputo
511Efficient Bandit Algorithms for Online Multiclass Prediction. Sham M. Kakade, Shai Shalev-Shwartz, and Ambuj Tewari
242Prediction with Expert Advice for the Brier Game. Vladimir Vovk and Fedor Zhdanov
[8b]Kernels - Including scalability
166A Dual Coordinate Descent Method for Large-scale Linear SVM. Cho-Jui Hsieh, Kai-Wei Chang, Chih-Jen Lin, S. Sathiya Keerthi, and S. Sundararajan
411Optimized Cutting Plane Algorithm for Support Vector Machines. Vojtech Franc and Soeren Sonnenburg
491Fast Support Vector Machine Training and Classification on Graphics Processors. Bryan Catanzaro, Narayanan Sundaram, and Kurt Keutzer
266SVM Optimization: Inverse Dependence on Training Set Size. Shai Shalev-Shwartz and Nathan Srebro
476Improved Nystrom Low-Rank Approximation and Error Analysis. Kai Zhang, Ivor Tsang, and James Kwok
[1c]Clustering
628A Rate-Distortion One-Class Model and its Applications to Clustering. Koby Crammer, Partha Pratim Talukdar, and Fernando Pereira
196Estimating Local Optimums in EM Algorithm over Gaussian Mixture Model. Zhenjie Zhang, Bing Tian Dai, and Anthony K.H. Tung
236A Decoupled Approach to Exemplar-based Unsupervised Learning.. Sebastian Nowozin and Gökhan Bakir
168Efficient MultiClass Maximum Margin Clustering. Bin Zhao, Fei Wang, and Changshui Zhang
[2c]Distance learning and Efficient Use
215Fast Solvers and Efficient Implementations for Distance Metric Learning. Kilian Weinberger and Lawrence Saul
178Nearest Hyperdisk Methods for High-Dimensional Classification. Hakan Cevikalp, Bill Triggs, and Robi Polikar
400Fast Nearest Neighbor Retrieval for Bregman Divergences. Lawrence Cayton
340Deep Learning via Semi-Supervised Embedding. Jason Weston, Frederic Ratle, and Ronan Collobert
[4c]Semi-supervised Learning - Embeddings and Transduction
611Semi-supervised Learning of Compact Document Representations with Deep Networks. Marc'Aurelio Ranzato and Martin Szummer
382Large Scale Manifold Transduction. Michael Karlen, Jason Weston, Ayse Erkan, and Ronan Collobert
296Graph Transduction via Alternating Minimization. Jun Wang, Tony Jebara, and Shih-Fu Chang
254Stability of Transductive Regression Algorithms. Corinna Cortes, Mehryar Mohri, Dmitry Pechyony, and Ashish Rastogi
383On Multi-View Active Learning and the Combination with Semi-Supervised Learning. Wei Wang and Zhi-Hua Zhou
[5c]Semi-supervised clustering and classification
337Estimating Labels from Label Proportions. Novi Quadrianto, Alex Smola, Tiberio Caetano, and Quoc Viet Le
432Self-taught Clustering. Wenyuan Dai, Qiang Yang, Gui-Rong Xue, and Yong Yu
172Spectral Clustering with Inconsistent Advice. Tom Coleman, James Saunderson, and Anthony Wirth
145Pairwise Constraint Propagation by Semidefinite Programming for Semi-Supervised Classification. Zhenguo Li, Jianzhuang Liu, and Xiaoou Tang
528The Asymptotics of Semi-Supervised Learning in Discriminative Probabilistic Models. Nataliya Sokolovska, Olivier Cappé, and François Yvon
[6c]Discriminative vs Generative, and Energy-Based Learning
415Discriminative Parameter Learning for Bayesian Networks. Jiang Su, Harry Zhang, Charles X. Ling, and Stan Matwin
588An Asymptotic Analysis of Generative, Discriminative, and Pseudolikelihood Estimators. Percy Liang and Michael Jordan
601Classification using Discriminative Restricted Boltzmann Machines. Hugo Larochelle and Yoshua Bengio
573On the Quantitative Analysis of Deep Belief Networks. Ruslan Salakhutdinov and Iain Murray
638Training Restricted Boltzmann Machines using Approximations to the Likelihood Gradient. Tijmen Tieleman
[7c]Embeddings
163Uncorrelated Multilinear Principal Component Analysis through Successive Variance Maximization. Haiping Lu, Konstantinos Plataniotis, and Anastasios Venetsanopoulos
484Expectation-Maximization for Sparse and Non-Negative PCA. Christian David Sigg and Joachim M. Buhmann
551ICA and ISA Using Schweizer-Wolff Measure of Dependence. Sergey Kirshner and Barnabás Póczos
600Bayesian Probabilistic Matrix Factorization using Markov Chain Monte Carlo. Ruslan Salakhutdinov and Andriy Mnih
[8c]Embeddings
270A Least Squares Formulation for Canonical Correlation Analysis. Liang Sun, Shuiwang Ji, and Jieping Ye
668Closed-form Supervised Dimensionality Reduction with Generalized Linear Models. Irina Rish, Genady Grabarnilk, Guillermo Cecchi, Francisco Pereira, and Geoffrey J. Gordon
312Grassmann Discriminant Analysis: a Unifying View on Subspace-Based Learning. Jihun Hamm and Daniel Lee
582Metric Embedding for Kernel Classification Rules. Bharath Sriperumbudur, Omer Lang, and Gert Lanckriet
592Extracting and Composing Robust Features with Denoising Autoencoders. Pascal Vincent, Hugo Larochelle, Yoshua Bengio, and Pierre-Antoine Manzagol
[1d]Hidden Markov Models
182Inverting the Viterbi Algorithm: an Abstract Framework for Structure Design. Michael Schnall-Levin, Leonid Chindelevitch, and Bonnie Berger
305An HDP-HMM for Systems with State Persistence. Emily Fox, Erik Sudderth, Michael Jordan, and Alan Willsky
413Modeling Interleaved Hidden Processes. Niels Landwehr
679Beam Sampling for the Infinite Hidden Markov Model. Jurgen Van Gael, Yunus Saatci, Yee Whye Teh, and Zoubin Ghahramani
[2d]Mixture models, Dirichlet processes
460Statistical Models for Partial Membership. Katherine Heller, Sinead Williamson, and Zoubin Ghahramani
538The Dynamic Hierarchical Dirichlet Process. Lu Ren, David B. Dunson, and Lawrence Carin
554Hierarchical Kernel Stick-Breaking Process for Multi-Task Image Analysis. Qi An, Chunping Wang, Ivo Shterev, Eric Wang, Lawrence Carin, and David B. Dunson
502Data Spectroscopy: Learning Mixture Models using Eigenspaces of Convolution Operators. Tao Shi, Mikhail Belkin, and Bin Yu
[4d]Ranking and Classification with Sampling
489Democratic Approximation of Lexicographic Preference Models. Fusun Yaman, Thomas Walsh, Michael Littman, and Marie desJardins
343Unsupervised Rank Aggregation with Distance-Based Models. Alexandre Klementiev, Dan Roth, and Kevin Small
392Learning Dissimilarities by Ranking: From SDP to QP. Hua Ouyang and Alexander Gray
523Empirical Bernstein Stopping. Volodymyr Mnih, Csaba Szepesvari, and Jean-Yves Audibert
614Pointwise Exact Bootstrap Distributions of Cost Curves. Charles Dugas and David Gadoury
[5d]Sequence Data
278A Distance Model for Rhythms. Jean-Francois Paiement, Yves Grandvalet, Samy Bengio, and Douglas Eck
318A Reproducing Kernel Hilbert Space Framework for Pairwise Time Series Distances. Zhengdong Lu, Todd K. Leen, Yonghong Huang, and Deniz Erdogmus
440Sequence Kernels for Predicting Protein Essentiality. Cyril Allauzen, Mehryar Mohri, and Ameet Talwalkar
180Local Likelihood Modeling of Temporal Text Streams. Guy Lebanon and Yang Zhao
160Causal Modelling Combining Instantaneous and Lagged Effects: an Identifiable Model Based on Non-Gaussianity. Aapo Hyvarinen, Shohei Shimizu, and Patrik Hoyer
[6d]Ranking and IR
167Listwise Approach to Learning to Rank - Theory and Algorithm. Fen Xia, Tie-Yan Liu, Jue Wang, Wensheng Zhang, and Hang Li
179Query-Level Stability and Generalization in Learning to Rank. Yanyan Lan, Tie-Yan Liu, Tao Qin, Zhiming Ma, and Hang Li
470Predicting Diverse Subsets Using Structural SVMs. Yisong Yue and Thorsten Joachims
264Learning Diverse Rankings with Multi-Armed Bandits. Filip Radlinski, Robert Kleinberg, and Thorsten Joachims
[7d]Topic models
562mStruct: A New Admixture Model for Inference of Population Structure in Light of Both Genetic Admixing and Allele Mutations. Suyash Shringarpure and Eric Xing
419Memory Bounded Inference in Topic Models. Ryan Gomes, Max Welling, and Pietro Perona
667Nonnegative Matrix Factorization via Rank-One Downdate. Michael Biggs, Ali Ghodsi, and Stephen Vavasis
129Dirichlet Component Analysis: Feature Extraction for Compositional Data. Hua-Yan Wang, Qiang Yang, Hong Qin, and Hongbin Zha
[8d]NLP
304Learning to Sportscast: A Test of Grounded Language Acquisition. David Chen and Raymond Mooney
391A Unified Architecture for Natural Language Processing: Deep Neural Networks with Multitask Learning. Ronan Collobert and Jason Weston
398Modified MMI/MPE: a Direct Evaluation of the Margin in Speech Recognition. Georg Heigold, Thomas Deselaers, Ralf Schlueter, and Hermann Ney
311Fully Distributed EM for Very Large Datasets. Jason Wolfe, Aria Haghighi, and Dan Klein
673Structure Compilation: Trading Structure for Features. Percy Liang, Hal Daume, and Dan Klein
[1e]Graphs
379Graph Kernels Between Point Clouds. Francis Bach
396The Skew Spectrum of Graphs. Risi Kondor and Karsten Borgwardt
681Message-passing for Graph-structured Linear Programs: Proximal Projections, Convergence and Rounding Schemes. Pradeep Ravikumar, Alekh Agarwal, and Martin J. Wainwright
565Fast Incremental Proximity Search in Large Graphs. Purnamrita Sarkar, Andrew Moore, and Amit Prakash
[2e]Optimization
327Efficiently Solving Convex Relaxations for MAP Estimation. Pawan Kumar Mudigonda and Philip Torr
461A Quasi-Newton Approach to Nonsmooth Convex Optimization. Jin Yu, S.V.N. Vishwanathan, Simon Guenter, and Nicol Schraudolph
497Stopping Conditions for Exact Computation of Leave-One-Out Error in Support Vector Machines. Vojtech Franc, Pavel Laskov, and Klaus-R. Mueller
260On Partial Optimality in Multi-label MRFs. Pushmeet Kohli, Alexander Shekhovtsov, Carsten Rother, Vladimir Kolmogorov, and Philip Torr
[4e]Structured output, ILP and Sparsity
402Accurate Max-margin Training for Structured Output Spaces. Sunita Sarawagi and Rahul Gupta
279Training Structural SVMs when Exact Inference is Intractable. Thomas Finley and Thorsten Joachims
530Discriminative Structure and Parameter Learning for Markov Logic Networks. Tuyen Huynh and Raymond Mooney
237Laplace Maximum Margin Markov Networks. Jun Zhu, Eric Xing, and Bo Zhang
503Fast Estimation of Relational Pattern Coverage through Randomization and Maximum Likelihood. Ondrej Kuzelka and Filip Zelezny
[5e]Feature selection and sparsity
630Detecting Statistical Interactions with Additive Groves of Trees. Daria Sorokina, Rich Caruana, Mirek Riedewald, and Daniel Fink
574Sparse Bayesian Nonparametric Regression. Francois Caron and Arnaud Doucet
390Bolasso: Model Consistent Lasso Estimation through the Bootstrap. Francis Bach
113The GroupLASSO for Generalized Linear Models: Uniqueness of Solutions and Efficient Algorithms. Volker Roth and Bernd Fischer
323On the Chance Accuracies of Large Collections of Classifiers. Mark Palatucci and Andrew Carlson
[6e]Compressed Sensing and Projections
121Autonomous Geometric Precision Error Estimation in Low-level Computer Vision Tasks. Andrés Corrada-Emmanuel and Howard Schultz
209Multi-Task Compressive Sensing with Dirichlet Process Priors. Yuting Qi, Dehong Liu, David Dunson, and Lawrence Carin
459Compressed Sensing and Bayesian Experimental Design. Matthias Seeger and Hannes Nickisch
361Efficient Projections onto the L1-Ball for Learning in High Dimensions. John Duchi, Shai Shalev-Shwartz, Yoram Singer, and Tushar Chandra
[7e]Classification
455Bayes Optimal Classification for Decision Trees. Siegfried Nijssen
536Multi-Classification by Categorical Features via Clustering. Yevgeny Seldin and Naftali Tishby
150Cost-Sensitive Multi-class Classification from Probability Estimates. Deirdre O'Brien, Maya Gupta, and Robert Gray
513Polyhedral Classifier for Target Detection A Case Study: Colorectal Cancer. Murat Dundar, Matthias Wolf, Sarang Lakare, Marcos Salganicoff, and Vikas C. Raykar
[8e]Multiple Instance Learning and Learning with Missing Features
130Adaptive p-Posterior Mixture-Model Kernels for Multiple Instance Learning. Hua-Yan Wang, Qiang Yang, and Hongbin Zha
552Multiple Instance Ranking. Charles Bergeron, Jed Zaretzki, Curt Breneman, and Kristin Bennett
587Bayesian Multiple Instance Learning: Automatic Feature Selection and Inductive Transfer. Vikas Raykar, Balaji Krishnapuram, Jinbo Bi, Murat Dundar, and R. Bharat Rao
202Learning to Classify with Missing and Corrupted Features. Ofer Dekel and Ohad Shamir
272Learning from Incomplete Data with Infinite Imputations. Uwe Dick, Peter Haider, and Tobias Scheffer