Publications
See Google Scholar for the most up-to-date list of papers
Since founding you.com and AIXVentures.com, I’ve not had the time and resources to publish (m)any proper research papers. So here are just some recent thoughts.
2020
ProGen: Language Modeling for Protein Generation, Ali Madani, Bryan McCann, Nikhil Naik, Nitish Shirish Keskar, Namrata Anand, Raphael R Eguchi, Po-Ssu Huang and Richard Socher.
[ bioRxiv link, blog ]
The AI Economist: Improving Equality and Productivity with AI-Driven Tax Policies, Stephan Zheng, Alexander Trott, Sunil Srinivasa, Nikhil Naik, Melvin Gruesbeck, David C. Parkes, Richard Socher.
[ arxiv link, blog, short video, Q&A, Press: VentureBeat, TechCrunch ]
Deep Learning-enabled Breast Cancer Hormonal Receptor Status Determination from Base-level H&E Stains, Nikhil Naik, Ali Madani, Andre Esteva, Nitish Keskar, Michael Press, Dan Ruderman, David Agus, Richard Socher
(Nature Communications 2020) [ paper, blog ]
Dye-sensitized solar cells under ambient light powering machine learning: towards autonomous smart sensors for the internet of things, Hannes Michaels, Michael Rinderle, Richard Freitag, Iacopo Benesperi, Tomas Edvinsson, Richard Socher, Alessio Gagliardi and Marina Freitag
Issue 11 (Chemical Science 2020). [ paper link ]
Deep learning-enabled medical computer vision
Andre Esteva, Katherine Chou, Serena Yeung, Nikhil Naik, Ali Madani, Ali Mottaghi, Yun Liu, Eric Topol, Jeff Dean, Richard Socher (NPJ digital medicine)
Bridging Textual and Tabular Data for Cross-Domain Text-to-SQL Semantic Parsing
Xi Victoria Lin, Richard Socher, Caiming Xiong · EMNLP 2020
Catastrophic Fisher Explosion: Early Phase Fisher Matrix Impacts Generalization
Stanislaw Jastrzebski, Devansh Arpit, Oliver Astrand, Giancarlo Kerg, Huan Wang, Caiming Xiong, Richard Socher, Kyunghyun Cho, Krzysztof Geras · arXiv 2020
Find or Classify? Dual Strategy for Slot-Value Predictions on Multi-Domain Dialog State Tracking
Jian-Guo Zhang, Kazuma Hashimoto, Chien-Sheng Wu, Yao Wan, Philip S. Yu, Richard Socher, Caiming Xiong · *SEM 2020
Online Structured Meta-learning
Huaxiu Yao, Yingbo Zhou, Mehrdad Mahdavi, Zhenhui Li, Richard Socher, Caiming Xiong · NeurIPS 2020
GeDi: Generative Discriminator Guided Sequence Generation
Ben Krause, Akhilesh Deepak Gotmare, Bryan McCann, Nitish Shirish Keskar, Shafiq Joty, Richard Socher, Nazneen Fatema Rajani · arXiv 2020
Explaining and Improving Model Behavior with k Nearest Neighbor Representations
Nazneen Fatema Rajani, Ben Krause, Wenpeng Yin, Tong Niu, Richard Socher, Caiming Xiong · arXiv 2020
Explaining Creative Artifacts
Lav R. Varshney, Nazneen Fatema Rajani, Richard Socher · WHI 2020
Theory-Inspired Path-Regularized Differential Network Architecture Search
Pan Zhou, Caiming Xiong, Richard Socher, Steven C. H. Hoi · NeurIPS 2020
Universal Natural Language Processing with Limited Annotations: Try Few-shot Textual Entailment as a Start
Wenpeng Yin, Nazneen Fatema Rajani, Dragomir Radev, Richard Socher, Caiming Xiong · EMNLP 2020
TOD-BERT: Pre-trained Natural Language Understanding for Task-Oriented Dialogue
Chien-Sheng Wu, Steven Hoi, Richard Socher, Caiming Xiong · EMNLP 2020
GraPPa: Grammar-Augmented Pre-Training for Table Semantic Parsing
Tao Yu, Chien-Sheng Wu, Xi Victoria Lin, Bailin Wang, Yi Chern Tan, Xinyi Yang, Dragomir Radev, Richard Socher, Caiming Xiong · arXiv 2020
Composed Variational Natural Language Generation for Few-shot Intents
Congying Xia, Caiming Xiong, Philip Yu, Richard Socher · EMNLP 2020
Central Yup'ik and Machine Translation of Low-Resource Polysynthetic Languages
Christopher Liu, Laura Dominé, Kevin Chavez, Richard Socher · arXiv 2020
Photon: A Robust Cross-Domain Text-to-SQL System
Jichuan Zeng, Xi Victoria Lin, Caiming Xiong, Richard Socher, Michael R. Lyu, Irwin King, Steven C. H. Hoi · ACL 2020
Explore, Discover and Learn: Unsupervised Discovery of State-Covering Skills
Víctor Campos, Alexander Trott, Caiming Xiong, Richard Socher, Xavier Giro-i-Nieto, Jordi Torres · arXiv 2020
An investigation of phone-based subword units for end-to-end speech recognition
Weiran Wang, Guangsen Wang, Aadyot Bhatnagar, Yingbo Zhou, Caiming Xiong, Richard Socher · Interspeech 2020
Prototypical Contrastive Learning of Unsupervised Representations
Junnan Li, Pan Zhou, Caiming Xiong, Richard Socher, Steven C. H. Hoi · arXiv 2020
BERTology Meets Biology: Interpreting Attention in Protein Language Models
Jesse Vig, Ali Madani, Lav R. Varshney, Caiming Xiong, Richard Socher, Nazneen Fatema Rajani · arXiv 2020
A Simple Language Model for Task-Oriented Dialogue
Ehsan Hosseini-Asl, Bryan McCann, Chien-Sheng Wu, Semih Yavuz, Richard Socher · arXiv 2020
DART: Open-Domain Structured Data Record to Text Generation
Dragomir Radev, Rui Zhang, Amrit Rau, Abhinand Sivaprasad, Chiachun Hsieh, Nazneen Fatema Rajani, Xiangru Tang, Aadit Vyas, Neha Verma, Pranav Krishna, Yangxiaokang Liu, Nadia Irwanto, Jessica Pan, Faiaz Rahman, Ahmad Zaidi, Murori Mutuma, Yasin Tarabar, Ankit Gupta, Tao Yu, Yi Chern Tan, Xi Victoria Lin, Caiming Xiong, Richard Socher · arXiv 2020
Towards Understanding Hierarchical Learning: Benefits of Neural Representations
Minshuo Chen, Yu Bai, Jason D. Lee, Tuo Zhao, Huan Wang, Caiming Xiong, Richard Socher · arXiv 2020
Explicit Memory Tracker with Coarse-to-Fine Reasoning for Conversational Machine Reading
Yifan Gao, Chien-Sheng Wu, Shafiq Joty, Caiming Xiong, Richard Socher, Irwin King, Michael R. Lyu, Steven C. H. Hoi · ACL 2020
CO-Search: COVID-19 Information Retrieval with Semantic Search, Question Answering, and Abstractive Summarization
Andre Esteva, Anuprit Kale, Romain Paulus, Kazuma Hashimoto, Wenpeng Yin, Dragomir Radev, Richard Socher · arXiv 2020
WOAD: Weakly Supervised Online Action Detection in Untrimmed Videos
Mingfei Gao, Yingbo Zhou, Ran Xu, Richard Socher, Caiming Xiong · arXiv 2020
Learning from Noisy Anchors for One-stage Object Detection
Hengduo Li, Zuxuan Wu, Chen Zhu, Caiming Xiong, Richard Socher, Larry S. Davis · CVPR 2020
ESPRIT: Explaining Solutions to Physical Reasoning Tasks
Nazneen Fatema Rajani, Rui Zhang, Yi Chern Tan, Stephan Zheng, Jeremy Weiss, Aadit Vyas, Abhijit Gupta, Caiming Xiong, Richard Socher, Dragomir Radev · ACL 2020
It's Morphin' Time! Combating Linguistic Discrimination with Inflectional Perturbations
Samson Tan, Shafiq Joty, Min-Yen Kan, Richard Socher · ACL 2020
ERASER: A Benchmark to Evaluate Rationalized NLP Models
Jay DeYoung, Sarthak Jain, Nazneen Fatema Rajani, Eric Lehman, Caiming Xiong, Richard Socher, Byron C. Wallace · ACL 2020
Improving out-of-distribution generalization via multi-task self-supervised pretraining
Isabela Albuquerque, Nikhil Naik, Junnan Li, Nitish Keskar, Richard Socher · arXiv 2020
Towards Noise-resistant Object Detection with Noisy Annotations
Junnan Li, Caiming Xiong, Richard Socher, Steven Hoi · arXiv 2020
Taylorized Training: Towards Better Approximation of Neural Network Training at Finite Width
Yu Bai, Ben Krause, Huan Wang, Caiming Xiong, Richard Socher · arXiv 2020
Neural Bayes: A Generic Parameterization Method for Unsupervised Representation Learning
Devansh Arpit, Huan Wang, Caiming Xiong, Richard Socher, Yoshua Bengio · arXiv 2020
Tree-structured Attention with Hierarchical Accumulation
Xuan-Phi Nguyen, Shafiq Joty, Steven C. H. Hoi, Richard Socher · ICLR 2020
Non-Autoregressive Dialog State Tracking
Hung Le, Richard Socher, Steven C. H. Hoi · ICLR 2020
DivideMix: Learning with Noisy Labels as Semi-supervised Learning
Junnan Li, Richard Socher, Steven C. H. Hoi · arXiv 2020
Learning to Retrieve Reasoning Paths over Wikipedia Graph for Question Answering
Akari Asai, Kazuma Hashimoto, Hannaneh Hajishirzi, Richard Socher, Caiming Xiong · ICLR 2020
Limits of Detecting Text Generated by Large-Scale Language Models
Lav R. Varshney, Nitish Shirish Keskar, Richard Socher · ITA 2020
2019
MKD: a Multi-Task Knowledge Distillation Approach for Pretrained Language Models
Linqing Liu, Huan Wang, Jimmy Lin, Richard Socher, Caiming Xiong · arXiv 2019
A High-Quality Multilingual Dataset for Structured Documentation Translation
Kazuma Hashimoto, Raffaella Buschiazzo, James Bradbury, Teresa Marshall, Richard Socher, Caiming Xiong · WMT 2019
Discriminative Nearest Neighbor Few-Shot Intent Detection by Transferring Natural Language Inference
Jian-Guo Zhang, Kazuma Hashimoto, Wenhao Liu, Chien-Sheng Wu, Yao Wan, Philip S. Yu, Richard Socher, Caiming Xiong · EMNLP 19
CTRL: A Conditional Transformer Language Model for Controllable Generation, Nitish Shirish Keskar, Bryan McCann, Lav R. Varshney, Caiming Xiong, Richard Socher.
[ arxiv link, code (pre-trained and fine-tuning), blog ]
Genie: a generator of natural language semantic parsers for virtual assistant commands, Giovanni Campagna, Silei Xu, Mehrad Moradshahi, Richard Socher, Monica S. Lam
PLDI 2019 [ pdf link, https://almond.stanford.edu/ ]
Keeping Your Distance: Solving Sparse Reward Tasks Using Self-Balancing Shaped Rewards, Alex Trott, Stephan Zheng, Caiming Xiong, Richard Socher.
Thirty-third Conference on Neural Information Processing Systems (NeurIPS 2019).
The State of Text Summarization: A Critical Evaluation, Wojciech Kryściński, Nitish Shirish Keskar, Bryan McCann, Caiming Xiong, Richard Socher.
2019 Conference on Empirical Methods in Natural Language Processing and 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP 2019). [ arxiv link ]
WSLLN: Weakly Supervised Natural Language Localization Networks, Mingfei Gao, Larry Davis, Richard Socher, Caiming Xiong.
2019 Conference on Empirical Methods in Natural Language Processing and 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP 2019). [ arxiv link ]
Editing-based SQL Query Generation for Cross-Domain Context-Dependent Questions, Rui Zhang, Tao Yu, Heyang Er, Sungrok Shim, Eric Xue, Xi Victoria Lin, Tianze Shi, Caiming Xiong, Richard Socher and Dragomir Radev
2019 Conference on Empirical Methods in Natural Language Processing and 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP 2019). [ arxiv link ]
CoSQL: A Conversational Text-to-SQL Challenge Towards Cross-Domain Natural Language Interfaces to Databases, Tao Yu, Rui Zhang, Heyang Er, Suyi Li, Eric Xue, Bo Pang, Xi Victoria Lin, Yi Chern Tan, Tianze Shi, Zihan Li, Youxuan Jiang, Michihiro Yasunaga, Sungrok Shim, Tao Chen, Alexander Fabbri, Zifan Li, Luyao Chen, Yuwen Zhang, Shreya Dixit, Vincent Zhang, Caiming Xiong, Richard Socher, Walter Lasecki, Dragomir Radev.
2019 Conference on Empirical Methods in Natural Language Processing and 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP 2019). [ arxiv link ]
Transferable Multi-Domain State Generator for Task-Oriented Dialogue Systems, Chien-Sheng Wu, Andrea Madotto, Ehsan Hosseini-Asl, Caiming Xiong, Richard Socher, Pascale Fung
The 57th Annual Meeting of the Association for Computational Linguistics (ACL 2019). Outstanding Paper Award. [ arxiv pdf, code ]
Explain Yourself! Leveraging Language Models for Commonsense Reasoning, Nazneen Fatema Rajani, Bryan McCann, Caiming Xiong and Richard Socher
The 57th Annual Meeting of the Association for Computational Linguistics (ACL 2019). [ arxiv pdf, Blog Post, Github, Press: VentureBeat, Silicon Angle, ZDNet ]
SParC: Cross-Domain Semantic Parsing in Context, Tao Yu, Rui Zhang, Michihiro Yasunaga, Yi Chern Tan, Xi Victoria Lin, Suyi Li, Heyang Er, Irene Li, Bo Pang, Tao Chen, Emily Ji, Shreya Dixit, David Proctor, Sungrok Shim, Jonathan Kraft, Vincent Zhang, Caiming Xiong, Richard Socher and Dragomir Radev
The 57th Annual Meeting of the Association for Computational Linguistics (ACL 2019). [ arxiv pdf, Challenge and Leaderboard ]
Global-to-local Memory Pointer Networks for Task-Oriented Dialogue, Chien-Sheng Wu, Richard Socher, Caiming Xiong
International Conference on Learning Representations (ICLR 2019). [ arxiv pdf ]
Self-Monitoring Navigation Agent via Auxiliary Progress Estimation, Chih-Yao Ma, Jiasen Lu, Zuxuan Wu, Ghassan AlRegib, Zsolt Kira, Richard Socher, Caiming Xiong
International Conference on Learning Representations (ICLR 2019). [ arxiv pdf ]
Coarse-grain Fine-grain Coattention Network for Multi-evidence Question Answering, Victor Zhong, Caiming Xiong, Nitish Shirish Keskar, Richard Socher.
International Conference on Learning Representations (ICLR 2019). [ arxiv pdf ]
Competitive experience replay, Hao Liu, Alexander Trott, Richard Socher, Caiming Xiong.
International Conference on Learning Representations (ICLR 2019). [ arxiv pdf ]
Augmented Cyclic Adversarial Learning for Low Resource Domain Adaptation, Ehsan Hosseini-Asl, Yingbo Zhou, Caiming Xiong, Richard Socher.
International Conference on Learning Representations (ICLR 2019). [ arxiv pdf ]
A Closer Look at Deep Learning Heuristics: Learning rate restarts, Warmup and Distillation, Akhilesh Gotmare, Nitish Shirish Keskar, Caiming Xiong, Richard Socher.
International Conference on Learning Representations (ICLR 2019). [ arxiv pdf ]
AdaFrame: Adaptive Frame Selection for Fast Video Recognition, Zuxuan Wu, Caiming Xiong, Chih-Yao Ma, Richard Socher, Larry S Davis.
Conference on Computer Vision and Pattern Recognition (CVPR 2019). [ arxiv pdf ]
Unifying Question Answering and Text Classification via Span Extraction, Nitish Shirish Keskar, Bryan McCann, Caiming Xiong, Richard Socher
[ arxiv pdf ]
Learn to Grow: A Continual Structure Learning Framework for Catastrophic Forgetting, Xilai Li, Yingbo Zhou, Caiming Xiong, Richard Socher
The 36th International Conference on Machine Learning (ICML 2019). [ arxiv pdf ]
Taming MAML: Control variates for unbiased meta-reinforcement learning gradient estimation, Hao Liu, Caiming Xiong, Richard Socher
The 36th International Conference on Machine Learning (ICML 2019). [ arxiv pdf ]
On the Generalization Gap in Reparameterizable Reinforcement Learning, Huan Wang, Stephan Zheng, Caiming Xiong, Richard Socher
The 36th International Conference on Machine Learning (ICML 2019). [ arxiv pdf ]
2018
The Natural Language Decathlon: Multitask Learning as Question Answering, Bryan McCann, Nitish Shirish Keskar, Caiming Xiong, Richard Socher
[ arxiv pdf, code and leaderboard, blog post, Q&A, Press: VentureBeat, zdnet, FAZ (German), SiliconAngle ]
Multi-Hop Knowledge Graph Reasoning with Reward Shaping, Xi Victoria Lin, Richard Socher, Caiming Xiong
Conference on Empirical Methods in Natural Language Processing (EMNLP 2018). [ arxiv pdf ]
Improving Abstraction in Text Summarization, Wojciech Kryściński, Romain Paulus, Caiming Xiong, Richard Socher
Conference on Empirical Methods in Natural Language Processing (EMNLP 2018). [ arxiv pdf ]
A Multi-Discriminator CycleGAN for Unsupervised Non-Parallel Speech Domain Adaptation, Ehsan Hosseini-Asl, Yingbo Zhou, Caiming Xiong, Richard Socher
Interspeech 2018. [ arxiv pdf, blog ]
Global-Locally Self-Attentive Encoder for Dialogue State Tracking, Victor Zhong, Caiming Xiong, Richard Socher.
Association for Computational Linguistics 2018 Conference (ACL 2018). [ arxiv pdf ]
Efficient and Robust Question Answering from Minimal Context over Documents, Sewon Min, Victor Zhong, Richard Socher, Caiming Xiong.
Association for Computational Linguistics 2018 Conference (ACL 2018). [ arxiv pdf ]
End-to-End Dense Video Captioning with Masked Transformer, Luowei Zhou, Yingbo Zhou, Jason J. Corso, Richard Socher, Caiming Xiong
IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2018). (Spotlight) [ arxiv pdf ]
An Analysis of Neural Language Modeling at Multiple Scales, Stephen Merity, Nitish Shirish Keskar, Richard Socher
[ arxiv pdf, github code ]
Interpretable Counting for Visual Question Answering, Alexander Trott, Caiming Xiong, Richard Socher
International Conference on Learning Representations (ICLR 2018). [ arxiv pdf, blog post ]
Hierarchical and Interpretable Skill Acquisition in Multi-task Reinforcement Learning, Tianmin Shu, Caiming Xiong, and Richard Socher
International Conference on Learning Representations (ICLR 2018). [ arxiv pdf, blog post ]
A Deep Reinforced Model for Abstractive Summarization, Romain Paulus, Caiming Xiong, Richard Socher
International Conference on Learning Representations (ICLR 2018). [ pdf, blog post, Press: Forbes, MIT Tech Review, TechCrunch ]
Non-Autoregressive Neural Machine Translation, Jiatao Gu, James Bradbury, Caiming Xiong, Victor O.K. Li, Richard Socher
International Conference on Learning Representations (ICLR 2018). [ arxiv pdf, blog post, Press: CNBC, Venturebeat, Slator ]
DCN+: Mixed Objective and Deep Residual Coattention for Question Answering, Caiming Xiong, Victor Zhong and Richard Socher
International Conference on Learning Representations (ICLR 2018). [ arxiv pdf ]
Regularizing and Optimizing LSTM Language Models, Stephen Merity, Nitish Shirish Keskar, Richard Socher
International Conference on Learning Representations (ICLR 2018). [ pdf, code ]
A Flexible Approach to Automated RNN Architecture Generation, Stephen Merity, Martin Schrimpf, James Bradbury, Richard Socher
International Conference on Learning Representations (ICLR 2018 Workshop Track). [ arxiv pdf, blog post ]
Improving End-to-End Speech Recognition with Policy Learning, Yingbo Zhou, Caiming Xiong, Richard Socher
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2018). [ pdf, blog post ]
2017
Improving Generalization Performance by Switching from Adam to SGD, Nitish Shirish Keskar, Richard Socher
[ arxiv pdf ]
Improved Regularization Techniques for End-to-End Speech Recognition, Yingbo Zhou, Caiming Xiong, Richard Socher
[ pdf, blog post ]
Weighted Transformer Network for Machine Translation, Karim Ahmed, Nitish Shirish Keskar, Richard Socher
[ arxiv pdf, blog post ]
Seq2SQL: Generating Structured Queries from Natural Language using Reinforcement Learning, Victor Zhong, Caiming Xiong, Richard Socher
[ arxiv pdf, blog post, dataset, Press: TechCrunch, Venturebeat ]
Learned in Translation: Contextualized Word Vectors, Bryan McCann, James Bradbury, Caiming Xiong, Richard Socher
Advances in Neural Information Processing Systems (NIPS 2017). [ pdf, blog post, code, Press: MIT Tech Review ]
Revisiting Activation Regularization for Language RNNs, Stephen Merity, Bryan McCann, Richard Socher
1st Workshop on Learning to Generate Natural Language at ICML 2017. [ pdf ]
A Joint Many-Task Model: Growing a Neural Network for Multiple NLP Tasks, Kazuma Hashimoto, Caiming Xiong, Yoshimasa Tsuruoka, Richard Socher
Conference on Empirical Methods in Natural Language Processing (EMNLP 2017). Also appeared in NIPS 2016 Continual Learning and Deep Networks Workshop. [ pdf, blog post ]
Knowing When to Look: Adaptive Attention via A Visual Sentinel for Image Captioning, Jiasen Lu, Caiming Xiong, Devi Parikh, Richard Socher
IEEE Computer Vision and Pattern Recognition (CVPR 2017). [ pdf ]
Dynamic Coattention Networks For Question Answering, Caiming Xiong, Victor Zhong, Richard Socher
International Conference on Learning Representations (ICLR 2017). [ pdf, blog post, Best model on Stanford Question Answering Dataset (At submission) ]
Quasi-Recurrent Neural Networks, James Bradbury, Stephen Merity, Caiming Xiong, Richard Socher
International Conference on Learning Representations (ICLR 2017). [ pdf, blog post ]
Tying Word Vectors and Word Classifiers: A Loss Framework for Language Modeling, Hakan Inan, Khashayar Khosravi, Richard Socher
International Conference on Learning Representations (ICLR 2017). [ pdf ]
Pointer Sentinel Mixture Models, Stephen Merity, Caiming Xiong, James Bradbury, Richard Socher
International Conference on Learning Representations (ICLR 2017) and NIPS 2016 Workshop on Multi-class and Multi-label Learning in Extremely Large Label Spaces. [ pdf, new dataset ]
2016
A Way out of the Odyssey: Analyzing and Combining Recent Insights for LSTMs, Shayne Longpre, Sabeek Pradhan, Caiming Xiong, Richard Socher
[ pdf ]
MetaMind Neural Machine Translation System for WMT 2016, James Bradbury, Richard Socher
Proceedings of the First Conference on Machine Translation. Association for Computational Linguistics.
[ pdf, 2nd Place in the competition ]
Dynamic Memory Networks for Visual and Textual Question Answering, Caiming Xiong, Stephen Merity, Richard Socher
The 33rd International Conference on Machine Learning (ICML 2016). [ pdf , New York Times, MIT Technology Review ]
Ask Me Anything: Dynamic Memory Networks for Natural Language Processing, Ankit Kumar, Ozan Irsoy, Peter Ondruska, Mohit Iyyer, James Bradbury, Ishaan Gulrajani, Victor Zhong, Romain Paulus, Richard Socher
The 33rd International Conference on Machine Learning (ICML 2016).
Previous versions appeared at NIPS 2015 Deep Learning Symposium; NIPS 2015 Workshop on Reasoning, Attention and Memory
[ pdf , Wired, MIT Tech Review, MetaMind announcement ]
2015
Improved Semantic Representations From Tree-Structured Long Short-Term Memory Networks, Kai Sheng Tai, Richard Socher, and Christopher D. Manning
Association for Computational Linguistics 2015 Conference (ACL 2015). [ pdf , code ]
2014
Recursive Deep Learning for Natural Language Processing and Computer Vision, Richard Socher
PhD Thesis, Computer Science Department, Stanford University
[ pdf, 2014 Arthur L. Samuel Best Computer Science PhD Thesis Award ]
Global Belief Recursive Neural Networks, Romain Paulus, Richard Socher, Christopher D. Manning
Advances in Neural Information Processing Systems (NIPS 2014). [ pdf ]
Aspect Specific Sentiment Analysis using Hierarchical Deep Learning, Himabindu Lakkaraju, Richard Socher, Chris Manning.
NIPS Workshop on Deep Learning and Representation Learning, 2014. [ pdf ]
GloVe: Global Vectors for Word Representation, Jeffrey Pennington, Richard Socher and Christopher D. Manning
Conference on Empirical Methods in Natural Language Processing (EMNLP 2014). [ pdf , website with word vectors ]. ACL 2024 Test-of-Time paper award
A Neural Network for Factoid Question Answering over Paragraphs, Mohit Iyyer, Jordan Boyd-Graber, Leonardo Claudino, Richard Socher and Hal Daumé III
Conference on Empirical Methods in Natural Language Processing (EMNLP 2014). [ pdf, website with dataset, code, etc. ].
Grounded Compositional Semantics for Finding and Describing Images with Sentences, Richard Socher, Andrej Karpathy, Quoc V. Le, Christopher D. Manning, Andrew Y. Ng.
Transactions of the Association for Computational Linguistics (TACL 2014), Presented at ACL 2014. [ pdf ].
Scaling Short-answer Grading by Combining Peer Assessment with Algorithmic Scoring, Chinmay Kulkarni, Richard Socher, Michael S. Bernstein, Scott R. Klemmer.
2014 ACM Conference on Learning at Scale [ pdf ].
2013
Demonstration: etcml.com - easy text classification with machine learning, Richard Socher, Romain Paulus, Bryan McCann, Kai Sheng Tai, JiaJi Hu, Andrew Y. Ng.
Advances in Neural Information Processing Systems (NIPS 2013). [ Website to easily train and share text classifiers; Press: GigaOM, Stanford ]
Reasoning With Neural Tensor Networks for Knowledge Base Completion, Richard Socher*, Danqi Chen*, Christopher D. Manning, Andrew Y. Ng.
Advances in Neural Information Processing Systems (NIPS 2013). [ pdf, website ]
Zero-Shot Learning Through Cross-Modal Transfer, Richard Socher, Milind Ganjoo, Christopher D. Manning, Andrew Y. Ng.
Advances in Neural Information Processing Systems (NIPS 2013). [ pdf, website ]
Grounded Compositional Semantics for Finding and Describing Images with Sentences, Richard Socher, Quoc V. Le, Christopher D. Manning, Andrew Y. Ng.
Deep Learning Workshop at NIPS 2013 (see TACL 2014 version)
Recursive Deep Models for Semantic Compositionality Over a Sentiment Treebank, Richard Socher, Alex Perelygin, Jean Wu, Jason Chuang, Chris Manning, Andrew Ng and Chris Potts.
Conference on Empirical Methods in Natural Language Processing (EMNLP 2013, Oral). [ pdf, Supplementary Material, Website with Live Demo and Downloads; Press: Stanford release, Wired, Boston Globe; Related Kaggle Competition ]; ACL 2023 Test-of-Time paper award
Bilingual Word Embeddings for Phrase-Based Machine Translation, Will Zou, Richard Socher, Daniel Cer and Christopher Manning.
Conference on Empirical Methods in Natural Language Processing (EMNLP 2013, Short). [ pdf ]
Parsing with Compositional Vector Grammars, Richard Socher, John Bauer, Christopher D. Manning and Andrew Y. Ng.
Association for Computational Linguistics 2013 Conference (ACL 2013). [ pdf , website ]
Better Word Representations with Recursive Neural Networks for Morphology, Thang Luong, Richard Socher, Christopher D. Manning.
Conference on Computational Natural Language Learning (CoNLL 2013). [ pdf, website with vectors ]
Learning New Facts From Knowledge Bases With Neural Tensor Networks and Semantic Word Vectors, Danqi Chen, Richard Socher, Christopher D. Manning, Andrew Y. Ng.
International Conference on Learning Representations (ICLR 2013, Workshop Track). [ pdf, website ]
Zero-Shot Learning Through Cross-Modal Transfer, Richard Socher, Milind Ganjoo, Hamsa Sridhar, Osbert Bastani, Christopher D. Manning, Andrew Y. Ng.
International Conference on Learning Representations (ICLR 2013, Workshop Track, Oral). [ pdf, website ]
2012
Convolutional-Recursive Deep Learning for 3D Object Classification, Richard Socher, Brody Huval, Bharath Bhat, Christopher D. Manning and Andrew Y. Ng.
Advances in Neural Information Processing Systems (NIPS 2012). [ pdf, website ]
Semantic Compositionality through Recursive Matrix-Vector Spaces, Richard Socher, Brody Huval, Christopher D. Manning and Andrew Y. Ng.
Conference on Empirical Methods in Natural Language Processing (EMNLP 2012, Oral). [ pdf, website ]
Improving Word Representations via Global Context and Multiple Word Prototypes, Eric H. Huang, Richard Socher, Christopher D. Manning and Andrew Y. Ng.
Association for Computational Linguistics 2012 Conference (ACL 2012). [ pdf, website ]
Stanford’s System for Parsing the English Web, David McClosky, Wanxiang Che, Marta Recasens, Mengqiu Wang, Richard Socher, and Christopher D. Manning
In Proceedings of First Workshop on Syntactic Analysis of Non-Canonical Language (SANCL at NAACL, 2012). [ pdf, bib ]
2011
Dynamic Pooling and Unfolding Recursive Autoencoders for Paraphrase Detection, Richard Socher, Eric H. Huang, Jeffrey Pennington, Andrew Y. Ng, and Christopher D. Manning.
Advances in Neural Information Processing Systems (NIPS 2011). [ pdf, website ]
Semi-Supervised Recursive Autoencoders for Predicting Sentiment Distributions, Richard Socher, Jeffrey Pennington, Eric Huang, Andrew Y. Ng, and Christopher D. Manning.
Conference on Empirical Methods in Natural Language Processing (EMNLP 2011, Oral). [ pdf, website ]
Parsing Natural Scenes and Natural Language with Recursive Neural Networks, Richard Socher, Cliff Lin, Andrew Y. Ng, and Christopher D. Manning.
The 28th International Conference on Machine Learning (ICML 2011). Distinguished Application Paper Award. [ pdf, video, website ]
Spectral Chinese Restaurant Processes: Nonparametric Clustering Based on Similarities, Richard Socher, Andrew Maas, Christopher D. Manning.
Fourteenth International Conference on Artificial Intelligence and Statistics (AISTATS 2011). [ pdf ]
2010
Learning Continuous Phrase Representations and Syntactic Parsing with Recursive Neural Networks, Richard Socher, Christopher D. Manning, Andrew Y. Ng.
Deep Learning and Unsupervised Feature Learning Workshop - NIPS 2010, Oral. [ pdf (added details on 3/3/2012) ]
A Gibbs Sampler for Spatial Clustering with the Distance-dependent Chinese Restaurant Process, Richard Socher and Christopher D. Manning.
Monte Carlo Methods for Modern Applications Workshop - NIPS 2010. [ pdf ]
Connecting Modalities: Semi-supervised Segmentation and Annotation of Images Using Unaligned Text Corpora, Richard Socher and Li Fei-Fei.
IEEE Computer Vision and Pattern Recognition (CVPR 2010). [ pdf ]
2009
A Bayesian analysis of dynamics in free recall, Richard Socher, Sam J. Gershman, Adler Perotte, Per Sederberg, Ken A. Norman, and David M. Blei.
Advances in Neural Information Processing Systems 22 (NIPS 2009). [ pdf ]
Towards Total Scene Understanding: Classification, Annotation and Segmentation in an Automatic Framework, Li-Jia Li, Richard Socher, and Li Fei-Fei.
IEEE Computer Vision and Pattern Recognition (CVPR 2009, Oral). [ pdf ]
ImageNet: A Large-Scale Hierarchical Image Database, Jia Deng, Wei Dong, Richard Socher, Li-Jia Li, Kai Li, and Li Fei-Fei.
IEEE Computer Vision and Pattern Recognition (CVPR 2009). [ pdf ] Won test of time award
2008
A Learning Based Hierarchical Model for Vessel Segmentation, Richard Socher, Adrian Barbu, and Dorin Comaniciu.
In 5th IEEE International Symposium on Biomedical Imaging: From Nano to Macro (ISBI 2008, Oral). [ pdf ]
Theses
PhD Thesis: Recursive Deep Learning for Natural Language Processing and Computer Vision, Computer Science Department, Stanford University
Master's Thesis: A Learning-Based Hierarchical Model for Vessel Segmentation, Saarland University, 2008, grade 1.0 A+
Bachelor Thesis: Automatic Extension of Semantic Lexicons with a Bootstrapping Algorithm, Leipzig University, 2006, grade 1.0 A+
Former Students
A small subset of the students and interns I supervised at some point.
Jeffrey Pennington, Google Brain
Mohit Iyyer, professor in computer science at UMass Amherst
Tanay Tandon, Founder at Athelas
Ankit Kumar, Co-Founder & CTO at Ubiquity6 Inc.