Publications

See Google scholar for most up-to-date list of papers

Since the founding of you.com and AIXVentures.com I’ve not had the time and resources anymore to publish (m)any proper research papers anymore. So here are just some recent thoughts.

2020

ProGen: Language Modeling for Protein Generation, Ali Madani, Bryan McCann, Nikhil Naik, Nitish Shirish Keskar, Namrata Anand, Raphael R Eguchi, Po-Ssu Huang and Richard Socher.
bioRxiv linkblog ]

The AI Economist: Improving Equality and Productivity with AI-Driven Tax Policies, Stephan Zheng, Alexander Trott, Sunil Srinivasa, Nikhil Naik, Melvin Gruesbeck, David C. Parkes, Richard Socher.
arxiv linkblogshort videoQ&A, Press: VentureBeatTechCrunch ]

Deep Learning-enabled Breast Cancer Hormonal Receptor Status Determination from Base-level H&E Stains, Nikhil Naik, Ali Madani, Andre Esteva, Nitish Keskar, Michael Press, Dan Ruderman, David Agus, Richard Socher
(Nature Communications 2020) [ paperblog ]

Dye-sensitized solar cells under ambient light powering machine learning: towards autonomous smart sensors for the internet of things, Hannes Michaels, Michael Rinderle, Richard Freitag, Lacopo Benesperi, Tomas Edvinsson, Richard Socher, Alessio Gagliardib and Marina Freitag
Issue11, (Chemical Science 2020). [ paper link ]

Deep learning-enabled medical computer vision
Andre Esteva, Katherine Chou, Serena Yeung, Nikhil Naik, Ali Madani, Ali Mottaghi, Yun Liu, Eric Topol, Jeff Dean, Richard Socher (NPJ digital medicine)

Bridging Textual and Tabular Data for Cross-Domain Text-to-SQL Semantic Parsing
Xi Victoria Lin, Richard Socher, Caiming Xiong · EMNLP 2020

Catastrophic Fisher Explosion: Early Phase Fisher Matrix Impacts Generalization
Stanislaw Jastrzebski, Devansh Arpit, Oliver Astrand, Giancarlo Kerg, Huan Wang, Caiming Xiong, Richard Socher, Kyunghyun Cho, Krzysztof Geras · arXiv 2020

Find or Classify? Dual Strategy for Slot-Value Predictions on Multi-Domain Dialog State Tracking
Jian-Guo Zhang, Kazuma Hashimoto, Chien-Sheng Wu, Yao Wan, Philip S. Yu, Richard Socher, Caiming Xiong · SEM 2020

Online Structured Meta-learning
Huaxiu Yao, Yingbo Zhou, Mehrdad Mahdavi, Zhenhui Li, Richard Socher, Caiming Xiong · NeurIPS 2020

GeDi: Generative Discriminator Guided Sequence Generation
Ben Krause, Akhilesh Deepak Gotmare, Bryan McCann, Nitish Shirish Keskar, Shafiq Joty, Richard Socher, Nazneen Fatema Rajani · arXiv 2020

Explaining and Improving Model Behavior with k Nearest Neighbor Representations
Nazneen Fatema Rajani, Ben Krause, Wengpeng Yin, Tong Niu, Richard Socher, Caiming Xiong · arXiv 2020

Explaining Creative Artifacts
Lav R. Varshney, Nazneen Fatema Rajani, Richard Socher · WHI 2020

Theory-Inspired Path-Regularized Differential Network Architecture Search
Pan Zhou, Caiming Xiong, Richard Socher, Steven C. H. Hoi · NeurIPS 2020

Universal Natural Language Processing with Limited Annotations: Try Few-shot Textual Entailment as a Start
Wenpeng Yin, Nazneen Fatema Rajani, Dragomir Radev, Richard Socher, Caiming Xiong · EMNLP 2020

TOD-BERT: Pre-trained Natural Language Understanding for Task-Oriented Dialogue
Chien-Sheng Wu, Steven Hoi, Richard Socher, Caiming Xiong · EMNLP 2020

GraPPa: Grammar-Augmented Pre-Training for Table Semantic Parsing
Tao Yu, Chien-Sheng Wu, Xi Victoria Lin, Bailin Wang, Yi Chern Tan, Xinyi Yang, Dragomir Radev, Richard Socher, Caiming Xiong · arXiv 2020

Composed Variational Natural Language Generation for Few-shot Intents
Congying Xia, Caiming Xiong, Philip Yu, Richard Socher · EMNLP 2020

Central Yup'ik and Machine Translation of Low-Resource Polysynthetic Languages
Christopher Liu, Laura Dominé, Kevin Chavez, Richard Socher · arXiv 2020

Photon: A Robust Cross-Domain Text-to-SQL System
Jichuan Zeng, Xi Victoria Lin, Caiming Xiong, Richard Socher, Michael R. Lyu, Irwin King, Steven C. H. Hoi · ACL 2020

Explore, Discover and Learn: Unsupervised Discovery of State-Covering Skills
Víctor Campos, Alexander Trott, Caiming Xiong, Richard Socher, Xavier Giro-i-Nieto, Jordi Torres · arXiv 2020

An investigation of phone-based subword units for end-to-end speech recognition
Weiran Wang, Guangsen Wang, Aadyot Bhatnagar, Yingbo Zhou, Caiming Xiong, Richard Socher · Interspeech 2020

Prototypical Contrastive Learning of Unsupervised Representations
Junnan Li, Pan Zhou, Caiming Xiong, Richard Socher, Steven C. H. Hoi · arXiv 2020

BERTology Meets Biology: Interpreting Attention in Protein Language Models
Jesse Vig, Ali Madani, Lav R. Varshney, Caiming Xiong, Richard Socher, Nazneen Fatema Rajani · arXiv 2020

A Simple Language Model for Task-Oriented Dialogue
Ehsan Hosseini-Asl, Bryan McCann, Chien-Sheng Wu, Semih Yavuz, Richard Socher · arXiv 2020

DART: Open-Domain Structured Data Record to Text Generation
Dragomir Radev, Rui Zhang, Amrit Rau, Abhinand Sivaprasad, Chiachun Hsieh, Nazneen Fatema Rajani, Xiangru Tang, Aadit Vyas, Neha Verma, Pranav Krishna, Yangxiaokang Liu, Nadia Irwanto, Jessica Pan, Faiaz Rahman, Ahmad Zaidi, Murori Mutuma, Yasin Tarabar, Ankit Gupta, Tao Yu, Yi Chern Tan, Xi Victoria Lin, Caiming Xiong, Richard Socher · arXiv 2020

Towards Understanding Hierarchical Learning: Benefits of Neural Representations
Minshuo Chen, Yu Bai, Jason D. Lee, Tuo Zhao, Huan Wang, Caiming Xiong, Richard Socher · arXiv 2020

Explicit Memory Tracker with Coarse-to-Fine Reasoning for Conversational Machine Reading
Yifan Gao, Chien-Sheng Wu, Shafiq Joty, Caiming Xiong, Richard Socher, Irwin King, Michael R. Lyu, Steven C. H. Hoi · ACL 2020

CO-Search: COVID-19 Information Retrieval with Semantic Search, Question Answering, and Abstractive Summarization
Andre Esteva, Anuprit Kale, Romain Paulus, Kazuma Hashimoto, Wenpeng Yin, Dragomir Radev, Richard Socher · arXiv 2020

WOAD: Weakly Supervised Online Action Detection in Untrimmed Videos
Mingfei Gao, Yingbo Zhou, Ran Xu, Richard Socher, Caiming Xiong · arXiv 2020

Learning from Noisy Anchors for One-stage Object Detection
Hengduo Li, Zuxuan Wu, Chen Zhu, Caiming Xiong, Richard Socher, Larry S. Davis · CVPR 2020

ESPRIT: Explaining Solutions to Physical Reasoning Tasks
Nazneen Fatema Rajani, Rui Zhang, Yi Chern Tan, Stephan Zheng, Jeremy Weiss, Aadit Vyas, Abhijit Gupta, Caiming XIong, Richard Socher, Dragomir Radev · ACL 2020

It's Morphin' Time! Combating Linguistic Discrimination with Inflectional Perturbations
Samson Tan, Shafiq Joty, Min-Yen Kan, Richard Socher · ACL 2020

ERASER: A Benchmark to Evaluate Rationalized NLP Models
Jay DeYoung, Sarthak Jain, Nazneen Fatema Rajani, Eric Lehman, Caiming Xiong, Richard Socher, Byron C. Wallace · ACL 2020

Improving out-of-distribution generalization via multi-task self-supervised pretraining
Isabela Albuquerque, Nikhil Naik, Junnan Li, Nitish Keskar, Richard Socher · arXiv 2020

Towards Noise-resistant Object Detection with Noisy Annotations
Junnan Li, Caiming Xiong, Richard Socher, Steven Hoi · arXiv 2020

Taylorized Training: Towards Better Approximation of Neural Network Training at Finite Width
Yu Bai, Ben Krause, Huan Wang, Caiming Xiong, Richard Socher · arXiv 2020

Neural Bayes: A Generic Parameterization Method for Unsupervised Representation Learning
Devansh Arpit, Huan Wang, Caiming Xiong, Richard Socher, Yoshua Bengio · arXiv 2020

Tree-structured Attention with Hierarchical Accumulation
Xuan-Phi Nguyen, Shafiq Joty, Steven C. H. Hoi, Richard Socher · ICLR 2020

Non-Autoregressive Dialog State Tracking
Hung Le, Richard Socher, Steven C. H. Hoi · ICLR 2020

DivideMix: Learning with Noisy Labels as Semi-supervised Learning
Junnan Li, Richard Socher, Steven C. H. Hoi · arXiv 2020

Learning to Retrieve Reasoning Paths over Wikipedia Graph for Question Answering
Akari Asai, Kazuma Hashimoto, Hannaneh Hajishirzi, Richard Socher, Caiming Xiong · ICLR 2020

Limits of Detecting Text Generated by Large-Scale Language Models
Lav R. Varshney, Nitish Shirish Keskar, Richard Socher · ITA 2020

2019

MKD: a Multi-Task Knowledge Distillation Approach for Pretrained Language Models
Linqing Liu, Huan Wang, Jimmy Lin, Richard Socher, Caiming Xiong · arXiv 2019

A High-Quality Multilingual Dataset for Structured Documentation Translation
Kazuma Hashimoto, Raffaella Buschiazzo, James Bradbury, Teresa Marshall, Richard Socher, Caiming Xiong · WMT 2019

Discriminative Nearest Neighbor Few-Shot Intent Detection by Transferring Natural Language Inference
Jian-Guo Zhang, Kazuma Hashimoto, Wenhao Liu, Chien-Sheng Wu, Yao Wan, Philip S. Yu, Richard Socher, Caiming Xiong · EMNLP 19

CTRL: A Conditional Transformer Language Model for Controllable Generation, Nitish Shirish Keskar, Bryan McCann, Lav R. Varshney, Caiming Xiong, Richard Socher.
arxiv linkcode (pre-trained and fine-tuning)blog ]

Genie: a generator of natural language semantic parsers for virtual assistant commands, Giovanni Campagna, Silei Xu, Mehrad Moradshahi, Richard Socher, Monica S. Lam
PLDI 2019 [ pdf linkhttps://almond.stanford.edu/ ]

Keeping Your Distance: Solving Sparse Reward Tasks Using Self-Balancing Shaped Rewards, Alex Trott, Stephan Zheng, Caiming Xiong, Richard Socher.
Thirty-third Conference on Neural Information Processing Systems (NeurIPS 2019).

The State of Text Summarization: A Critical Evaluation, Wojciech Kryściński, Nitish Shirish Keskar, Bryan McCann, Caiming Xiong, Richard Socher.
2019 Conference on Empirical Methods in Natural Language Processing and 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP 2019). [ arxiv link ]

WSLLN: Weakly Supervised Natural Language Localization Networks, Mingfei Gao, Larry Davis, Richard Socher, Caiming Xiong.
2019 Conference on Empirical Methods in Natural Language Processing and 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP 2019). [ arxiv link ]

Editing-based SQL Query Generation for Cross-Domain Context-Dependent Questions, Rui Zhang, Tao Yu, Heyang Er, Sungrok Shim, Eric Xue, Xi Victoria Lin, Tianze Shi, Caiming Xiong, Richard Socher and Dragomir Radev
2019 Conference on Empirical Methods in Natural Language Processing and 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP 2019). [ arxiv link ]

CoSQL: A Conversational Text-to-SQL Challenge Towards Cross-Domain Natural Language Interfaces to Databases, Tao Yu, Rui Zhang, Heyang Er, Suyi Li, Eric Xue, Bo Pang, Xi Victoria Lin, Yi Chern Tan, Tianze Shi, Zihan Li, Youxuan Jiang, Michihiro Yasunaga, Sungrok Shim, Tao Chen, Alexander Fabbri, Zifan Li, Luyao Chen, Yuwen Zhang, Shreya Dixit, Vincent Zhang, Caiming Xiong, Richard Socher, Walter Lasecki, Dragomir Radev.
2019 Conference on Empirical Methods in Natural Language Processing and 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP 2019). [ arxiv link ]

Transferable Multi-Domain State Generator for Task-Oriented Dialogue Systems, Chien-Sheng Wu, Andrea Madotto, Ehsan Hosseini-Asl, Caiming Xiong, Richard Socher, Pascale Fung
The 57th Annual Meeting of the Association for Computational Linguistics (ACL 2019). Outstanding Paper Award. [ arxiv pdfcode ]

Explain Yourself! Leveraging Language Models for Commonsense Reasoning, Nazneen Fatema Rajani, Bryan McCann, Caiming Xiong and Richard Socher
The 57th Annual Meeting of the Association for Computational Linguistics (ACL 2019). [ arxiv pdfBlog PostGithub, Press: VentureBeatSilicon AngleZDNet ]

SParC: Cross-Domain Semantic Parsing in Context, Tao Yu, Rui Zhang, Michihiro Yasunaga, Yi Chern Tan, Xi Victoria Lin, Suyi Li, Heyang Er, Irene Li, Bo Pang, Tao Chen, Emily Ji, Shreya Dixit, David Proctor, Sungrok Shim, Jonathan Kraft, Vincent Zhang, Caiming Xiong, Richard Socher and Dragomir Radev
The 57th Annual Meeting of the Association for Computational Linguistics (ACL 2019). [ arxiv pdfChallenge and Leaderboard ]

Global-to-local Memory Pointer Networks for Task-Oriented Dialogue, Chien-Sheng Wu, Richard Socher, Caiming Xiong
International Conference on Learning Representations (ICLR 2019). [ arxiv pdf ]

Self-Monitoring Navigation Agent via Auxiliary Progress Estimation, Chih-Yao Ma, Jiasen Lu, Zuxuan Wu, Ghassan AlRegib, Zsolt Kira, Richard Socher, Caiming Xiong
International Conference on Learning Representations (ICLR 2019). [ arxiv pdf ]

Coarse-grain Fine-grain Coattention Network for Multi-evidence Question Answering, Victor Zhong, Caiming Xiong, Nitish Shirish Keskar, Richard Socher.
International Conference on Learning Representations (ICLR 2019). [ arxiv pdf ]

Competitive experience replay, Hao Liu, Alexander Trott, Richard Socher, Caiming Xiong.
International Conference on Learning Representations (ICLR 2019). [ arxiv pdf ]

Augmented Cyclic Adversarial Learning for Low Resource Domain Adaptation, Ehsan Hosseini-Asl, Yingbo Zhou, Caiming Xiong, Richard Socher.
International Conference on Learning Representations (ICLR 2019). [ arxiv pdf ]

A Closer Look at Deep Learning Heuristics: Learning rate restarts, Warmup and Distillation, Akhilesh Gotmare, Nitish Shirish Keskar, Caiming Xiong, Richard Socher.
International Conference on Learning Representations (ICLR 2019). [ arxiv pdf ]

AdaFrame: Adaptive Frame Selection for Fast Video Recognition, Zuxuan Wu, Caiming Xiong, Chih-Yao Ma, Richard Socher, Larry S Davis.
Conference on Computer Vision and Pattern Recognition (CVPR 2019). [ arxiv pdf ]

Unifying Question Answering and Text Classification via Span Extraction, Nitish Shirish Keskar, Bryan McCann, Caiming Xiong, Richard Socher
arxiv pdf ]

Learn to Grow: A Continual Structure Learning Framework for Catastrophic Forgetting, Xilai, Yingbo Zhou, Caiming Xiong, Richard Socher
The 36th International Conference on Machine Learning (ICML 2019). [ arxiv pdf ]

Taming MAML: Control variates for unbiased meta-reinforcement learning gradient estimation, Hao Liu, Caiming Xiong, Richard Socher
The 36th International Conference on Machine Learning (ICML 2019). [ arxiv pdf ]

On the Generalization Gap in Reparameterizable Reinforcement Learning (Huan Wang, Stephan Zheng, Caiming Xiong, Richard Socher
The 36th International Conference on Machine Learning (ICML 2019). [ arxiv pdf ]

2018

The Natural Language Decathlon: Multitask Learning as Question Answering, Bryan McCann, Nitish Shirish Keskar, Caiming Xiong, Richard Socher
arxiv pdfcode and leaderboardblog postQ&A, Press: VentureBeatzdnetFAZ (German)SiliconAngle ]

Multi-Hop Knowledge Graph Reasoning with Reward Shaping, Xi Victoria Lin, Richard Socher, Caiming Xiong
Conference on Empirical Methods in Natural Language Processing (EMNLP 2018). [ arxiv pdf ]

Improving Abstraction in Text Summarization, Wojciech Kryściński, Romain Paulus, Caiming Xiong, Richard Socher
Conference on Empirical Methods in Natural Language Processing (EMNLP 2018). [ arxiv pdf ]

A Multi-Discriminator CycleGAN for Unsupervised Non-Parallel Speech Domain Adaptation, Ehsan Hosseini-Asl, Yingbo Zhou, Caiming Xiong, Richard Socher
Interspeech 2018. [ arxiv pdfblog ]

Global-Locally Self-Attentive Encoder for Dialogue State Tracking, Victor Zhong, Caiming Xiong, Richard Socher.
Association for Computational Linguistics 2018 Conference (ACL 2018). [ arxiv pdf ]

Efficient and Robust Question Answering from Minimal Context over Documents, Sewon Min, Victor Zhong, Richard Socher, Caiming Xiong.
Association for Computational Linguistics 2018 Conference (ACL 2018). [ arxiv pdf ]

End-to-End Dense Video Captioning with Masked Transformer, Luowei Zhou, Yingbo Zhou, Jason J. Corso, Richard Socher, Caiming Xiong
IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2018). (Spotlight) [ arxiv pdf ]

An Analysis of Neural Language Modeling at Multiple Scales, Stephen Merity, Nitish Shirish Keskar, Richard Socher
arxiv pdfgithub code ]

Interpretable Counting for Visual Question Answering, Alexander Trott, Caiming Xiong, Richard Socher
International Conference on Learning Representations (ICLR 2018). [ arxiv pdfblog post ]

Hierarchical and Interpretable Skill Acquisition in Multi-task Reinforcement Learning, Tianmin Shu, Caiming Xiong, and Richard Socher
International Conference on Learning Representations (ICLR 2018). [ arxiv pdfblog post ]

A Deep Reinforced Model for Abstractive Summarization, Romain Paulus, Caiming Xiong, Richard Socher
International Conference on Learning Representations (ICLR 2018). [ pdfblog post, Press: ForbesMIT Tech ReviewTechCrunch ]

Non-Autoregressive Neural Machine Translation, Jiatao Gu, James Bradbury, Caiming Xiong, Victor O.K. Li, Richard Socher
International Conference on Learning Representations (ICLR 2018). [ arxiv pdfblog post, Press: CNBCVenturebeatSlator ]

DCN+: Mixed Objective and Deep Residual Coattention for Question Answering, Caiming Xiong, Victor Zhong and Richard Socher
International Conference on Learning Representations (ICLR 2018). [ arxiv pdf ]

Regularizing and Optimizing LSTM Language Models, Stephen Merity, Nitish Shirish Keskar, Richard Socher
International Conference on Learning Representations (ICLR 2018). [ pdfcode ]

A Flexible Approach to Automated RNN Architecture Generation, Stephen Merity, Martin Schrimpf, James Bradbury, Richard Socher
International Conference on Learning Representations (ICLR 2018 Workshop Track). [ arxiv pdf, blog post ]

Improving End-to-End Speech Recognition with Policy Learning, Yingbo Zhou, Caiming Xiong, Richard Socher
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2018). [ pdfblog post ]

2017

Improving Generalization Performance by Switching from Adam to SGD, Nitish Shirish Keskar, Richard Socher
arxiv pdf ]

Improved Regularization Techniques for End-to-End Speech Recognition, Yingbo Zhou, Caiming Xiong, Richard Socher
pdfblog post ]

Weighted Transformer Network for Machine Translation, Karim Ahmed, Nitish Shirish Keskar, Richard Socher
arxiv pdfblog post ]

Seq2SQL: Generating Structured Queries from Natural Language using Reinforcement Learning, Victor Zhong, Caiming Xiong, Richard Socher
arxiv pdfblog postdataset, Press: TechCrunchVenturebeat ]

Learned in Translation: Contextualized Word Vectors, Bryan McCann, James Bradbury, Caiming Xiong, Richard Socher
Advances in Neural Information Processing Systems (NIPS 2017). [ pdfblog postcode, Press: MIT Tech Review ]

Revisiting Activation Regularization for Language RNNs, Stephen Merity, Bryan McCann, Richard Socher
1st Workshop on Learning to Generate Natural Language at ICML 2017. [ pdf ]

A Joint Many-Task Model: Growing a Neural Network for Multiple NLP Tasks, Kazuma Hashimoto, Caiming Xiong, Yoshimasa Tsuruoka, Richard Socher
Conference on Empirical Methods in Natural Language Processing (EMNLP 2017). Also appeared in NIPS 2016 Continual Learning and Deep Networks Workshop. [ pdfblog post ]

Knowing When to Look: Adaptive Attention via A Visual Sentinel for Image Captioning, Jiasen Lu, Caiming Xiong, Devi Parikh, Richard Socher
IEEE Computer Vision and Pattern Recognition (CVPR 2017). [ pdf, ]

Dynamic Coattention Networks For Question Answering, Caiming Xiong, Victor Zhong, Richard Socher
International Conference on Learning Representations (ICLR 2017). [ pdfblog postBest model on Stanford Question Answering Dataset (At submission) ]

Quasi-Recurrent Neural Networks, James Bradbury, Stephen Merity, Caiming Xiong, Richard Socher
International Conference on Learning Representations (ICLR 2017). [ pdfblog post ]

Tying Word Vectors and Word Classifiers: A Loss Framework for Language Modeling, Hakan Inan, Khashayar Khosravi, Richard Socher
International Conference on Learning Representations (ICLR 2017). [ pdf ]

Pointer Sentinel Mixture Models, Stephen Merity, Caiming Xiong, James Bradbury, Richard Socher
International Conference on Learning Representations (ICLR 2017) and NIPS 2016 Workshop on Multi-class and Multi-label Learning in Extremely Large Label Spaces. [ pdfnew dataset ]

2016

A Way out of the Odyssey: Analyzing and Combining Recent Insights for LSTMs, Shayne Longpre, Sabeek Pradhan, Caiming Xiong, Richard Socher
pdf ]

MetaMind Neural Machine Translation System for WMT 2016, James Bradbury, Richard Socher
Proceedings of the First Conference on Machine Translation. Association for Computational Linguistics.
pdf2nd Place in the competition ]

Dynamic Memory Networks for Visual and Textual Question Answering, Caiming Xiong, Stephen Merity, Richard Socher
The 33rd International Conference on Machine Learning (ICML 2016). [ pdf , New York TimesMIT Technology Review ]

Ask Me Anything: Dynamic Memory Networks for Natural Language Processing, Ankit Kumar, Ozan Irsoy, Peter Ondruska, Mohit Iyyer, James Bradbury, Ishaan Gulrajani, Victor Zhong, Romain Paulus, Richard Socher
The 33rd International Conference on Machine Learning (ICML 2016).
Previous versions appeared at NIPS 2015 Deep Learning Symposium; NIPS 2015 workshop on Reasoning, Attention and Memory Workshop
pdf , WiredMIT Tech ReviewMetaMind announcement ]

2015

Improved Semantic Representations From Tree-Structured Long Short-Term Memory Networks, Kai Sheng Tai, Richard Socher, and Christopher D. Manning
Association for Computational Linguistics 2015 Conference (ACL 2015). [ pdf , code ]

2014

Recursive Deep Learning for Natural Language Processing and Computer Vision, Richard Socher
PhD Thesis, Computer Science Department, Stanford University
pdf, 2014 Arthur L. Samuel Best Computer Science PhD Thesis Award ]

Global Belief Recursive Neural Networks, Romain Paulus, Richard Socher, Christopher D. Manning
Advances in Neural Information Processing Systems (NIPS 2014). [ pdf ]

Aspect Specific Sentiment Analysis using Hierarchical Deep Learning, Himabindu Lakkaraju, Richard Socher, Chris Manning.
NIPS Workshop on Deep Learning and Representation Learning, 2014. [ pdf ]

Glove: Global Vectors for Word Representation, Jeffrey Pennington, Richard Socher and Christopher D. Manning
Conference on Empirical Methods in Natural Language Processing (EMNLP 2014). [ pdf , website with word vectors ]. ACL 2024 Test-of-Time paper award

A Neural Network for Factoid Question Answering over Paragraphs, Mohit Iyyer, Jordan Boyd-Graber, Leonardo Claudino, Richard Socher and Hal Daumé III
Conference on Empirical Methods in Natural Language Processing (EMNLP 2014). [ pdfwebsite with dataset, code, etc. ].

Grounded Compositional Semantics for Finding and Describing Images with Sentences, Richard Socher, Andrej Karpathy, Quoc V. Le, Christopher D. Manning, Andrew Y. Ng.
Transactions of the Association for Computational Linguistics (TACL 2014), Presented at ACL 2014. [ pdf ].

Scaling Short-answer Grading by Combining Peer Assessment with Algorithmic Scoring, Chinmay Kulkarni, Richard Socher, Michael S. Bernstein, Scott R. Klemmer.
2014 ACM Conference on Learning at Scale [ pdf ].

2013

Demonstration: etcml.com - easy text classification with machine learning, Richard Socher, Romain Paulus, Bryan McCann, Kai Sheng Tai, JiaJi Hu, Andrew Y. Ng.
Advances in Neural Information Processing Systems (NIPS 2013). [ Website to easily train and share text classifiers; Press: GigaOMStanford ]

Reasoning With Neural Tensor Networks for Knowledge Base Completion, Richard Socher*, Danqi Chen*, Christopher D. Manning, Andrew Y. Ng.
Advances in Neural Information Processing Systems (NIPS 2013). [ pdfwebsite ]

Zero-Shot Learning Through Cross-Modal Transfer, Richard Socher, Milind Ganjoo, Christopher D. Manning, Andrew Y. Ng.
Advances in Neural Information Processing Systems (NIPS 2013). [ pdfwebsite ]

Grounded Compositional Semantics for Finding and Describing Images with Sentences, Richard Socher, Quoc V. Le, Christopher D. Manning, Andrew Y. Ng.
Deep Learning Workshop at NIPS 2013 (see TACL 2014 version)

Recursive Deep Models for Semantic Compositionality Over a Sentiment Treebank, Richard Socher, Alex Perelygin, Jean Wu, Jason Chuang, Chris Manning, Andrew Ng and Chris Potts.
Conference on Empirical Methods in Natural Language Processing (EMNLP 2013, Oral). [ pdfSupplementary MaterialWebsite with Live Demo and Downloads; Press: Stanford releaseWiredBoston Globe Related Kaggle Competition ]; ACL 2023 Test-of-Time paper award

Bilingual Word Embeddings for Phrase-Based Machine Translation, Will Zou, Richard Socher, Daniel Cer and Christopher Manning.
Conference on Empirical Methods in Natural Language Processing (EMNLP 2013, Short). [ pdf ]

Parsing with Compositional Vector Grammars, Richard Socher, John Bauer, Christopher D. Manning and Andrew Y. Ng.
Association for Computational Linguistics 2013 Conference (ACL 2013). [ pdf , website ]

Better Word Representations with Recursive Neural Networks for Morphology, Thang Luong, Richard Socher, Christopher D. Manning.
Conference on Computational Natural Language Learning (CoNLL 2013). [ pdfwebsite with vectors ]

Learning New Facts From Knowledge Bases With Neural Tensor Networks and Semantic Word Vectors, Danqi Chen, Richard Socher, Christopher D. Manning, Andrew Y. Ng.
International Conference on Learning Representations (ICLR 2013, Workshop Track). [ pdfwebsite ]

Zero-Shot Learning Through Cross-Modal Transfer, Richard Socher, Milind Ganjoo, Hamsa Sridhar, Osbert Bastani, Christopher D. Manning, Andrew Y. Ng.
International Conference on Learning Representations (ICLR 2013, Workshop Track, Oral). [ pdfwebsite ]

2012

Convolutional-Recursive Deep Learning for 3D Object Classification, Richard Socher, Brody Huval, Bharath Bhat, Christopher D. Manning and Andrew Y. Ng.
Advances in Neural Information Processing Systems (NIPS 2012). [ pdfwebsite ]

Semantic Compositionality through Recursive Matrix-Vector Spaces, Richard Socher, Brody Huval, Christopher D. Manning and Andrew Y. Ng.
Conference on Empirical Methods in Natural Language Processing (EMNLP 2012, Oral). [ pdfwebsite ]

Improving Word Representations via Global Context and Multiple Word Prototypes, Eric H. Huang, Richard Socher, Christopher D. Manning and Andrew Y. Ng.
Association for Computational Linguistics 2012 Conference (ACL 2012). [ pdfwebsite ]

Stanford’s System for Parsing the English Web, David McClosky, Wanxiang Che, Marta Recasens, Mengqiu Wang, Richard Socher, and Christopher D. Manning
In Proceedings of First Workshop on Syntactic Analysis of Non-Canonical Language (SANCL at NAACL, 2012). [ pdfbib ]

2011

Dynamic Pooling and Unfolding Recursive Autoencoders for Paraphrase Detection, Richard Socher, Eric H. Huang, Jeffrey Pennington, Andrew Y. Ng, and Christopher D. Manning.
Advances in Neural Information Processing Systems (NIPS 2011). [ pdfwebsite ]

Semi-Supervised Recursive Autoencoders for Predicting Sentiment Distributions, Richard Socher, Jeffrey Pennington, Eric Huang, Andrew Y. Ng, and Christopher D. Manning.
Conference on Empirical Methods in Natural Language Processing (EMNLP 2011, Oral). [ pdfwebsite ]

Parsing Natural Scenes and Natural Language with Recursive Neural Networks, Richard Socher, Cliff Lin, Andrew Y. Ng, and Christopher D. Manning.
The 28th International Conference on Machine Learning (ICML 2011). Distinguished Application Paper Award. [ pdfvideowebsite ]

Spectral Chinese Restaurant Processes: Nonparametric Clustering Based on Similarities, Richard Socher, Andrew Maas, Christopher D. Manning.
Fourteenth International Conference on Artificial Intelligence and Statistics (AISTATS 2011). [ pdf ]

2010

Learning Continuous Phrase Representations and Syntactic Parsing with Recursive Neural Networks, Richard Socher, Christopher D. Manning, Andrew Y. Ng.
Deep Learning and Unsupervised Feature Learning Workshop - NIPS 2010, Oral. [ pdf (added details on 3/3/2012) ]

A Gibbs Sampler for Spatial Clustering with the Distance-dependent Chinese Restaurant Process, Richard Socher and Christopher D. Manning. Monte Carlo Methods for Modern Applications Workshop - NIPS 2010. [ pdf ]

Connecting Modalities: Semi-supervised Segmentation and Annotation of Images Using Unaligned Text Corpora, Richard Socher and Li Fei-Fei.
IEEE Computer Vision and Pattern Recognition (CVPR 2010). [ pdf ]

2009

A Bayesian analysis of dynamics in free recall, Richard Socher, Sam J. Gershman, Adler Perotte, Per Sederberg, Ken A. Norman, and David M. Blei.
Advances in Neural Information Processing Systems 22 (NIPS 2009). [ pdf ]

Towards Total Scene Understanding: Classification, Annotation and Segmentation in an Automatic Framework, Li-Jia Li, Richard Socher, and Li Fei-Fei.
IEEE Computer Vision and Pattern Recognition (CVPR 2009, Oral). [ pdf ]

ImageNet: A Large-Scale Hierarchical Image Database, Jia Deng, Wei Dong, Richard Socher, Li-Jia Li, Kai Li, and Li Fei-Fei.
IEEE Computer Vision and Pattern Recognition (CVPR 2009). [ pdf ] Won test of time award

2008

A Learning Based Hierarchical Model for Vessel Segmentation, Richard Socher, Adrian Barbu, and Dorin Comaniciu.
In 5th IEEE International Symposium on Biomedical Imaging: From Nano to Macro (ISBI 2008, Oral). [ pdf ]

Theses

PhD ThesisRecursive Deep Learning for Natural Language Processing and Computer Vision, Computer Science Department, Stanford University

Masters Thesis: A Learning-Based Hierarchical Model for Vessel Segmentation, Saarland University, 2008, grade 1.0 A+

Bachelor Thesis: Automatic Extension of Semantic Lexicons with a Bootstrapping Algorithm, Leipzig University, 2006, grade 1.0 A+

Former Students

Small subset of students or interns that I supervised at some point.

  • Jeffrey Pennington, Google Brain

  • Mohit Iyyer, professor in computer science at UMass Amherst

  • Tanay Tandon, Founder at Athelas

  • Ankit Kumar - Co-Founder & CTO - Ubiquity6 Inc.