Chris Kedzie

email: [kedzie@cs.columbia.edu]
web:

(About) And you may ask yourself, well, how did I get here?
Salutations! I am senior researcher at Microsoft Semantic Machinesworking on conversational interfaces. I was a PhD student (2014-2020) in the Department of Computer Science at Columbia University, where I worked in the Natural Language Processing (NLP) group with my advisor Kathy McKeown and many other wonderful colleagues. Before that I worked as a composer’s assistant and even before that, I studied classical guitar and music theory/composition at Loyola Marymount University. My advisor and I were recently very fortunate to receive a best paper award at INLG 2019.

(Research) And you may ask yourself, "How do I work this?"
I am interested in computational models of natural language generation and understanding. I am actively investigating methods for improving neural network-based language models via interaction with secondary models of semantics and/or syntactic structure. Currently, I am exploring various cooperative learning schemes where a semantic parser is used to validate the outputs of a learned neural language generation model, not only at test time, but also as a teacher providing noisey supervision during training. Some of my research interests include:

Text Generation: Deep Neural Network (DNN) models of text generation, paraphrase, and summarization.
Faithful/Controllable Generation: Conditional generation of natural language from formal meaning representations/semantics/data, with an emphasis on ensuring the correctness of the generated text with respect to model inputs.
Inductive Bias in Text Generation Tasks: Understanding when humans find text or data interesting, salient, or otherwise remarkable, and building models to do the same in the context of document/data summarization.

I also apply machine learning to natural language data to make useful predictions, such as predicting the importance of text for summarization or extracting signals from the web for social scientists.

Selected Publications

For a complete list please see my Google Scholar profile.

Controllable Meaning Representation to Text Generation: Linearization and Data Augmentation Strategies

Chris Kedzie and Kathleen McKeown.

in Proceedings of Empirical Methods in Natural Language Processing. 2020.

[pdf]

Incorporating Terminology Constraints in Automatic Post-Editing

David Wan, Chris Kedzie, Faisal Ladhak, Marine Carpuat, and Kathleen McKeown.

in Proceedings of the Fifth Conference on Machine Translation. 2020.

[pdf] [code]

A Good Sample is Hard to Find: Noise Injection Sampling and Self-Training for Neural Language Generation Models

Chris Kedzie and Kathleen McKeown.

in Proceedings of the 12th International Conference on Natural Language Generation. 2019. (Best Paper Award)

[pdf] [code]

Low-Level Linguistic Controls for Style Transfer and Content Preservation

Katy Gero, Chris Kedzie, Jonathan Reeve, and Lydia Chilton.

in Proceedings of the 12th International Conference on Natural Language Generation. 2019.

[pdf] [code]

Content Selection in Deep Learning Models of Summarization

Chris Kedzie, Kathleen McKeown, Hal Daume III.

in Proceedings of Empirical Methods in Natural Language Processing. 2018.

[pdf] [code]

Real-Time Web Scale Event Summarization Using Sequential Decision Making

Chris Kedzie, Fernando Diaz, and Kathleen McKeown.

in Proceedings of the International Joint Conference on Artificial Intelligence. 2016.

[pdf]

Predicting Salient Updates for Disaster Summarization

Chris Kedzie, Kathleen McKeown, and Fernando Diaz.

in Proceedings of the 53nd Annual Meeting of the Association for Computational Linguistics. 2015.

[pdf]

Multimodal social media analysis for gang violence prevention

Philipp Blandfort, Desmond U Patton, William R Frey, Svebor Karaman, Surabhi Bhargava, Fei-Tzin Lee, Siddharth Varia, Chris Kedzie, Michael B Gaskell, Rossano Schifanella, Kathleen McKeown, Shih-Fu Chang.

in Proceedings of the International AAAI Conference on Web and Social Media. 2019.

[pdf] [code]

Detecting Gang-Involved Escalation on Social Media Using Context

Serina Chang, Ruiqi Zhong, Ethan Adams, Fei-Tzin Lee, Siddharth Varia, Desmond U Patton, William R Frey, Chris Kedzie, Kathleen McKeown.

in Proceedings of Empirical Methods in Natural Language Processing. 2018.

[pdf] [code]

CV

Publications