NewSum-EMNLP 2021: Program (9am - 6pm AST, Nov. 10th)

NewSum workshop main website
09:00 - 10:30    Morning Session I (Zoom Link)
Chair: Fei Liu
Co-Chair: Yue Dong
09:00 - 09:10    Opening remarks
NewSum Organizers
09:10 - 10:00    Keynote I - Shashi Narayan (Google)
Learning from the Past: Bringing Planning Back to Neural Generators
Traditional NLG systems in Reiter and Dale’s vision were inherently grounded and controllable, thanks to a planning stage which played a crucial role in ordering and structuring the information, and in grounding the generation of text to the plan. Modern neural generation systems have advanced NLG beyond our imagination, yet some of the most desired properties such as grounding and controllability have been lost and are still to be mastered. In this talk, I will discuss why we need to bring planning back to neural generation, making generation systems more grounded, controllable, inspectable and trustworthy. I will present several pieces of evidence supporting this direction, drawing on existing work in data-to-text generation, story generation, and summarization.
10:00 - 10:10    Sentence-level Planning for Especially Abstractive Summarization
Andreas Marfurt1 and James Henderson2
1Idiap Research Institute and EPFL, 2Idiap Research Institute
10:10 - 10:20    Template-aware Attention Model for Earnings Call Report Generation
Yangchen Huang, Seyed Danial Mohseni Taheri, Prashant Dhingra
JP Morgan Chase & Company
10:20 - 10:25    Knowledge and Keywords Augmented Abstractive Sentence Summarization
Shuo Guan
NYU Courant
10:25 - 10:30    Rewards with Negative Examples for Reinforced Topic-Focused Abstractive Summarization
Khalil Mrini1, Can Liu2, Markus Dreyer2
1University of California, San Diego, 2Amazon.com
10:30 - 11:00    Coffee break (GatherTown Link)
11:00 - 12:00    Morning Session II (Zoom Link)
Chair: Yue Dong
Co-Chair: Jackie Cheung
11:00 - 11:50    Keynote II - Sebastian Gehrmann (Google)
Breaking News: It’s time to fix the evaluation of generated text
Language generation has undergone multiple paradigm shifts from constructed grammars and modular systems toward end-to-end supervised (neural) approaches, and now, almost every system is built on pretrained models. As a result, how generated text looks has changed a lot; it is now much more fluent and most of its issues relate to its content. Yet, we still use the same metrics, some of the same corpora, and how to conduct human evaluations remains a mystery. Throughout this talk, we will explore many examples of broken evaluations in summarization and other generation applications. I will discuss the implications that broken evaluation pipelines have on model development and the overall progress in the field. And I will show some promising results on developing evaluation suites, learned metrics, and meta-evaluations that have the potential to improve how generated text is evaluated.
11:50 - 12:00    A Novel Wikipedia based Dataset for Monolingual and Cross-Lingual Summarization
Mehwish Fatima and Michael Strube
Heidelberg Institute for Theoretical Studies (HITS gGmbH)
12:00 - 13:00    Lunch break (GatherTown Link)
13:00 - 14:30    Afternoon Session I (Zoom Link)
Chair: Lu Wang
Co-Chair: Yue Dong
13:00 - 13:50    Keynote III - Asli Celikyilmaz (Facebook AI Research)
Tune in To Your Language Model for Better Text Generation
With today’s neural language models, we can teach computers to summarize online meetings, write creative stories or articles about an event, hold longer conversations in customer-service applications, chit-chat about daily activities with individuals, and describe pictures to the visually impaired, to name a few. In this talk, I will discuss challenges and shortcomings of building such systems with the current neural text generation models, focusing on issues relating to collecting and annotating training datasets and building new architectures to model the intrinsic structure of conversations. I will present our recent approaches that imbue transformer-based neural generators with structural representations by way of implicit memory architectures and latent structural embeddings. I will conclude my talk by pointing to avenues for future research.

13:50 - 13:55    Evaluation of Summarization Systems across Gender, Age, and Race
Anna Jørgensen1 and Anders Søgaard2
1University of Amsterdam, 2University of Copenhagen
13:55 - 14:00    Evaluation of Abstractive Summarisation Models with Machine Translation in Deliberative Processes
Miguel Arana-Catania1, Rob Procter1, Yulan He1, Maria Liakata2
1University of Warwick, 2Queen Mary University of London
14:00 - 14:10    Capturing Speaker Incorrectness: Speaker-Focused Post-Correction for Abstractive Dialogue Summarization
Dongyub Lee1, Jungwoo Lim2, Taesun Whang3, Chanhee Lee2, Seungwoo Cho4, Mingun Park5, Heuiseok Lim2
1Kakao Corp, 2Korea University, 3Wisenut Inc., 4Kakao Enterprise, 5Microsoft
14:10 - 14:20    Measuring Similarity of Opinion-bearing Sentences
Wenyi Tay1, Xiuzhen Zhang1, Stephen Wan2, Sarvnaz Karimi2
1RMIT University, 2CSIRO
14:20 - 14:30    EASE: Extractive-Abstractive Summarization End-to-End using the Information Bottleneck Principle
Haoran Li1, Arash Einolghozati1, Srinivasan Iyer1, Bhargavi Paranjape2, Yashar Mehdad3, Sonal Gupta1, Marjan Ghazvininejad4
1Facebook, 2University of Washington, 3Facebook AI, 4Facebook AI Research
14:30 - 15:00    Coffee break (GatherTown Link)
15:00 - 15:35    Afternoon Session II (Zoom Link)
Chair: Jackie Cheung
Co-Chair: Giuseppe Carenini
15:00 - 15:10    Context or No Context? A preliminary exploration of human-in-the-loop approach for Incremental Temporal Summarization in meetings
Nicole Beckage, Shachi H Kumar, Saurav Sahay, Ramesh Manuvinakurike
Intel Labs
15:10 - 15:20    Are We Summarizing the Right Way? A Survey of Dialogue Summarization Data Sets
Don Tuggener1, Margot Mieskes2, Jan Deriu1, Mark Cieliebak1
1Zurich University of Applied Sciences, 2University of Applied Sciences, Darmstadt
15:20 - 15:30    Modeling Endorsement for Multi-Document Abstractive Summarization
Logan Lebanoff1, Bingqing Wang2, Zhe Feng3, Fei Liu4
1Soar Technology, Inc., 2Bosch Research & Technology Center North America, 3Bosch, 4University of Central Florida
15:30 - 15:35    SUBSUME: A Dataset for Subjective Summary Extraction from Wikipedia Documents
Nishant Yadav, Matteo Brucato, Anna Fariha, Oscar Youngquist, Julian Killingback, Alexandra Meliou, Peter Haas
University of Massachusetts Amherst
15:35 - 16:15    EMNLP Findings papers - Summarization (Zoom Link)
Chair: Jackie Cheung
Co-Chair: Giuseppe Carenini
15:35 - 15:40    Exploring Multitask Learning for Low-Resource Abstractive Summarization
Ahmed Magooda, Diane Litman, Mohamed Elaraby
University of Pittsburgh
15:40 - 15:45    Mitigating Data Scarceness through Data Synthesis, Augmentation and Curriculum for Abstractive Summarization
Ahmed Magooda and Diane Litman
University of Pittsburgh
15:45 - 15:50    TWEETSUMM - A Dialog Summarization Dataset for Customer Service
Guy Feigenblat, Chulaka Gunasekara, Benjamin Sznajder, Ranit Aharonov, David Konopnicki, Sachindra Joshi
IBM Research
15:50 - 15:55    Convex Aggregation for Opinion Summarization
Hayate Iso1, Xiaolan Wang1, Yoshihiko Suhara1, Stefanos Angelidis2, Wang-Chiew Tan3
1Megagon Labs, 2University of Edinburgh, 3Facebook AI
15:55 - 16:00    "Let Your Characters Tell Their Story": A Dataset for Character-Centric Narrative Understanding
Faeze Brahman1, Meng Huang2, Oyvind Tafjord3, Chao Zhao4, Mrinmaya Sachan5, Snigdha Chaturvedi4
1UC Santa Cruz, 2University of Chicago, 3AI2, 4University of North Carolina at Chapel Hill, 5ETH Zurich
16:00 - 16:05    Retrieval Augmented Code Generation and Summarization
Md Rizwan Parvez1, Wasi Ahmad1, Saikat Chakraborty2, Baishakhi Ray2, Kai-Wei Chang1
1University of California, Los Angeles, 2Columbia University
16:05 - 16:10    MiRANews: Dataset and Benchmarks for Multi-Resource-Assisted News Summarization
Xinnuo Xu1, Ondřej Dušek2, Shashi Narayan3, Verena Rieser1, Ioannis Konstas1
1Heriot-Watt University, 2Charles University, 3Google
16:10 - 16:15    Leveraging Pretrained Models for Automatic Summarization of Doctor-Patient Conversations
Longxiang Zhang1, Renato Negrinho2, Arindam Ghosh3, Vasudevan Jagannathan3, Hamid Reza Hassanzadeh4, Thomas Schaaf1, Matthew R. Gormley2
13M | M*Modal, 2Carnegie Mellon University, 33M, 43M HIS
16:15 - 16:45    Coffee break (GatherTown Link)
16:45 - 18:00    Afternoon Session III
Chair: Giuseppe Carenini
Co-Chair: Fei Liu, Jackie Cheung, Lu Wang, Yue Dong
16:45 - 16:55    TLDR9+: A Large Scale Resource for Extreme Summarization of Social Media Posts
Sajad Sotudeh1, Hanieh Deilamsalehy2, Franck Dernoncourt2, Nazli Goharian1
1Georgetown University, 2Adobe Research
16:55 - 17:00    A New Dataset and Efficient Baselines for Document-level Text Simplification in German
Annette Rios, Nicolas Spring, Tannon Kew, Marek Kostrzewa, Andreas Säuberli, Mathias Müller, Sarah Ebling
University of Zurich
17:00 - 18:00    Mentoring Program (Zoom Link)