First Workshop on Scholarly Document Processing (SDP 2020)

Muthu Kumar Chandrasekaran, Guy Feigenblat, Dayne Freitag, Tirthankar Ghosal, Michal Shmueli-Scheuer, Eduard Hovy, Petr Knoth, David Konopnicki, Philipp Mayr, Robert Patton, Dominika Tkaczyk and Anita de Waard

Live Session 1: Nov 19, Live Session 1: Nov 19 (13:45-22:10 UTC)
SDP is a full day workshop that provides an interdisciplinary venue for researchers interested in any aspect of mining scientific literature. SDP includes a research track and three shared tasks: 6th CL-SciSumm, 1st LongSumm, 1st LaySumm.

Time (PDT) Event Hosts
Nov 19, (13:45-14:00 UTC)

Opening Remarks

Philipp Mayr
Nov 19, (14:00-14:15 UTC)

Teaser for Shared Tasks (5mins each)

tbd
Nov 19, (14:15-15:35 UTC)

Research Track: Session 1 COVID-19 document processing

Tirthankar Ghosal
Nov 19, (14:15-14:35 UTC)

Wu et al.: Acknowledgement Entity Recognition in CORD-19 Papers

TBD
Nov 19, (14:35-14:55 UTC)

Bhambhoria et al.: A Smart System to Generate and Validate Question Answer Pairs for COVID-19 Literature.

TBD
Nov 19, (14:55-15:15 UTC)

Zhang et al.: Covidex: Neural Ranking Models and Keyword Search Infrastructure for the COVID-19 Open Research Dataset.

TBD
Nov 19, (15:15-15:35 UTC)

Satish et al.: The impact of preprint servers in the formation of novel ideas.

TBD
Nov 19, (15:40-16:25 UTC)

Keynote 1 (incl. QA): Kuansan Wang Mitigating scholarly corpus biases with citations: A case study on CORD-19

Philipp Mayr
Nov 19, (16:25-16:50 UTC)

Break

TBD
Nov 19, (16:50-17:50 UTC)

Research Track: Session 2 SDP mixed session

Dayne Freitag
Nov 19, (16:50-17:10 UTC)

Berger et al.: Effective Distributed Representations for Academic Expert Search.

TBD
Nov 19, (17:10-17:30 UTC)

Kim et al.: Learning CNF Blocking for Large-scale Author Name Disambiguation.

TBD
Nov 19, (17:30-17:50 UTC)

Müller: Reconstructing Manual Information Extraction with DB-to-Document Backprojection: Experiments in the Life Science Domain.

TBD
Nov 19, (17:50-18:30 UTC)

Poster Pitches

tbd
Nov 19, (18:30-19:00 UTC)

Break

TBD
Nov 19, (19:00-19:50 UTC)

Research Track: Session 3: Short papers and Findings

Muthu
Nov 19, (19:00-19:10 UTC)

Ling & Chen: DeepPaperComposer: A Simple Solution for Training Data Preparation for Parsing Research Papers

TBD
Nov 19, (19:10-19:20 UTC)

Medic & Snajder: Improved Local Citation Recommendation Based on Context Enhanced with Global Information

TBD
Nov 19, (19:20-19:30 UTC)

Cao et al.: Will This Idea Spread Beyond Academia? Understanding Knowledge Transfer of Scientific Concepts across Text Corpora (Findings of EMNLP)

TBD
Nov 19, (19:30-19:40 UTC)

Subramanian et al.: MedICaT: A Dataset of Medical Images, Captions, and Textual References (Findings of EMNLP)

TBD
Nov 19, (19:40-19:50 UTC)

Noh et al.: Literature Retrieval for Precision Medicine with Neural Matching and Faceted Summarization (Findings of EMNLP)

TBD
Nov 19, (19:50-20:20 UTC)

Overview of Results of the Shared Tasks

Muthu, Anita, Guy, Michal
Nov 19, (20:20-20:30 UTC)

Break

TBD
Nov 19, (20:30-21:15 UTC)

Keynote 2: Steinn Sigurdsson The future of arXiv and knowledge discovery in open science

Tirthankar Ghosal
Nov 19, (21:15-22:00 UTC)

Plenary Regroup & Panel (OC + Keynote Speakers)

tbd
Nov 19, (22:00-22:10 UTC)

Closing

TBD