First Workshop on Scholarly Document Processing (SDP 2020)

Muthu Kumar Chandrasekaran, Guy Feigenblat, Dayne Freitag, Tirthankar Ghosal, Michal Shmueli-Scheuer, Eduard Hovy, Petr Knoth, David Konopnicki, Philipp Mayr, Robert Patton, Dominika Tkaczyk and Anita de Waard

Live Session 1: Nov 19, 13:45-22:10 UTC / 13:45-22:10 GMT
SDP is a full day workshop that provides an interdisciplinary venue for researchers interested in any aspect of mining scientific literature. SDP includes a research track and three shared tasks: 6th CL-SciSumm, 1st LongSumm, 1st LaySumm.

Time (PDT) Event Hosts
Nov 19, 13:45-14:00 UTC / 13:45-14:00 GMT

Opening Remarks

Philipp Mayr
Nov 19, 14:00-14:15 UTC / 14:00-14:15 GMT

Teaser for Shared Tasks (5mins each)

tbd
Nov 19, 14:15-15:35 UTC / 14:15-15:35 GMT

Research Track: Session 1 COVID-19 document processing

Tirthankar Ghosal
Nov 19, 14:15-14:35 UTC / 14:15-14:35 GMT

Wu et al.: Acknowledgement Entity Recognition in CORD-19 Papers

TBD
Nov 19, 14:35-14:55 UTC / 14:35-14:55 GMT

Bhambhoria et al.: A Smart System to Generate and Validate Question Answer Pairs for COVID-19 Literature.

TBD
Nov 19, 14:55-15:15 UTC / 14:55-15:15 GMT

Zhang et al.: Covidex: Neural Ranking Models and Keyword Search Infrastructure for the COVID-19 Open Research Dataset.

TBD
Nov 19, 15:15-15:35 UTC / 15:15-15:35 GMT

Satish et al.: The impact of preprint servers in the formation of novel ideas.

TBD
Nov 19, 15:40-16:25 UTC / 15:40-16:25 GMT

Keynote 1 (incl. QA): Kuansan Wang Mitigating scholarly corpus biases with citations: A case study on CORD-19

Philipp Mayr
Nov 19, 16:25-16:50 UTC / 16:25-16:50 GMT

Break

TBD
Nov 19, 16:50-17:50 UTC / 16:50-17:50 GMT

Research Track: Session 2 SDP mixed session

Dayne Freitag
Nov 19, 16:50-17:10 UTC / 16:50-17:10 GMT

Berger et al.: Effective Distributed Representations for Academic Expert Search.

TBD
Nov 19, 17:10-17:30 UTC / 17:10-17:30 GMT

Kim et al.: Learning CNF Blocking for Large-scale Author Name Disambiguation.

TBD
Nov 19, 17:30-17:50 UTC / 17:30-17:50 GMT

Müller: Reconstructing Manual Information Extraction with DB-to-Document Backprojection: Experiments in the Life Science Domain.

TBD
Nov 19, 17:50-18:30 UTC / 17:50-18:30 GMT

Poster Pitches

tbd
Nov 19, 18:30-19:00 UTC / 18:30-19:00 GMT

Break

TBD
Nov 19, 19:00-19:50 UTC / 19:00-19:50 GMT

Research Track: Session 3: Short papers and Findings

Muthu
Nov 19, 19:00-19:10 UTC / 19:00-19:10 GMT

Ling & Chen: DeepPaperComposer: A Simple Solution for Training Data Preparation for Parsing Research Papers

TBD
Nov 19, 19:10-19:20 UTC / 19:10-19:20 GMT

Medic & Snajder: Improved Local Citation Recommendation Based on Context Enhanced with Global Information

TBD
Nov 19, 19:20-19:30 UTC / 19:20-19:30 GMT

Cao et al.: Will This Idea Spread Beyond Academia? Understanding Knowledge Transfer of Scientific Concepts across Text Corpora (Findings of EMNLP)

TBD
Nov 19, 19:30-19:40 UTC / 19:30-19:40 GMT

Subramanian et al.: MedICaT: A Dataset of Medical Images, Captions, and Textual References (Findings of EMNLP)

TBD
Nov 19, 19:40-19:50 UTC / 19:40-19:50 GMT

Noh et al.: Literature Retrieval for Precision Medicine with Neural Matching and Faceted Summarization (Findings of EMNLP)

TBD
Nov 19, 19:50-20:20 UTC / 19:50-20:20 GMT

Overview of Results of the Shared Tasks

Muthu, Anita, Guy, Michal
Nov 19, 20:20-20:30 UTC / 20:20-20:30 GMT

Break

TBD
Nov 19, 20:30-21:15 UTC / 20:30-21:15 GMT

Keynote 2: Steinn Sigurdsson The future of arXiv and knowledge discovery in open science

Tirthankar Ghosal
Nov 19, 21:15-22:00 UTC / 21:15-22:00 GMT

Plenary Regroup & Panel (OC + Keynote Speakers)

tbd
Nov 19, 22:00-22:10 UTC / 22:00-22:10 GMT

Closing

TBD