faceted browsing for acl anthology

12
Faceted browsing for ACL Anthology Praveen Bysani

Upload: galvin

Post on 14-Feb-2016

60 views

Category:

Documents


0 download

DESCRIPTION

Faceted browsing for ACL Anthology. Praveen Bysani. ACL Anthology. a digital archive of research papers in CL and NLP contains over 20,100 papers free of cost a rchive for sister conferences and journals. Current browser. d irect and navigational search hard to navigate - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Faceted  browsing for ACL Anthology

Faceted browsing for ACL Anthology

Praveen Bysani

Page 2: Faceted  browsing for ACL Anthology

ACL Anthology

• a digital archive of research papers in CL and NLP

• contains over 20,100 papers

• free of cost

• archive for sister conferences and journals

Page 3: Faceted  browsing for ACL Anthology

Current browser

• direct and navigational search

• hard to navigate

• non-customized search

• non-sortable results

Page 4: Faceted  browsing for ACL Anthology

Faceted browsing

• Combination of navigational and direct search paradigms

• Facets are properties of information elements

• Access to organized information

• Ability to explore the collection in multiple dimensions through filters

Page 5: Faceted  browsing for ACL Anthology

Faceted Browsing

• RoR + Blacklight plugin

• Apache Solr

• Metadata from XML

• Blacklight customization for XML

Page 6: Faceted  browsing for ACL Anthology

Show view

Page 7: Faceted  browsing for ACL Anthology

Index View

Page 8: Faceted  browsing for ACL Anthology

More cookies..

• User Feedback• Comment/ Share / Like • Suggestions for correcting the meta data

• Ability to export bib in six formats

• Author pages• List of publications• Co-authors

Page 9: Faceted  browsing for ACL Anthology

• Third-party annotations• Automatically annotate articles with new metadata• Anthology as a corpus • API to make anthology an object of study

• OAI compatible• allows metadata harvesting

• @ http://aclanthology.heroku.com/

Page 10: Faceted  browsing for ACL Anthology

Challenges

• Normalizing the quality of anthology meta data information

• SIG Information• yaml files• no identifiers provided

• DOI• from acm• changes in names of papers, authors

Page 11: Faceted  browsing for ACL Anthology

Similar works

ACL Author Network

• bibliometrics

ACL Search Bench

• Semantic search

Page 12: Faceted  browsing for ACL Anthology

Plans for the future• A common data schema to integrate all

• Indexing the whole text data

• Range queries for year facet

• Exporting total volume bibliography

• Enriching author pages