Things you can do dumping your Invenio database into a flat file

Talk

Title

Things you can do dumping your Invenio database into a flat file

Video

If you experience any problem watching the video, click the download button below

Mp4:		Medium (800 kbps)	High (2000 kbps)	More..
	800 kbps 800 kbps 1000 kbps

Copy-paste this code into your page:
<iframe width="640" height="360" frameborder="0" src="https://cds.cern.ch/video/2259672?showTitle=true" allowfullscreen></iframe>
Copy-paste this code into your page to include both slides and lecture:
<iframe src="https://mediastream.cern.ch/MediaArchive/Video/Public2/weblecture-player/index.html?year=2017&lecture=557956c1" width="1020px" height="600px" allowfullscreen scrolling="no" frameborder="0"></iframe>

Author(s)

Jorba, Ferran (speaker) (Universitat Autònoma de Barcelona)

Corporate author(s)

CERN. Geneva

Imprint

2017-03-22. - Streaming video.

Series

(Invenio User Group Workshops)
(Invenio User Group Workshop 2017)

Lecture note

on 2017-03-22T09:00:00

Subject category

Invenio User Group Workshops

Abstract

Invenio database design and interfaces are optimized for fast end user search and retrieval. As administrators, we can add indexes at will and use them via web or API. However, many maintenance tasks are not well covered with those indexes. For most of those cases, reading the records sequentialy is the optimal solution. However, if the database is large enough, reading them via Invenio API may take hours, while the system slows down and it may become unresponsive. In this presentation I'll show a small Python tool that uses Invenio API and a SQLite database as cache to keep an up to date flat file with your bibliographic records. We'll see how whith this flat file it is much faster and easier to do tasks like generate specialised statistics, quality control, automatic record enrichment or cleaning, or even creating exotic indexes or counters.

Submitted by

jean-yves.le.meur@cern.ch

Back to search

Record created 2017-04-13, last modified 2022-11-02

Similar records

External links:

Talk details

Event details

Add to personal basket
Export as BibTeX, MARC, MARCXML, DC, EndNote, NLM, RefWorks

CERN Document Server

Access articles, reports and multimedia content in HEP

Main menu

CERN Accelerating science