Help us make Indico better by taking this survey! Aidez-nous à améliorer Indico en répondant à ce sondage !

11–13 Mar 2024
CERN
Europe/Zurich timezone

Closed Domain QA System for LBL ScienceIT: Fine-Tuned and Retrieval Augmented Generation Models

13 Mar 2024, 14:30
15m
503/1-001 - Council Chamber (CERN)

503/1-001 - Council Chamber

CERN

162
Show room on map
Lightning talk Technology & Research Technology Bricks: advanced integration

Speaker

Fengchen Liu (Lawrence Berkeley National Laboratory)

Description

This paper proposes the development of a closed-domain Question-Answering (QA) system for LBL ScienceIT, using the ScienceIT website as the data source. The focus is on evaluating different models, specifically two fine-tuned pre-trained language models and three retrieval-augmented generation (RAG) models. Through this comparison, insights into the performance of these models, based on several evaluation metrics, are derived, ultimately highlighting the potential of a certain approach for the specific task. Through this comparative study, we aspire not only to present a robust QA framework for LBL ScienceIT but also to shed light on the dynamics of model selection and optimization for domain-specific tasks, setting the stage for future advancements in the realm of specialized QA systems.

Primary author

Fengchen Liu (Lawrence Berkeley National Laboratory)

Co-author

Jordan Jung (Lawrence Berkeley National Laboratory)

Presentation materials