RoQLlama: A Lightweight Romanian Adapted Language Model
The remarkable achievements obtained by open-source large language models (LLMs) in recent years have predominantly been concentrated on tasks involving the English language. In this paper, we aim to advance the performance of Llama2 models on Romanian tasks. We tackle the problem of reduced computi...
Gespeichert in:
Hauptverfasser: | , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | The remarkable achievements obtained by open-source large language models
(LLMs) in recent years have predominantly been concentrated on tasks involving
the English language. In this paper, we aim to advance the performance of
Llama2 models on Romanian tasks. We tackle the problem of reduced computing
resources by using QLoRA for training. We release RoQLlama-7b, a quantized LLM,
which shows equal or improved results compared to its full-sized counterpart
when tested on seven Romanian downstream tasks in the zero-shot setup. Also, it
consistently achieves higher average scores across all few-shot prompts.
Additionally, we introduce a novel Romanian dataset, namely RoMedQA, which
contains single-choice medical questions in Romanian. |
---|---|
DOI: | 10.48550/arxiv.2410.04269 |