AIPI 590 Large Language Models

Project 1 - Fine Tuning LLM

Files:

  • model.ipynb
    • notebook containing the code for fine tuning the Llama 3 model using QLoRa
  • data/train.json
    • json file containing the training set provided in the FINQA paper
  • data/test.json
    • json file containing the validation set provided in the FINQA paper

Process:

The focal property of interest is analysis financial documents for numerical reasoning. Specifically numerical reasoning over quarterly financial filings with the SEC. The Llama-3-8B model was chosen to fine tune using the QLoRa approach. This approach was chosen due to the paper's findings of a performance increase while utilizing minimal memory and hardware. The aggressive quantization seemed to significantly decreased training time while offering increased performance on financial analysis.

Evaluation:

Rouge Score

ROUGE Score Base Model QLoRa Fine Tuned Model
ROUGE-1 0.05104785 0.25257307
ROUGE-2 0.01158752 0.10479990
ROUGE-L 0.05104785 0.25175429

Collaborators:

  • Keese Phillips

Attribution:

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for keesephillips/qlora-llama-3-8b

Finetuned
(476)
this model