Skip to content

DongSky/MIRAGE

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

MIRAGE and Logos

MIRAGE: Assessing Hallucination in Multimodal Reasoning Chains of MLLM

paper data Logos_train_data Logos-3B Logos-7B

Eval

Our evaluation code is based on VIC

  1. Download eval data into eval code directory, then switch to this directory

  2. execute following code for inference

python -m Vic.benchmark_test -p mirage.tsv -i original (for reasoning mllms)

or 

python -m Vic.benchmark_test -p mirage.tsv -i cot (for vanilla mllms)
  1. Evaluation, specifically, for accuracy:
python -m Vic.benchmark_eval -b mirage -p output_inference_results.tsv

Logos Train

We implement our train code based on OpenRLHF. We will upload our version soon.

The training data has been released in Logos_train_data.

About

Data, checkpoints, and evaluation code have been released.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages