ActiveVisionLab/CrossScorePublic

NotificationsYou must be signed in to change notification settings
Fork0
Star14

(ECCV 2024) CrossScore: Towards Multi-View Image Evaluation and Scoring

License

BSD-3-Clause license

14 stars 0 forks Branches Tags Activity

Star

Notifications

You must be signed in to change notification settings

Branches Tags

Folders and files

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
.vscode		.vscode
ckpt		ckpt
config		config
dataloading		dataloading
model		model
task		task
utils		utils
.gitattributes		.gitattributes
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
environment.yaml		environment.yaml
predict.sh		predict.sh
pyrightconfig.json		pyrightconfig.json

Repository files navigation

CrossScore: Towards Multi-View Image Evaluation and Scoring

Project Page |arXiv

Zirui Wang,Wenjing Bian,Victor Adrian Prisacariu.

Active Vision Lab (AVL),University of Oxford.

Table of Content

Environment

We provide aenvironment.yaml file to set up aconda environment:

git clone https://github.com/ActiveVisionLab/CrossScore.gitcd CrossScoreconda env create -f environment.yamlconda activate CrossScore

Data

TLDR: download thisfile (~3GB),put it indatadir:

mkdir datadircd datadirwget https://www.robots.ox.ac.uk/~ryan/CrossScore/MFR_subset_demo.tar.gztar -xzvf MFR_subset_demo.tar.gzrm MFR_subset_demo.tar.gzcd ..

To demonstrate a minimum working example for training and inferencing steps shown below,we provide a small pre-processed subset.The is a subset ofMap-Free Relocalisation (MFR)and is pre-processed using3D Gaussian Splatting (3DGS).This small demo dataset is available at thislink (~3GB).This is the file in TLDR.We only use this demo subset to present the expected dataloading structure.

In our actual training, our model is trained using MFR that pre-processed by three NVS methods:3DGS,TensoRF, andNeRFacto.Due to the preprocessed file size (~2TB), it is challenging to directly sharethis pre-processed data. One work around is to release a data pre-processing scriptfor MFR, which we are still tidying up.We aim to release the pre-processing script in Dec 2024.

Training

We train our model with two NVIDIA A5000 (24GB) GPUs for about two days.However, the model should perform reasonably well after 12 hours of training.It is also possible to train with a single GPU.

python task/train.py trainer.devices='[0,1]'# 2 GPUs# python task/train.py trainer.devices='[0]'  # 1 GPU

Inferencing

We provide an example command to predict CrossScore for NVS rendered imagesby referencing real captured images.

git lfs install&& git lfs pull# get our ckpt using git LFSbash predict.sh

After running the script, our CrossScore score maps should be written topredict dir.The output should be similar to ourdemo videoon our project page.

Todo

Create a HuggingFace demo page.
Release ECCV quantitative results related scripts.
Release data processing scripts
Release PyPI and Conda package.

Acknowledgement

This research is supported by anARIAresearch gift grant from Meta Reality Lab. We gratefully thankShangzhe Wu,Tengda Han,Zihang Lai for insightful discussions, andMichael Hobley for proofreading.

Citation

@inproceedings{wang2024crossscore,title={CrossScore: Towards Multi-View Image Evaluation and Scoring},author={Zirui Wang and Wenjing Bian and Victor Adrian Prisacariu},booktitle={ECCV},year={2024}}

About

(ECCV 2024) CrossScore: Towards Multi-View Image Evaluation and Scoring

crossscore.active.vision/

Movatterモバイル変換

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

License

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

CrossScore: Towards Multi-View Image Evaluation and Scoring

Table of Content

Environment

Data

Training

Inferencing

Todo

Acknowledgement

Citation

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Languages