VicenteVivan/geo-clip

This is an official PyTorch implementation of our NeurIPS 2023 paper "GeoCLIP: Clip-Inspired Alignment between Locations and Images for Effective Worldwide Geo-localization"

License

MIT license

🌎 GeoCLIP: Clip-Inspired Alignment between Locations and Images for Effective Worldwide Geo-localization

📍 Try out our demo!

Description

GeoCLIP addresses the challenges of worldwide image geo-localization with a novel CLIP-inspired approach that aligns images with geographical locations. It achieves state-of-the-art results on geo-localization and on GPS-to-vector representation across benchmark datasets (Im2GPS3k, YFCC26k, GWS15k, and the Geo-Tagged NUS-Wide Dataset). Our location encoder models the Earth as a continuous function, learning semantically rich, CLIP-aligned features that are suitable for geo-localization. The location encoder architecture also generalizes, making it suitable for use as a pre-trained GPS encoder to aid geo-aware neural architectures.

Method

Similar to OpenAI's CLIP, GeoCLIP is trained contrastively by matching image-GPS pairs. Using the MP-16 dataset, composed of 4.7M images taken across the globe, GeoCLIP learns distinctive visual features associated with different locations on Earth.
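To make "trained contrastively by matching image-GPS pairs" concrete, here is a minimal sketch of a CLIP-style symmetric contrastive loss written in plain Python. It is an illustration under assumptions, not the training code of this repository: the function names, the temperature value of 0.07, and the toy 2-dimensional embeddings are all hypothetical, and the exact loss used in the paper may differ.

```python
import math


def l2_normalize(vec):
    """Scale a vector to unit length so dot products become cosine similarities."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec]


def cross_entropy_row(logits, target_index):
    """Numerically stable -log(softmax(logits)[target_index])."""
    m = max(logits)
    log_sum_exp = m + math.log(sum(math.exp(z - m) for z in logits))
    return log_sum_exp - logits[target_index]


def symmetric_contrastive_loss(image_embs, gps_embs, temperature=0.07):
    """Average of image-to-GPS and GPS-to-image cross-entropy over a batch.

    Row i of image_embs is assumed to be paired with row i of gps_embs.
    The temperature is a hypothetical default, not a value from the README.
    """
    imgs = [l2_normalize(v) for v in image_embs]
    locs = [l2_normalize(v) for v in gps_embs]
    n = len(imgs)
    # logits[i][j]: scaled cosine similarity between image i and location j
    logits = [
        [sum(a * b for a, b in zip(imgs[i], locs[j])) / temperature for j in range(n)]
        for i in range(n)
    ]
    loss_i2g = sum(cross_entropy_row(logits[i], i) for i in range(n)) / n
    loss_g2i = sum(
        cross_entropy_row([logits[i][j] for i in range(n)], j) for j in range(n)
    ) / n
    return 0.5 * (loss_i2g + loss_g2i)


# Toy batch: correctly matched pairs should give a lower loss than swapped pairs.
matched = symmetric_contrastive_loss([[1.0, 0.0], [0.0, 1.0]], [[1.0, 0.0], [0.0, 1.0]])
shuffled = symmetric_contrastive_loss([[1.0, 0.0], [0.0, 1.0]], [[0.0, 1.0], [1.0, 0.0]])
print(matched < shuffled)
```

The loss is small when each image embedding is closest to its own GPS embedding and large when the pairing is wrong, which is the behavior a contrastive objective of this kind rewards.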

🚧 Repo Under Construction 🔨

📎 Getting Started: API

You can install GeoCLIP's module using pip:

pip install geoclip

or directly from source:

git clone https://github.com/VicenteVivan/geo-clip
cd geo-clip
python setup.py install

🗺️📍 Worldwide Image Geolocalization

Usage: GeoCLIP Inference

import torch
from geoclip import GeoCLIP

model = GeoCLIP()

image_path = "image.png"

top_pred_gps, top_pred_prob = model.predict(image_path, top_k=5)

print("Top 5 GPS Predictions")
print("=====================")
for i in range(5):
    lat, lon = top_pred_gps[i]
    print(f"Prediction {i+1}: ({lat:.6f}, {lon:.6f})")
    print(f"Probability: {top_pred_prob[i]:.6f}")
    print("")
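When a ground-truth location is available, one way to judge the returned predictions is the great-circle distance between each predicted point and the true point. The sketch below assumes predictions are (lat, lon) pairs in degrees, as unpacked in the inference example above. The helper names and the sample "true" location are hypothetical and not part of the geoclip API, and values returned by the model may need converting to plain floats first.

```python
import math

EARTH_RADIUS_KM = 6371.0  # mean Earth radius


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in kilometres between two (lat, lon) points in degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlambda = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlambda / 2) ** 2
    # Clamp to 1.0 to guard against tiny floating-point overshoot.
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(min(1.0, a)))


def best_prediction_error_km(predictions, true_lat, true_lon):
    """Smallest distance between any predicted point and the true location."""
    return min(haversine_km(lat, lon, true_lat, true_lon) for lat, lon in predictions)


# Placeholder predictions (not real model output): New York City and Los Angeles.
predictions = [(40.7128, -74.0060), (34.0522, -118.2437)]
# Hypothetical ground truth: Philadelphia.
error_km = best_prediction_error_km(predictions, 39.9526, -75.1652)
print(f"Closest prediction is {error_km:.1f} km from the true location")
```

Taking the minimum over the top-k list measures how close the best candidate is; using only the first prediction would measure top-1 error instead.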

🌐 Worldwide GPS Embeddings

In our paper, we show that once trained, our location encoder can assist other geo-aware neural architectures. Specifically, we explore our location encoder's ability to improve multi-class classification accuracy. We achieved state-of-the-art results on the Geo-Tagged NUS-Wide Dataset by concatenating GPS features from our pre-trained location encoder with an image's visual features. Additionally, we found that the GPS features learned by our location encoder, even without extra information, are effective for geo-aware image classification, achieving state-of-the-art performance in the GPS-only multi-class classification task on the same dataset.
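The concatenation described above can be sketched in a few lines of plain Python. This is only an illustration of the idea of joining a GPS feature vector to an image feature vector before a linear classifier. The function names, the tiny feature sizes, and the random values are hypothetical stand-ins, not the setup used in the paper.

```python
import random


def concat_features(image_features, gps_features):
    """Join visual features and GPS features into one input vector."""
    return list(image_features) + list(gps_features)


def linear_scores(features, weights, bias):
    """One score per class: dot(weights[c], features) + bias[c]."""
    return [sum(w * x for w, x in zip(row, features)) + b for row, b in zip(weights, bias)]


random.seed(0)
IMAGE_DIM, GPS_DIM, NUM_CLASSES = 8, 4, 3  # hypothetical small sizes for readability

# Stand-ins for features that would come from an image backbone and a location encoder.
image_features = [random.gauss(0, 1) for _ in range(IMAGE_DIM)]
gps_features = [random.gauss(0, 1) for _ in range(GPS_DIM)]

fused = concat_features(image_features, gps_features)

# Untrained, randomly initialised classifier weights (illustration only).
weights = [[random.gauss(0, 0.1) for _ in range(len(fused))] for _ in range(NUM_CLASSES)]
bias = [0.0] * NUM_CLASSES

scores = linear_scores(fused, weights, bias)
print(len(fused), len(scores))  # 12 3
```

For the GPS-only setting mentioned above, the same classifier would take `gps_features` alone as input.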

Usage: Pre-Trained Location Encoder

import torch
from geoclip import LocationEncoder

gps_encoder = LocationEncoder()

gps_data = torch.Tensor([[40.7128, -74.0060], [34.0522, -118.2437]])  # NYC and LA in lat, lon
gps_embeddings = gps_encoder(gps_data)
print(gps_embeddings.shape)  # (2, 512)

Acknowledgments

This project incorporates code from Joshua M. Long's Random Fourier Features Pytorch. For the original source, see that project's repository.
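For readers unfamiliar with random Fourier features, the following plain-Python sketch shows the general technique: project the input through a fixed random Gaussian matrix, then take the cosine and sine of the result. It is not the code from the acknowledged project, and this README does not describe how GeoCLIP applies it. The function names, the `sigma` value, and the coordinate scaling are assumptions made for illustration.

```python
import math
import random


def make_projection(num_features, input_dim, sigma, seed=0):
    """Fixed random matrix with entries drawn from N(0, sigma^2)."""
    rng = random.Random(seed)
    return [[rng.gauss(0.0, sigma) for _ in range(input_dim)] for _ in range(num_features)]


def fourier_features(coords, projection):
    """Map coords to [cos(2*pi*B*v), sin(2*pi*B*v)] for projection matrix B."""
    angles = [2 * math.pi * sum(b * x for b, x in zip(row, coords)) for row in projection]
    return [math.cos(a) for a in angles] + [math.sin(a) for a in angles]


# Hypothetical settings: 4 random frequencies over a 2-D (lat, lon) input.
projection = make_projection(num_features=4, input_dim=2, sigma=1.0)

# Hypothetical scaling of latitude and longitude to roughly [-1, 1].
nyc = [40.7128 / 90.0, -74.0060 / 180.0]
features = fourier_features(nyc, projection)
print(len(features))  # 8
```

Because the projection is fixed after sampling, the same coordinates always map to the same feature vector, and larger `sigma` values produce higher-frequency features.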

Citation

If you find GeoCLIP beneficial for your research, please consider citing us with the following BibTeX entry:

@inproceedings{geoclip,
  title={GeoCLIP: Clip-Inspired Alignment between Locations and Images for Effective Worldwide Geo-localization},
  author={Vivanco, Vicente and Nayak, Gaurav Kumar and Shah, Mubarak},
  booktitle={Advances in Neural Information Processing Systems},
  year={2023}
}

arxiv.org/abs/2309.16020
