README.md (8 additions, 0 deletions)
@@ -320,3 +320,11 @@ lb = chug.create_loader(
ii = iter(lb)
sample = next(ii)
```
+
+## Acknowledgement
+
+`chug` evolved from the `webdataset` data pipeline used successfully in the [OpenCLIP](https://github.com/mlfoundations/open_clip) project. Thanks to all the contributors to that project. Future work will likely involve closing the loop and leveraging `chug` in OpenCLIP for increased capability.
+
+The image/document augmentations in `chug` draw on several external projects. Our document-oriented `doc_better` torchvision augmentations are influenced by `nougat`, and `doc_nougat` is a direct adaptation of the [`albumentations`](https://albumentations.ai/) + `cv2` document pipeline in [`nougat`](https://github.com/facebookresearch/nougat). Several image augmentations leverage existing work in the `timm` library.
+
+Also, big thanks to the maintainers of [`webdataset`](https://github.com/webdataset/webdataset) and Hugging Face [`datasets`](https://github.com/huggingface/datasets).