This package provides infrastructure to make text datasets availablewithin R, even when they are too large to store within an R package orare licensed in such a way that prevents them from being included inOSS-licensed packages.
Do you want to add a new dataset to the textdata package?
prefix_*.R in theR/folder, where* is the name of the dataset. Supportedprefixes includedataset_lexicon_download_*(),process_*() anddataset_*().download_*() function should take 1 argument namedfolder_path. It has 2 tasks, first it should check if thefile is already downloaded. If it is already downloaded it should returninvisible(). If the file isn’t at the path it shoulddownload the file to said path.process_*() function should take 2 arguments,folder_path andname_path.folder_path denotes the the path to the file returned bydownload_* andname_path is the path to wherethe polished data should live. Main point ofprocess_*() isto turn the downloaded file into a .rds file containing a tidytibble.dataset_*() function should wrap theload_dataset().process_*() function to the named listprocess_functions in the file process_functions.R.download_*() function to the named listdownload_functions in the file download_functions.R.print_info list in the info.R file.dataset_*.R to the@include tags indownload_functions.R.README.Rmd._pkgdown.yml.NEWS.md file.What are the guidelines for adding datasets?
word instead ofwords for columnnames.For datasets that comes with a testing and training dataset. Let theuser pick which one to retrieve with asplit argumentsimilar to howdataset_ag_news() is doing.