Load WARC (Web ARChive) files into Apache Spark using 'sparklyr'. This makes it possible to read files from the Common Crawl project <http://commoncrawl.org/>.
| Version: | 0.1.6 |
| Imports: | DBI, sparklyr, Rcpp |
| LinkingTo: | Rcpp |
| Published: | 2022-01-11 |
| DOI: | 10.32614/CRAN.package.sparkwarc |
| Author: | Javier Luraschi [aut], Yitao Li |
| Maintainer: | Edgar Ruiz <edgar at rstudio.com> |
| BugReports: | https://github.com/r-spark/sparkwarc |
| License: | Apache License 2.0 |
| NeedsCompilation: | yes |
| SystemRequirements: | C++11 |
| Materials: | README |
| CRAN checks: | sparkwarc results [issues need fixing before 2025-12-18] |
| Reference manual: | sparkwarc.html, sparkwarc.pdf |
| Package source: | sparkwarc_0.1.6.tar.gz |
| Windows binaries: | r-devel: sparkwarc_0.1.6.zip, r-release: sparkwarc_0.1.6.zip, r-oldrel: sparkwarc_0.1.6.zip |
| macOS binaries: | r-release (arm64): sparkwarc_0.1.6.tgz, r-oldrel (arm64): sparkwarc_0.1.6.tgz, r-release (x86_64): sparkwarc_0.1.6.tgz, r-oldrel (x86_64): sparkwarc_0.1.6.tgz |
| Old sources: | sparkwarc archive |
Please use the canonical form https://CRAN.R-project.org/package=sparkwarc to link to this page.