Provides comprehensive tools for extracting and analyzing scientific content from PDF documents, including citation extraction, reference matching, text analysis, and bibliometric indicators. Supports multi-column PDF layouts, 'CrossRef' API <https://www.crossref.org/documentation/retrieve-metadata/rest-api/> integration, and advanced citation parsing.
| Version: | 0.2.1 |
| Depends: | R (≥ 4.1.0) |
| Imports: | base64enc (≥ 0.1-3),dplyr (≥ 1.1.0),httr2 (≥ 0.2.0),igraph,jsonlite (≥ 2.0.0),magrittr (≥ 2.0.4),openalexR (≥2.0.2),pdftools (≥ 3.6.0),purrr (≥ 1.1.0),stringr (≥1.5.2),tibble (≥ 3.3.0),tidyr (≥ 1.3.0),tidytext (≥0.4.3),visNetwork (≥ 2.1.4) |
| Suggests: | knitr,plotly,RColorBrewer,rmarkdown,scales,stringdist,testthat (≥ 3.0.0),mockery |
| Published: | 2025-12-12 |
| DOI: | 10.32614/CRAN.package.contentanalysis |
| Author: | Massimo Aria [cre, aut, cph], Corrado Cuccurullo [aut] |
| Maintainer: | Massimo Aria <aria at unina.it> |
| BugReports: | https://github.com/massimoaria/contentanalysis/issues |
| License: | GPL (≥ 3) |
| URL: | https://github.com/massimoaria/contentanalysis, |
| NeedsCompilation: | no |
| Materials: | README,NEWS |
| CRAN checks: | contentanalysis results |