Movatterモバイル変換

The goal of mrap is to provide wrapper functions to reduce the user’seffort in writing machine-readable data with thedtreg package. Theset of all-in-one wrappers will cover functions fromstatsand other well-known packages. These are very easy to use, seeExample III: an all-in-one wrapper for anova. Thepackage also contains wrappers for analytical schemata used byTIB Knowledge Loom. Thisvignette discusses in detail how to apply such a wrapper to write theresults of your data analysis as JSON-LD in five steps:

2. Check arguments

The wrappers are very easy in use, when the required arguments arespecified correctly, which is crucial for transparent reporting ofresults. This section explains how to do it.

2.1. Code string

Argumentcode_string should be a string (in R, acharacter vector). The argument cannot be omitted; please indicate “N/A”if this information is not provided. InExampleI, we use the following codestring:'stats::t.test(setosa, virginica, var.equal = FALSE)'

Package name

To specify the name of the package in the code is always a goodpractice. In mrap, we made it a requirement, and you will get an errormessage if thecode_string does not containpackage::function. In most cases, it is the beginning ofthe string, but we allow for generic method summary, in this case it issummary(package::function(formula)). For base R, pleaseindicatebase::.

Data name

Your data can be a string (URL), a named list, or a data frame (seeInput data below). In case of a string, you can addthe data name manually (seeModify the instance);if your data is a named list, as inExample I,mrap easily extracts the elements’ names. In these cases, thecode_string does not play a role, and the data name is notspecified in it. However, if your data is a single data frame, and youwant mrap to extract its name from thecode_string, pleaseindicate it as'data = dataset_name'(e.g.,'data = iris'), although most R packages allow for merelydataset_name.

Target variable(s)

Our wrappers extract the name of a target variable from thecode_string if the variable is before the~sign in the formula:

"package::function(Petal.Length ~ Species), data = iris""package::function(iris$Petal.Length ~ iris$Species), data = iris"

We also allow for a few target variables in special cases such asMANOVA:

"package::function(cbind(Petal.Length, Petal.Width) ~ Species), data = iris"

Alternatively, a target variable can be explicitly specified in twoor more vectors:

"package::function(setosa$Petal.Length, virginica$Petal.Length)"

In the following case we cannot extract the name, and you can add thetarget label manually to the instance:

"package::function(one_vector, another_vector)"

You will get a warning reminding to do it.

Level variable(s)

Incode_string, level variable is recognized by ourwrappers in “x | level” or “x || level” syntax:

"lme4::lmer(Reaction ~ Days + (Days | Subject), data = sleepstudy)""lme4::lmer(Reaction ~ Days + (Days || Subject), data = sleepstudy)"

A level can be written more than once in a formula, in this case mrapalso recognizes it:

"lme4::lmer(math ~ homework + (homework | schid) + (class_size | schid))"

More than one level is possible, mrap will capture all levelnames:

"lme4::lmer(math ~ homework + (1 | schid) + (1 | classid))"

If we cannot extract the name, you will get a warning reminding youto add the level label manually to the instance.

2.2. Input data

Argumentinput_data can be:

a string, which is either a file name or a URL

is.character("ABC")

a dataframe

is.data.frame(iris)

a named list for a few vectors or data frames:

species_list<-list("setosa"= setosa,"virginica"= virginica)# check it is a listis.list(species_list)# check that the list is namednames(species_list)

Please be sure that the argument is one of these three types. Youwill get an error message if a type is wrong (for instance, a listinstead of a named list).

2.3. Test results or named list results

Argumenttest_results can be either a data frame or alist of data frames. You can check whether you are writing down theargument correctly. For a data frame:

is.data.frame(iris)

For a list of data frames:

# assume you have a few data frames in a listiris_new<- iris[,-1]my_results<-list(iris, iris_new)# check each of them in a loopfor (elementin my_results) {print(is.data.frame(element))}

Argumentnamed_list_results is only used for thealgorithm_evaluation schema.

Example I: group comparison

Let us assume you conducted a t-test on the Iris data comparing petallength in setosa and virginica species:

data(iris)library(dplyr)setosa<- iris|>  dplyr::filter(Species=="setosa")|>  dplyr::select(Petal.Length)virginica<- iris|>  dplyr::filter(Species=="virginica")|>  dplyr::select(Petal.Length)tt<- stats::t.test(setosa, virginica,var.equal =FALSE)

The results of the test should be presented as a data frame:

df_results<-data.frame(t.statistic = tt$statistic,df = tt$parameter,p.value = tt$p.value)rownames(df_results)<-"value"

Now, let us follow the steps described above to create agroup_comparison instance, modify it, include indata_analysis instance, and write it as a JSON-LD file:

inst_gc<-  mrap::group_comparison("stats::t.test(setosa, virginica, var.equal = FALSE)",list("setosa"= setosa,"virginica"= virginica),    df_results  )inst_gc$targets<-"Petal.Length"inst_da<- mrap::data_analysis(inst_gc)json<- mrap::to_jsonld(inst_da)write(json,"data-analysis-1.json")

Example III: an all-in-one wrapper for anova

Currently, mrap contains an all-in-one wrapper forstats::aov function, and more such wrappers will be addedin the future. Let us assume you are currently usingstats::aov for conducting your ANOVA tests:

data(iris)anova_stats_results<- stats::aov(Petal.Length~ Species,data = iris)

The all-in-one wrapper is as easy in use as the originalfunction:

aov<- mrap::stats_aov(Petal.Length~ Species,data = iris)

The wrapper returns a list, the first element of which is theresulting object from the original function:

anova_mrap_results<- aov$anova

The second element is agroup_comparison instance:

inst_gc_anova<- aov$dtreg_object

The instance includes all required information. Of course, there isstill a possibility to modify it, e.g., to add a label:

inst_gc_anova$label<-"my_fancy_results"

This can be further included in thedata_analysis instance and written asJSON-LD file as explained above.

Movatterモバイル変換

Introduction to mrap

1. Select a wrapper

2. Check arguments

2.1. Code string

Package name

Data name

Target variable(s)

Level variable(s)

2.2. Input data

2.3. Test results or named list results

3. Create an instance

4. Modify the instance

5. Include the instance into the overarching`data_analysis` instance

6. Write JSON-LD

Example I: group comparison

Example II: algorithm evaluation

Example III: an all-in-one wrapper for anova

Movatterモバイル変換

Introduction to mrap

1. Select a wrapper

2. Check arguments

2.1. Code string

Package name

Data name

Target variable(s)

Level variable(s)

2.2. Input data

2.3. Test results or named list results

3. Create an instance

4. Modify the instance

5. Include the instance into the overarchingdata_analysis instance

6. Write JSON-LD

Example I: group comparison

Example II: algorithm evaluation

Example III: an all-in-one wrapper for anova

5. Include the instance into the overarching`data_analysis` instance