ShinyItemAnalysis

Description

ShinyItemAnalysis provides analysis of educational tests (such as admission tests) and their items including:

Exploration of total and standard scores on Summary page.
Correlation structure and predictive validity analysis on Validity page.
Item and distractor analysis on Item analysis page.
Item analysis by logistic models on Regression page.
Item analysis by item response theory models on IRT models page.
Differential item functioning (DIF) and differential distractor functioning (DDF) methods on DIF/Fairness page.

This application is based on the free statistical software R and its shiny package.

For all graphical outputs a download button is provided. Moreover, on Reports page HTML or PDF report can be created. Additionaly, all application outputs are complemented by selected R code hence the similar analysis can be run and modified in R.

Data

For demonstration purposes, by default, 20-item dataset GMAT from R difNLR package is used. Other three datasets are available: GMAT2 and MSAT-B from difNLR package and Medical 100 from ShinyItemAnalysis package. You can change the dataset (and try your own one) on page Data.

Availability

Application can be downloaded as R package from CRAN. It is also available online at Czech Academy of Sciences . Charles University or shinyapps.io .

Version

Current version of ShinyItemAnalysis available on CRAN is 1.2.6. Version available online is 1.2.6. The newest development version available on GitHub is 1.2.6.
See also older versions: 0.1.0, 0.2.0, 1.0.0, 1.1.0.

Authors and contributors

Patricia
Martinkova

Adela
Drabinova

Ondrej
Leder

Jakub
Houdek

Lubomir
Stepanek

List of packages used

library(corrplot)
library(CTT)
library(data.table)
library(deltaPlotR)
library(DT)
library(difNLR)
library(difR)
library(ggplot2)
library(grid)

library(gridExtra)
library(knitr)
library(latticeExtra)
library(ltm)
library(mirt)
library(moments)
library(msm)
library(nnet)
library(plotly)

library(psych)
library(psychometric)
library(reshape2)
library(rmarkdown)
library(shiny)
library(shinyjs)
library(stringr)
library(WrightMap)
library(xtable)

References

To cite package ShinyItemAnalysis in publications please use:

Martinkova P., Drabinova A., Leder O., & Houdek J. (2018). ShinyItemAnalysis: Test and item analysis via shiny. R package version 1.2.6. https://CRAN.R-project.org/package=ShinyItemAnalysis

Martinkova, P., Drabinova, A., & Houdek, J. (2017). ShinyItemAnalysis: Analyza prijimacich a jinych znalostnich ci psychologickych testu [ShinyItemAnalysis: Analyzing admission and other educational and psychological tests]. TESTFORUM, 6(9), 16-35. doi:10.5817/TF2017-9-129

Bug reports

If you discover a problem with this application please contact the project maintainer at martinkova(at)cs.cas.cz or use GitHub.

Acknowledgments

Project was supported by grant funded by Czech Science foundation under number GJ15-15856Y.

License

This program is free software and you can redistribute it and or modify it under the terms of the GNU GPL 3 as published by the Free Software Foundation. This program is distributed in the hope that it will be useful, but without any warranty; without even the implied warranty of merchantability of fitness for a particular purpose.

Data
Data exploration

Data

Training datasets

For demonstration purposes, 20-item dataset GMAT and dataset GMATkey from R difNLR package are used. On this page, you may select one of four datasets offered from difNLR and ShinyItemAnalysis packages or you may upload your own dataset (see below). To return to demonstration dataset, refresh this page in your browser (F5) .

Used dataset GMAT (Martinkova, et al., 2017) is generated based on parameters of real Graduate Management Admission Test (GMAT) data set (Kingston et al., 1985). However, first two items were generated to function differently in uniform and non-uniform way respectively. The data set represents responses of 2,000 subjects (1,000 males, 1,000 females) to multiple-choice test of 20 items. The distribution of total scores is the same for both groups. See Martinkova, et al. (2017) for further discussion.

Dataset GMAT2 (Drabinova & Martinkova, 2016) is also generated based on parameters of GMAT (Kingston et al., 1985) from R difNLR package . Again, first two items were generated to function differently in uniform and non-uniform way respectively. The data set represents responses of 1,000 subjects (500 males, 500 females) to multiple-choice test of 20 items.

Dataset MSAT-B (Drabinova & Martinkova, 2017) is a subset of real Medical School Admission Test in Biology in Czech Republic. The data set represents responses of 1,407 subjects (484 males, 923 females) to multiple-choice test of 20 items. First item was previously detected as functioning differently. For more details of item selection see Drabinova and Martinkova (2017). Dataset can be found in R difNLR package.

Dataset Medical 100 is a real data set of admission test to medical school from R ShinyItemAnalysis package. The data set represents responses of 2,392 subjects (750 males, 1,633 females and 9 subjects without gender specification) to multiple-choice test of 100 items.

Select dataset

Upload your own datasets

Main data file should contain responses of individual students (rows) to given items (columns). Header may contain item names, no row names should be included. If responses are in unscored ABCD format, the key provides correct response for each item. If responses are scored 0-1, key is vector of 1s.

Group is 0-1 vector, where 0 represents reference group and 1 represents focal group. Its length need to be the same as number of individual students in main dataset. If the group is not provided then it wont be possible to run DIF and DDF detection procedures on DIF/Fairness page.

Criterion variable is either discrete or continuous vector (e.g. future study success or future GPA in case of admission tests) which should be predicted by the measurement. Again, its length needs to be the same as number of individual students in the main dataset. If the criterion variable is not provided then it wont be possible to run validity analysis in Predictive validity section on Validity page.

In all data sets header should be either included or excluded. Columns of dataset are by default renamed to Item and number of particular column. If you want to keep your own names, check box Keep items names below. Missing values in scored dataset are by default evaluated as 0. If you want to keep them as missing, check box Keep missing values below.

Choose data (csv file)

Browse...

Choose key (csv file)

Browse...

Choose groups for DIF (optional)

Browse...

Choose criterion variable (optional)

Browse...

Data specification

Header

Keep items names

Keep missing values

Separator

Comma

Semicolon

Tab

Quote

None

Double Quote

Single Quote

Data exploration

Here you can explore uploaded dataset. Rendering of tables can take some time.

Main dataset

Key (correct answers)

Scored test

Group vector

Criterion variable vector

Analysis of total scores

Summary table

Table below summarizes basic characteristics of total scores including minimum and maximum, mean, median, standard deviation, skewness and kurtosis. The kurtosis here is estimated by sample kurtosis $\frac{m_4}{m_2^2}$, where $m_4$ is the fourth central moment and $m_2$ is the second central moment. The skewness is estimated by sample skewness $\frac{m_3}{m_2^{3/2}}$, where $m_3$ is the third central moment. The kurtosis for normally distributed scores is near the value of 3 and the skewness is near the value of 0.

Histogram of total score

Cut-score

For selected cut-score, blue part of histogram shows students with total score above the cut-score, grey column shows students with total score equal to the cut-score and red part of histogram shows students below the cut-score.

Description

Data

Availability

Version

Authors and contributors

List of packages used

References

Bug reports

Acknowledgments

License

Data

Training datasets

Upload your own datasets

Data specification

Data exploration

Main dataset

Key (correct answers)

Scored test

Group vector

Criterion variable vector

Analysis of total scores

Summary table

Histogram of total score

Selected R code

Standard scores

Table by score

Selected R code

Correlation structure

Polychoric correlation heat map

Scree plot

Selected R code

Predictive validity

Descriptive plots of criterion variable on total score

Correlation of criterion variable and total score

Selected R code

Predictive validity

Distractor plot

Correlation of criterion variable and scored item

Selected R code

Traditional item analysis

Item difficulty/discrimination plot

Cronbach's alpha

Traditional item analysis table

Selected R code

Distractor analysis

Distractors plot

Table with counts

Table with proportions

Barplot of item response patterns

Histogram of total scores

Table of total scores by groups

Selected R code

Logistic regression on total scores

Plot with estimated logistic curve

Equation

Table of parameters

Selected R code

Logistic regression on standardized total scores

Plot with estimated logistic curve

Equation

Table of parameters

Selected R code

Logistic regression on standardized total scores with IRT parameterization

Plot with estimated logistic curve

Equation

Table of parameters

Selected R code

Nonlinear regression on standardized total scores with IRT parameterization

Plot with estimated nonlinear curve

Equation

Table of parameters

Selected R code

Logistic regression model selection

Table of comparison statistics

Selected R code

Multinomial regression on standardized total scores

Plot with estimated curves of multinomial regression

Equation

Table of parameters

Selected R code