ShinyItemAnalysis - TEST AND ITEM ANALYSIS

Description

ShinyItemAnalysis provides analysis of educational tests (such as admission tests) and their items, including:

Exploration of total and standard scores on the Summary page.
Item and distractor analysis on the Traditional Analysis page.
Item analysis by logistic models on the Regression page.
Item analysis by item response theory models on the IRT models page.
Differential item functioning (DIF) and differential distractor functioning (DDF) methods on the DIF/Fairness page.

This application is based on the free statistical software R and its Shiny package.

A download button is provided for all graphical outputs. Moreover, an HTML or PDF report can be created on the Reports page. Additionally, all application outputs are complemented by selected R code, so similar analyses can be run and modified in R.

You can also download the ShinyItemAnalysis package from CRAN to use it offline or to run it faster.
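A minimal sketch of running the application locally is given here. The helper name launch_sia is hypothetical and only wraps the steps for illustration. The sketch assumes the package's launcher is startShinyItemAnalysis(); consult help(package = "ShinyItemAnalysis") if your installed version differs.

```r
# Hypothetical helper (not part of the package): install from CRAN if needed,
# then launch the application locally.
launch_sia <- function() {
  if (!requireNamespace("ShinyItemAnalysis", quietly = TRUE)) {
    install.packages("ShinyItemAnalysis")  # one-time download, needs internet
  }
  # Assumed launcher function; it opens the app in the default browser.
  ShinyItemAnalysis::startShinyItemAnalysis()
}

# Call launch_sia() from an interactive R session to start the app.
```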

Data

For demonstration purposes, the 20-item dataset GMAT from the R difNLR package is used by default. Three other datasets are available: GMAT2 and Medical 20 DIF from the difNLR package, and Medical 100 from the ShinyItemAnalysis package. You can change the dataset (and try your own) on the Data page.

Version

The current version of ShinyItemAnalysis is 1.1.0.

See also older versions: 0.1.0, 0.2.0, 1.0.0

List of packages used

library(corrplot)
library(CTT)
library(deltaPlotR)
library(DT)
library(difNLR)
library(difR)
library(ggplot2)
library(grid)
library(gridExtra)
library(latticeExtra)
library(ltm)
library(mirt)
library(moments)
library(msm)
library(nnet)
library(psych)
library(psychometric)
library(reshape2)
library(rmarkdown)
library(shiny)
library(shinyjs)
library(stringr)
library(WrightMap)
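Before running the selected R code offline, it may help to check which of the packages listed above are already installed. The following is a sketch only; the variable names are illustrative.

```r
# Packages listed above, in the same order.
pkgs <- c(
  "corrplot", "CTT", "deltaPlotR", "DT", "difNLR", "difR", "ggplot2",
  "grid", "gridExtra", "latticeExtra", "ltm", "mirt", "moments", "msm",
  "nnet", "psych", "psychometric", "reshape2", "rmarkdown", "shiny",
  "shinyjs", "stringr", "WrightMap"
)

# Compare against the packages installed in the current library paths.
is_installed <- pkgs %in% rownames(installed.packages())
missing_pkgs <- pkgs[!is_installed]

# Names of packages that still need install.packages(), if any.
print(missing_pkgs)
```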

Authors

Patricia Martinkova, Institute of Computer Science, Czech Academy of Sciences

Adela Drabinova

Ondrej Leder

Jakub Houdek

Bug reports

If you discover a problem with this application, please contact the project maintainer at martinkova(at)cs.cas.cz or use GitHub.

Acknowledgments

This project was supported by the Czech Science Foundation under grant number GJ15-15856Y.

License

This program is free software; you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation; either version 3 of the License, or (at your option) any later version.

This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.

Data

For demonstration purposes, the 20-item dataset GMAT and the dataset GMATkey from the R difNLR package are used. On this page, you may select one of four datasets offered by the difNLR and ShinyItemAnalysis packages, or you may upload your own dataset (see below). To return to the demonstration dataset, refresh this page in your browser (F5).

The GMAT dataset is generated based on the parameters of a real Graduate Management Admission Test (GMAT) data set (Kingston et al., 1985). However, the first two items were generated to function differently, in a uniform and a non-uniform way, respectively. The data set represents responses of 2,000 subjects (1,000 males, 1,000 females) to a multiple-choice test of 20 items. The distribution of total scores is the same for both groups.

The GMAT2 dataset, from the R difNLR package, is also generated based on the parameters of GMAT (Kingston et al., 1985). Again, the first two items were generated to function differently, in a uniform and a non-uniform way, respectively. The data set represents responses of 1,000 subjects (500 males, 500 females) to a multiple-choice test of 20 items.

The Medical 20 DIF dataset, from the R difNLR package, is a subset of a real admission test to medical school. The first item was previously detected as functioning differently. The data set represents responses of 1,407 subjects (484 males, 923 females) to a multiple-choice test of 20 items. For more details on item selection, see Drabinova & Martinkova (2016).

The Medical 100 dataset, from the R ShinyItemAnalysis package, is a real data set from an admission test to medical school. The data set represents responses of 3,204 subjects to a multiple-choice test of 100 items. There is no group membership variable in the data set, hence it is not possible to run DIF or DDF detection procedures.
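The difference between items functioning differently in a uniform and in a non-uniform way, as in the first two items of GMAT and GMAT2, can be illustrated with a small sketch. It assumes a two-parameter logistic item response function, and the parameter values are illustrative only. They are not the values used to generate the datasets.

```r
# Two-parameter logistic item response function: probability of a correct
# answer at ability theta, with discrimination a and difficulty b.
irf <- function(theta, a, b) plogis(a * (theta - b))

theta <- seq(-3, 3, by = 0.5)

# Uniform DIF: same discrimination, shifted difficulty for the focal group.
p_ref_uniform <- irf(theta, a = 1, b = 0)
p_foc_uniform <- irf(theta, a = 1, b = 0.5)

# Non-uniform DIF: different discrimination, so the two curves cross.
p_ref_nonuni <- irf(theta, a = 1, b = 0)
p_foc_nonuni <- irf(theta, a = 2, b = 0)

# Difference between reference and focal group at each ability level.
diff_uniform <- p_ref_uniform - p_foc_uniform
diff_nonuni <- p_ref_nonuni - p_foc_nonuni

print(round(cbind(theta, diff_uniform, diff_nonuni), 3))
```

In this sketch the uniform difference keeps the same sign at every ability level, while the non-uniform difference changes sign, so which group is favoured depends on ability.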

Select dataset

Upload your own datasets

The main dataset should contain responses of individual students (rows) to given items (columns). The header may contain item names; no row names should be included. If responses are in unscored ABCD format, the key provides the correct response for each item. If responses are scored 0-1, the key is a vector of 1s. The group is a 0-1 vector, where 0 represents the reference group and 1 represents the focal group. Its length needs to be the same as the number of individual students in the main dataset. If the group is not provided, it won't be possible to run DIF and DDF detection procedures. The header should be either included in all data sets or excluded from all of them.
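The expected layout can be sketched with a toy example in base R. All object names and values below are made up for illustration, and the scoring step is one possible way to convert ABCD responses to 0-1 scores, not necessarily the one used by the application.

```r
# Toy main dataset: 4 students (rows) by 3 items (columns), unscored ABCD format.
responses <- data.frame(
  Item1 = c("A", "B", "A", "D"),
  Item2 = c("C", "C", "B", "C"),
  Item3 = c("D", "A", "D", "D"),
  stringsAsFactors = FALSE
)
key <- c("A", "C", "D")   # correct response for each item
group <- c(0, 0, 1, 1)    # 0 = reference group, 1 = focal group

# Basic consistency checks on the three inputs.
stopifnot(
  length(key) == ncol(responses),
  length(group) == nrow(responses),
  all(group %in% c(0, 1))
)

# Score each response against the key of its column: 1 = correct, 0 = incorrect.
scored <- sweep(as.matrix(responses), 2, key, FUN = "==") * 1
total <- rowSums(scored)

# Round trip through a csv file with a header, comma separator and double quotes.
path <- tempfile(fileext = ".csv")
write.csv(responses, path, row.names = FALSE)
uploaded <- read.csv(path, header = TRUE, sep = ",", quote = "\"",
                     stringsAsFactors = FALSE)
```

For data that are already scored 0-1, the same checks apply with key set to rep(1, ncol(responses)).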

Choose data (csv file)

Choose key (csv file)

Choose groups for DIF (optional)

Data specification

Header

Separator: Comma, Semicolon, Tab

Quote: None, Double Quote, Single Quote

Data check

Key (correct answers)

Scored test

Group vector

Analysis of total scores

Summary table

Histogram of total score

Cut-Score

For the selected cut-score, the blue part of the histogram shows students with a total score above the cut-score, the grey column shows students with a total score equal to the cut-score, and the red part of the histogram shows students with a total score below the cut-score.
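The colouring rule can be sketched in base R as follows. The total scores are simulated and the cut-score of 12 is an arbitrary assumption; this is not the application's own plotting code.

```r
set.seed(1)
# Simulated total scores on a 20-item test (illustrative, not GMAT data).
total_score <- rbinom(200, size = 20, prob = 0.6)
cut_score <- 12  # assumed cut-score for the example

# Classify each student relative to the cut-score.
status <- ifelse(total_score > cut_score, "above",
                 ifelse(total_score == cut_score, "equal", "below"))
status <- factor(status, levels = c("below", "equal", "above"))
print(table(status))

# One bar per possible score, coloured by its position relative to the cut-score.
scores <- 0:20
counts <- table(factor(total_score, levels = scores))
bar_col <- ifelse(scores > cut_score, "blue",
                  ifelse(scores == cut_score, "grey", "red"))
barplot(counts, col = bar_col,
        xlab = "Total score", ylab = "Number of students")
```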

Description

Data

Version

List of packages used

Authors

Bug reports

Acknowledgments

License

Data

Upload your own datasets

Data specification

Data check

Key (correct answers)

Scored test

Group vector

Analysis of total scores

Summary table

Histogram of total score

Selected R code

Standard scores

Table by score

Selected R code

Correlation structure

Polychoric correlation heat map

Scree plot

Selected R code

Traditional item analysis

Item difficulty/discrimination graph

Cronbach's alpha

Traditional item analysis table

Selected R code

Distractor analysis

Distractors plot

Table with counts

Table with proportions

Histogram of total scores

Table of total scores by groups

Selected R code

Logistic regression on total scores

Plot with estimated logistic curve

Equation

Table of parameters

Selected R code

Logistic regression on standardized total scores

Plot with estimated logistic curve

Equation

Table of parameters

Selected R code

Logistic regression on standardized total scores with IRT parameterization

Plot with estimated logistic curve

Equation

Table of parameters

Selected R code

Nonlinear regression on standardized total scores

Plot with estimated nonlinear curve

Equation

Table of parameters

Selected R code

Logistic regression model selection

Table of comparison statistics

Selected R code

Multinomial regression on standardized total scores

Plot with estimated curves of multinomial regression

Equation

Table of parameters

Selected R code

One parameter Item Response Theory model

Equation

Item characteristic curves

Item information curves

Test information function

Table of parameters

Scatter plot of factor scores and standardized total scores

Selected R code

Two parameter Item Response Theory model

Equation

Item characteristic curves

Item information curves

Test information function

Table of parameters