# Data Explorer

This component is used by students to briefly summarize data as it is presented and perform various statistical tests. This component may be used as a standalone feature in a lesson. If you wish to allow students the ability to import a `.csv`

file, set the `'data`

option to be `false`

.

#### Example

In creating a data explorer, one needs a *.json dataset. This step may be accomplished by including the following in the header of a *.isle file:

```
require:
dataName: "./dataName.json"
```

Consider a dataset called "heartdisease" with the following variables:

**Gender**:*Categorical***Drugs**:*Categorical***Complications**:*Categorical***Cost**:*Continuous***Age**:*Continuous***Interventions**:*Continuous***ERVisit**:*Continuous***Comorbidities**:*Continuous***Duration**:*Continuous*

We will include the explorer with the following code:

```
<DataExplorer
id="heartdisease"
data={heartdisease}
categorical={[ 'Gender', 'Drugs', 'Complications' ]}
continuous={[ 'Cost', 'Age', 'Interventions', 'ERVisit', 'Comorbidities', 'Duration' ]}
tests={[]}
/>
```

#### Options

**data**: The name of the .json dataset that will be used in the explorer. If you wish to allow students to enter in a`.csv`

file, set to`{false}`

.**onSelect**: A function to be passed.**tabs**: An array containing additional tabs to be used in the display interface. Each entry in the array will include a string 'title' and content. For example, if one wishes to add an additional tab with a text content of "A Note", the following code would be used:`tabs={[ {title: 'Note', content: <div>A Note</div>} ]}`

**Questions**: Questions to be asked of the dataset. May also be embedded in a question object independent of the data explorer apparatus.**transformer**: A bool value indicating whether a transformation is to be applied to a variable.**statistics**: Relevant statistical operations that can be applied to data, such as "mean" or "correlation". The following operations are enabled by default: -- Mean -- Median -- Min -- Max -- Interquartile Range -- Standard Deviation -- Variance -- Correlation**plots**: Specification of which plots may be produced by the explorer. The following may be used and are enabled by default: -- Bar Chart -- Pie Chart -- Histogram -- Box Plot -- Scatterplot -- Heat Map -- Mosaic Plot**tables**: Two-way tables that may be compiled from the data. One maty choose either (or both) of the following options: Frequency Table, Contingency Table**tests**: Statistical tests that may be applied to the data. Must be some combination of the following, all of which are enabled by default: -- One-Sample Mean Test -- One-Sample Proportion Test -- Two-Sample Mean Test -- Two-Sample Proportion Test -- Correlation Test -- Chi-squared Independence Test -- One-Way ANOVA**models**: Modelling paradigms to be used in creating models from the variables presented. Currently this functionality is limited to "Simple Linear Regression" (which is enabled by default), yet will be extended in the future.**categorical**: An array of string variables that represent the categorical variables in a dataset.**continuous**: An array of string variables that represent the continuous variables in a dataset.**distributions**: Distributions that may be used in calculating probabilities. This functionality exists independently of the dataset provided. Currently limited to normal, uniform and exponential distributions.

#### Default Options

```
DataExplorer.defaultProps = {
data: {},
onSelect(){},
tabs: [],
questions: null,
transformer: false,
statistics: [
'Mean',
'Median',
'Min',
'Max',
'Range',
'Interquartile Range',
'Standard Deviation',
'Variance',
'Correlation'
],
plots: [
'Bar Chart',
'Pie Chart',
'Histogram',
'Box Plot',
'Scatterplot',
'Heat Map',
'Mosaic Plot'
],
tables: [
'Frequency Table',
'Contingency Table'
],
tests: [
'One-Sample Mean Test',
'One-Sample Proportion Test',
'Two-Sample Mean Test',
'Two-Sample Proportion Test',
'Correlation Test',
'Chi-squared Independence Test',
'One-Way ANOVA'
],
models: [
'Simple Linear Regression'
],
categorical: [],
continuous: [],
distributions: [ 'Normal', 'Uniform', 'Exponential' ]
};
```