Welcome to the ISSS608 G5 User Guide. This user guide is designed to provide documentation for people who will use our shiny application
This tab show you the Choropleth map to show the distribution in US
And if you click the botton show the defination on variables

It will pop up our variable dictionary

Use this tab to do exploratory data analysis.
Users select the factor of interest
Observe how the graphs are plotted:
Select a demographic of interest
Observe how the proportions on the graph change
Form the analysis on building the model.There are also two sub tabs, Model Insight and Model Visualization. The data set is also inserted to display by the control button

Click Side bar above to select the variable you want to view

Then the coordinate bar chart will show like

And the left side, variable explanation will also change.

First is to choose the panel that you want to see

Then in the first panel, we have odd ratio plot and chiq plot.
Also, if you want to see the data set that we use to build the model, you can click the Show data table button.

Then in the second panel, we have side bar to visual more graphs, you can choose diagnose ill fitting plot or KS Chart observation. And we add the defination of each graph in the middle.

50% training /50% test,
60% training /40% test
70% training /30% test
80% training /20% test

2.Next, the user can select the number of k. The parameter called k that refers to the number of groups that a given data sample is to be split into. Of the k subsamples, a single subsample is retained as the validation data for testing the model, and the remaining k − 1 subsamples are used as training data. The cross-validation process is then repeated k times, with each of the k subsamples used exactly once as the validation data.



50% training /50% test,
60% training /40% test
70% training /30% test
80% training /20% test
The parameter called k that refers to the number of groups that a given data sample is to be split into. Of the k subsamples, a single subsample is retained as the validation data for testing the model, and the remaining k − 1 subsamples are used as training data. The cross-validation process is then repeated k times, with each of the k subsamples used exactly once as the validation data.
The user will choose the first machine learning model to be used. The user can scroll through the list or type in the key words in the list.
Once the first model is chosen, the user can click on “Submit” to plot the Variable Importance chart for the model.
