Remote Data Mining And Management Job In Data Science And Analytics

Need an R expert to analyze a dataset according to requirements

Find more Data Mining And Management remote jobs posted recently Worldwide

Deliveries should include your R script, and R markdown. You have to use the template attached

Background
In this project, we study the dataset from a very influential randomized experient. Tennesses Student/Teacher Achievement Ratio study (Project STAR) was conducted in the late 1980s to evaluate the effect of class size on test scores. This dataset has been used as a classic examples in many textbooks and research papers. You are encouraged to read more about the experiment design and how others analyze this dataset. This document only provides a brief explanation of the dataset that suffices for this course project.

The study randomly assigned students to small classes, regular classes, and regular classes with a teachers aide. In order to randomize properly, schools were enrolled only if they had enough studybody to have at least one class of each type. Once the schools were enrolled, students were randomly assigned to the three types of classes, and one teacher was randomly assigned to one class.

The dataset contains scaled scores for math and reading from kindergarten to 3rd grade. We will only examine the math scores in 1st grade in this project.

Tasks
Any computational tasks should be completed using R.

Install the AER package and load the STAR dataset.
For each of the three class types, draw histgrams of gender, ethnicity, and birth (birth quarter) for participated students.
Write down a linear regression model to study the association between the class types and the scaled math scores in the 1st grade. You may want to include other covariates (predictors) of your choice. Explain your notation.
Fit the model in Task 3 and show your fits in the report with tables or plots.
Construct 95% confidence intervals for the coefficients of the class types.
Interpret the point estimates and the confidence intervals in a way that a non-statistician can understand.
Test the null hypothesis that there are no differences in math scaled scores across class types in the 1st grade. Justify your choice of test.
Explain your test result in a way that a non-statistician can understand.
Conduct model diagnostic and/or sensitivity analysis.
About the recuiter
Member since Mar 14, 2020
Omanakkuttan N
from Valletta, Malta

Skills & Expertise Required

Data Science & Analytics Data Mining & Management 

Candidate shortlisted and hiredHiring open till - Jun 13, 2022

Work from Anywhere

40 hrs / week

Fixed Type

Remote Job

$13.90

Cost

Looking for help? Checkout our video tutorial
How to search and apply for jobs

How to apply? Do you have more questions about the Job?
See frequently asked questions

Similar Projects

Need an R expert to analyze a dataset according to requirements

Deliveries should include your R script, and R markdown. You have to use the template attached

Background
In this project, we study the dataset from a very influential randomized experient. Tennesses Student/Teacher Achievement Ratio stu...read more

Sr. Data Engineer

Looking for Sr. Data Engineer to meet the data needs of Fileo Insights clients. This role develops data processes, provides expert guidance on methods of optimizing data flows and reporting including data modeling, ETL, and indexing to support the bu...read more

Google Analytics Custom Dash & Reporting/Goals Setup

I need a few things on a business Im working on...

1) GA custom dashboard setup (not generic as I can do that) with important metrics and quick look data

2) Goals and funnel setups

3) I also need help with the best social...read more