Remote Data Mining And Management Job In Data Science And Analytics

Need an R expert to analyze a dataset according to requirements

Deliverables should include your R script and your R Markdown file. You must use the attached template.

Background
In this project, we study the dataset from a very influential randomized experiment. The Tennessee Student/Teacher Achievement Ratio study (Project STAR) was conducted in the late 1980s to evaluate the effect of class size on test scores. This dataset has been used as a classic example in many textbooks and research papers. You are encouraged to read more about the experimental design and how others have analyzed this dataset. This document provides only a brief explanation of the dataset, which suffices for this course project.

The study randomly assigned students to small classes, regular classes, and regular classes with a teacher's aide. To randomize properly, schools were enrolled only if their student body was large enough to fill at least one class of each type. Once the schools were enrolled, students were randomly assigned to the three types of classes, and one teacher was randomly assigned to each class.

The dataset contains scaled scores for math and reading from kindergarten to 3rd grade. We will only examine the math scores in 1st grade in this project.
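As a rough sketch of how the first-grade math data might be pulled out, the snippet below loads the dataset and keeps the students who have both a first-grade class type and a first-grade math score. It assumes the AER package is already installed and that the relevant columns are named `star1` (class type) and `math1` (scaled math score), as in AER's `STAR` data frame. The object name `first_grade` is only illustrative.

```r
# Assumes AER is installed, e.g. via install.packages("AER")
library(AER)
data("STAR")

# Keep the first-grade class type and math score, plus student covariates
first_grade <- STAR[, c("star1", "math1", "gender", "ethnicity", "birth")]

# Drop students with no first-grade class assignment or no math score
keep <- !is.na(first_grade$star1) & !is.na(first_grade$math1)
first_grade <- first_grade[keep, ]

# Quick look at the structure and at the number of students per class type
str(first_grade)
table(first_grade$star1)
```

Whether to drop incomplete rows or handle missing values differently is a modelling choice that the report should state.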

Tasks
Any computational tasks should be completed using R.

1. Install the AER package and load the STAR dataset.
2. For each of the three class types, draw histograms of gender, ethnicity, and birth (birth quarter) for the participating students.
3. Write down a linear regression model to study the association between the class types and the scaled math scores in the 1st grade. You may want to include other covariates (predictors) of your choice. Explain your notation.
4. Fit the model in Task 3 and show your fits in the report with tables or plots.
5. Construct 95% confidence intervals for the coefficients of the class types.
6. Interpret the point estimates and the confidence intervals in a way that a non-statistician can understand.
7. Test the null hypothesis that there are no differences in math scaled scores across class types in the 1st grade. Justify your choice of test.
8. Explain your test result in a way that a non-statistician can understand.
9. Conduct model diagnostics and/or a sensitivity analysis.
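
One possible starting point for the computational tasks is sketched below. It is a minimal outline, not the required solution. It assumes AER's column names `star1`, `math1`, and `gender`, uses class type as the only predictor, and draws bar charts of counts because gender is categorical. The object names (`g1`, `fit`, `ci_class`, `f_test`) are illustrative.

```r
# Assumes AER is installed, e.g. via install.packages("AER")
library(AER)
data("STAR")

# Students with a first-grade class type and a first-grade math score
g1 <- subset(STAR, !is.na(star1) & !is.na(math1))

# Counts of gender within each class type (repeat for ethnicity and birth)
op <- par(mfrow = c(1, 3))
for (cls in levels(g1$star1)) {
  barplot(table(g1$gender[g1$star1 == cls]), main = paste("Gender:", cls))
}
par(op)

# Linear model with class type as the only predictor; the first factor
# level of star1 serves as the reference category
fit <- lm(math1 ~ star1, data = g1)
summary(fit)$coefficients

# 95% confidence intervals, restricted to the class-type coefficients
ci <- confint(fit, level = 0.95)
ci_class <- ci[grep("^star1", rownames(ci)), , drop = FALSE]
ci_class

# F-test of "no difference across class types": intercept-only model
# versus the model with class type
f_test <- anova(lm(math1 ~ 1, data = g1), fit)
f_test

# Standard residual diagnostics (residuals vs fitted, Q-Q plot, and so on)
op <- par(mfrow = c(2, 2))
plot(fit)
par(op)
```

The F-test here relies on the usual linear-model assumptions of independent errors with constant variance. Because students share classrooms and schools, a sensitivity analysis might add covariates or compare these results with standard errors that are robust to such clustering.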
About the recruiter
Member since Mar 14, 2020
Hotel Citypride
from Giurgiu, Romania

Skills & Expertise Required

Data Science & Analytics, Data Mining & Management

Candidate shortlisted and hired

Hiring open till - Apr 27, 2024

Work from Anywhere

40 hrs / week

Fixed Type

Remote Job

$19.19

Cost
