Pages:
1 page/≈275 words
Sources:
1 Source
Style:
APA
Subject:
Mathematics & Economics
Type:
Statistics Project
Language:
English (U.S.)
Document:
MS Word
Topic:

Statistics Project: Diabetes Dataset

Statistics Project Instructions:

Choose one of the links to locate a data set or create your own by finding a set of data. For this discussion, your data set must be quantitative. If you choose a data set that has two variables, you may use one variable for this discussion, and in an upcoming unit (Unit 4), you may use both variables. Some good sites to obtain data are:
Data.gov
https://www.data.gov/
Once on this site, there is a search bar that will automatically populate categories for some data sets. You can also type in your own category for the data you are interested in.
In addition to the sites above, you can use the internet to create your own set of data. Some other ideas for data sets include:
Prices of homes for sale in your town or city
Costs of flights by one airline to different locations
Interest rates from banks
Distances to favorite vacation destinations, or distances of brightest stars
Numbers of letters in words used for a spelling contest
Enrollment rates at a college or university
Prices of cars at a dealership
There is no end to the list of possible data sets for this activity, so get creative! Whichever dataset you use, be sure to include a description or reference of where you found the data.
Be sure your data set includes more than 25 data items but fewer than 40. If you locate a data set that has more than 40 items, you may choose to use any number of items between 25 and 40. If your chosen data set includes 25 or fewer data items, you may make up enough items or repeat some of them to have more than 25. Enter the data into this spreadsheet (also located in Course Resources). Post your data into column B to show the calculations for the mean, median, mode, and standard deviation.
For your post, name the population which this data set represents. Which measure of center (mean, median, or mode) best represents your data? Explain why you think so. Remember to attach your Excel spreadsheet to your discussion post.
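As a cross-check on the spreadsheet, the four statistics named in the instructions can also be computed with Python's standard `statistics` module. This is a minimal sketch, not the course's method: the values are hypothetical and made up for illustration, and the sample standard deviation (dividing by n - 1) is an assumption, since the spreadsheet may use the population version instead.

```python
import statistics

# Hypothetical values for illustration only; not taken from any real data set.
values = [62, 64, 66, 68, 68, 70, 70, 70, 72, 72, 72, 72, 74,
          74, 74, 76, 76, 78, 78, 80, 80, 82, 84, 86, 88, 90]


def summarize(data):
    """Return the sample size and the four statistics the assignment asks for."""
    return {
        "n": len(data),
        "mean": statistics.mean(data),
        "median": statistics.median(data),
        "modes": statistics.multimode(data),  # a list, because ties are possible
        "stdev": statistics.stdev(data),      # sample standard deviation (n - 1)
    }


summary = summarize(values)

# The assignment requires more than 25 but fewer than 40 data items.
size_ok = 25 < summary["n"] < 40

for name, value in summary.items():
    print(name, value)
print("size within assignment limits:", size_ok)
```

If the spreadsheet uses the population standard deviation, `statistics.pstdev` is the matching call.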

Statistics Project Sample Content Preview:

Statistics Project
Name
Institution
Course
Professor
Date
The dataset I chose is the Diabetes Dataset, which I retrieved from Kaggle.com. It contains data on female patients of Pima Indian heritage who are at least 21 years old. I will analyze the diastolic blood pressure (mm Hg) of a sample of 40 patients.
My dataset represents the population of all female patients of Pima Indian heritage aged 21 and older. A population is the entire group of objects, things, or individuals from which a representative sample is selected for a particular study (Bhandari, 2021). Its members form a distinct group of observations, individuals, or objects that share a common characteristic.
Different statistical measures can be used to summarize data numerically. They are often categorized into three groups: measures of center, position, and dispersion. A measure of center describes a dataset with a single value that represents the middle or center of its distribution. The most commonly used measures of center are the mean, the median, and the mode.
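To make the three groups concrete, the short sketch below computes one example of each: the median (center), the quartiles (position), and the range and interquartile range (dispersion). The nine values are hypothetical, and the quartiles use the default "exclusive" method of Python's `statistics.quantiles`, which may differ slightly from the quartile rule a spreadsheet or textbook applies.

```python
import statistics

# Hypothetical values for illustration only.
data = [60, 64, 68, 70, 72, 74, 78, 80, 90]

# Center: one value that represents the middle of the distribution.
center = statistics.median(data)

# Position: the three quartile cut points split the sorted data into four parts.
q1, q2, q3 = statistics.quantiles(data, n=4)

# Dispersion: how widely the values are spread.
data_range = max(data) - min(data)
iqr = q3 - q1

print("median:", center)
print("quartiles:", q1, q2, q3)
print("range:", data_range, "IQR:", iqr)
```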
Median
The median is the middle value of a set of values sorted in ascending or descending order. Because it divides the distribution in half, fifty percent of the observations lie on each side of it (Beyer, 2021). The median is not affected by extreme values; hence, it is suitable for datasets with outliers or skewed distributions....
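The definition above can be sketched in a few lines of Python. The `median` helper and both lists are hypothetical illustrations rather than part of the original analysis: the function sorts the values and takes the middle one (or the average of the two middle ones), and the comparison shows how one extreme value moves the mean but leaves the median unchanged.

```python
import statistics


def median(values):
    """Return the middle value of the data after sorting it."""
    ordered = sorted(values)
    n = len(ordered)
    if n == 0:
        raise ValueError("median requires at least one value")
    mid = n // 2
    if n % 2 == 1:
        return ordered[mid]                       # odd count: single middle value
    return (ordered[mid - 1] + ordered[mid]) / 2  # even count: average the two middle values


# Hypothetical readings; the second list swaps the largest value for an extreme one.
readings = [70, 72, 74, 76, 78]
skewed = [70, 72, 74, 76, 180]

print(median(readings), statistics.mean(readings))
print(median(skewed), statistics.mean(skewed))
```

In this made-up case the median stays at 74 for both lists, while the mean is pulled upward by the single extreme reading.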