Advanced Stats: Ramya's POV

Posts

Showing posts from September, 2022

Module 6: Normal life of Normal Distributions

September 30, 2022

Hi everyone! For this week's assignment, we are looking into normal distributions which is the most common type of distribution in the world of statistics. As a continuous distribution it never fails to be anything short of a bell curve with standard normal distributions having a mean of 0 and a standard deviation of 1. A. Consider a population consisting of the following values, which represents the number of ice cream purchases during the academic year for each of the five housemates. 8, 14, 16, 10, 11 a. Compute the mean of this population. b. Select a random sample of size 2 out of the five members. Sample is 16 and 8 c. Compute the mean and standard deviation of your sample. d. Compare the Mean and Standard deviation of your sample to the entire population of this set (8,14, 16, 10, 11). The mean of the population is 11.8, while the sample mean is 12. The standard deviation of the population is 2.856, while the sample standard deviation is 5.656. T...

Module 5: Significance of significance testing (hypothesis to correlations)

September 25, 2022

Hi everyone! This week we have learned about hypothesis testing, correlation, and confidence intervals. The director of manufacturing at a cookies needs to determine whether a new machine is production a particular type of cookies according to the manufacturer's specifications, which indicate that cookies should have a mean of 70 and standard deviation of 3.5 pounds. A sample pf 49 of cookies reveals a sample mean breaking strength of 69.1 pounds. A. State the null and alternative hypothesis H₀ : μ = 70 pounds Hₐ : μ ≠ 70 pounds B. Is there evidence that the machine is not meeting the manufacturer's specifications for average strength? Use a 0.05 level of significance No, we do not have evidence that the machine is not meeting the manufacturer's specifications for average strength because the z test statistic (-1.8) is greater than the critical value (1.96) on the lower tail in the rejection zone from -1.96 to - infinity. C. Com...

Module 4: Probably applying Probability Rules

September 18, 2022

Hi everyone! For this week's assignment, we are applying the probability rules we have learned like the addition, multiplication, and conditional rules. An important distinction in choosing the right rule is if the events you analyze are mutually exclusive or not and independent or not. Mutually exclusive means these events cannot happen at the same time like riding a bike to work versus who walks to work on a given day. Independent events means an outcome of one event does not affect the outcome of the other. We apply the addition rule with mutual exclusivity or "or" problems and the multiplication rules with the independence or "and" problems. If mutually exclusive P(A or B) is P(A) + P(B), but if not then P(A or B) = P(A) + P(B) - P(A and B). If independent P(A and B) = P(A) * P(B), but if not then P(A and B) = P(A) * P(B|A). Let's do some stats! 1. A. Based on Table 1 What is the probability of: ...

Module 3: Describing descriptive statistics (Set 1 vs Set 2)

September 10, 2022

Hi everyone! This week we are applying our knowledge of descriptive statistics into R functions. Two sets of data have been given which can easily be made into vectors in R. The first set of data is called "set1" and the second set of data is called "set2." Descriptive statistics describes a data set through central tendency measures (mean, median, mode) and variance measures (range, quartiles, standard deviation, variance). Instead of doing this by hand, R has functions to make this process easier! I have created two lists, which are basically cooler vectors that can have more than one data type, to showcase all the measures for each data set. I have also utilized the names function to make it easier to make comparisons of each measure from the sets. Here is the code ....(drumroll) The Output! Set 1 ...

Module 2: The meaning of myMean()

September 02, 2022

Hi everyone! For this week's assignment, we were asked to evaluate a given function " myMean ." So far I have learned the R programming language makes use of variables, functions, and comments. We can assign variables with lists of data to be stored for easier use in other functions. In this example, assignment2 is our variable that holds 9 data values. The myMean function returns the mean of the assignment2 data. This is done using the sum function by adding all the values and dividing it by the length function, or adding the number of values. By just retyping myMean and running the script editor you get the output of 18.66667, seen in the R console. A neat trick I utilized is the comments in my code (what starts with #) to do the explanation of this function as well!👇 -Ramya's POV