[FRIAM] statistics question

Russell Standish lists at hpcoders.com.au
Thu Mar 13 16:55:52 EDT 2025


A simple check you could do is check for Benford's law: https://en.wikipedia.org/wiki/Benford%27s_law

On Thu, Mar 13, 2025 at 08:33:09PM +0000, Marcus Daniels wrote:
> You want a Wilconxon test.  Here is an example for R.
> 
> # Load necessary libraries
> 
> library(ggplot2)  # For plotting
> 
> library(stats)   # For statistical tests (base R)
> 
>  
> 
> # Example data (replace with your actual data)
> 
> # admitted_ids <- c(1, 3, 5, 7, 10, 12)  # Example admitted student IDs
> 
> # waitlisted_ids <- c(15, 18, 20, 22, 25, 30)  # Example waitlisted student IDs
> 
>  
> 
> # Combine data into a data frame for plotting
> 
> data <- data.frame(
> 
>   ID = c(admitted_ids, waitlisted_ids),
> 
>   Status = factor(c(rep("Admitted", length(admitted_ids)),
> 
>                     rep("Waitlisted", length(waitlisted_ids))))
> 
> )
> 
>  
> 
> # 1. Plot histograms with density
> 
> ggplot(data, aes(x = ID, fill = Status)) +
> 
>   geom_histogram(aes(y = ..density..), alpha = 0.5, position = "identity") +
> 
>   labs(title = "Distribution of Student IDs", x = "Student ID", y = "Density")
> +
> 
>   theme_minimal()
> 
>  
> 
> # 2. Box plot to compare medians and spread
> 
> ggplot(data, aes(x = Status, y = ID, fill = Status)) +
> 
>   geom_boxplot() +
> 
>   labs(title = "Box Plot of Student IDs by Status", y = "Student ID") +
> 
>   theme_minimal()
> 
>  
> 
> # 3. Mann-Whitney U Test (non-parametric, good for skewed data)
> 
> mw_test <- wilcox.test(admitted_ids, waitlisted_ids, alternative = "two.sided")
> 
> print("Mann-Whitney U Test:")
> 
> print(mw_test)
> 
>  
> 
> # 4. Kolmogorov-Smirnov Test (compare distributions)
> 
> ks_test <- ks.test(admitted_ids, waitlisted_ids)
> 
> print("Kolmogorov-Smirnov Test:")
> 
> print(ks_test)
> 
>  
> 
> # 5. Compare means
> 
> mean_admitted <- mean(admitted_ids)
> 
> mean_waitlisted <- mean(waitlisted_ids)
> 
> cat("Mean ID (Admitted):", mean_admitted, "\n")
> 
> cat("Mean ID (Waitlisted):", mean_waitlisted, "\n")
> 
>  
> 
> # 6. Optional: T-Test (if data is roughly normal)
> 
> t_test <- t.test(admitted_ids, waitlisted_ids)
> 
> print("Two-Sample T-Test:")
> 
> print(t_test)
> 
>  
> 
> # 7. Quantify skewness (requires 'moments' package)
> 
> # Install if needed: install.packages("moments")
> 
> library(moments)
> 
> skew_admitted <- skewness(admitted_ids)
> 
> skew_waitlisted <- skewness(waitlisted_ids)
> 
> cat("Skewness (Admitted):", skew_admitted, "\n")
> 
> cat("Skewness (Waitlisted):", skew_waitlisted, "\n")
> 
>  
> 
> # 8. Logistic regression (modeling probability of admission)
> 
> model <- glm(Status ~ ID, data = data, family = "binomial")
> 
> summary(model)
> 
>  
> 
> From: Friam <friam-bounces at redfish.com> on behalf of cody dooderson
> <d00d3rs0n at gmail.com>
> Date: Thursday, March 13, 2025 at 1:28 PM
> To: The Friday Morning Applied Complexity Coffee Group <friam at redfish.com>
> Subject: [FRIAM] statistics question
> 
> I have a question concerning preschool admissions. The kindergarten that my
> daughter went to for preschool has a "random lottery" for admissions. They
> published a list of all of the student ids of the students who got in and the
> ones who did not and were put on a waitlist. 
> 
> She did not get in, so I decided that it was unfair and plotted the data. What
> statistical tricks should I use to figure out if the lottery was random or not?
> 
> I have attached a plot of the data in question. To me, the plot looks slightly
> skewed towards the low numbers. The lower numbers are kids that signed up for
> the lottery earlier and I hypothesise have  favorable connections in the
> school.
> 
>  
> 
> [cid]
> 
> 
> _ Cody Smith _
> 
> d00d3rs0n at gmail.com
> 





> .- .-.. .-.. / ..-. --- --- - . .-. ... / .- .-. . / .-- .-. --- -. --. / ... --- -- . / .- .-. . / ..- ... . ..-. ..- .-..
> FRIAM Applied Complexity Group listserv
> Fridays 9a-12p Friday St. Johns Cafe   /   Thursdays 9a-12p Zoom https://bit.ly/virtualfriam
> to (un)subscribe http://redfish.com/mailman/listinfo/friam_redfish.com
> FRIAM-COMIC http://friam-comic.blogspot.com/
> archives:  5/2017 thru present https://redfish.com/pipermail/friam_redfish.com/
>   1/2003 thru 6/2021  http://friam.383.s1.nabble.com/


-- 

----------------------------------------------------------------------------
Dr Russell Standish                    Phone 0425 253119 (mobile)
Principal, High Performance Coders     hpcoder at hpcoders.com.au
                      http://www.hpcoders.com.au
----------------------------------------------------------------------------



More information about the Friam mailing list