
Using an Ecologists' Measure of Diversity in Higher Education

Diversity is a topic a lot of us in higher education think about and write about and work towards, and yet, we don't really have a common definition of what it means. At its most basic level, we simply talk about the percentage of our students who are non-white. And, of course, if you compare colleges today to those in the 1950s, this makes perfect sense, and allows us all to give ourselves a pat on the back.

But the success of Asian students over the past few decades has complicated this: While they are not white, their large numbers at the nation's most selective institutions, and their performance on college admissions examinations, make us occasionally shift the discussion to under-represented students of color, which today might include Native American and Alaska Natives, Latino or Hispanic students, African-American students, Asians who are Hawaiian or Pacific Islander, and students of two or more races or ethnicities. This of course causes us to wonder whether a student of mixed Asian/Caucasian ethnicity should count, and to remember that technically, Hispanic is not a race. It's all very confusing.

On top of that, there are institutions that serve large numbers of under-represented students (HBCUs, for instance) but are not very diverse in the clinical sense: Almost everyone enrolled in those institutions is African-American. How do we describe diversity in a way that makes sense to everyone?

One way to do it is to use a measure called Simpson's Diversity Index. You can read about it here if you'd like, but it essentially says that once you come up with a set of categories and count the population in each, you can calculate the likelihood that choosing any two members at random produces a mismatch of type. For instance, at a college in Puerto Rico, if you randomly select two students, the chances they are of different ethnicities are probably very small: You'll usually get two Hispanic students. Go to Howard University, and odds are you'll select two African-American students on your trials. This translates into a lower Simpson's number. If you have a university that is truly more diverse in the ecological sense, you'll see that number go up. All the numbers in the index are between zero and one.
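If you want to see the arithmetic, here is a minimal sketch of the calculation as described above: the probability that two different students picked at random belong to different groups. The function name and the enrollment counts are made up for illustration, and the exact variant used for the charts is not spelled out in this post, so treat this as one reasonable reading (sampling without replacement) rather than the formula behind the visualization.

```python
from typing import Mapping


def simpson_diversity(counts: Mapping[str, int]) -> float:
    """Chance that two distinct, randomly chosen students are in different groups."""
    total = sum(counts.values())
    if total < 2:
        raise ValueError("need at least two students to draw a pair")
    # Number of ordered pairs in which both students come from the same group.
    same_group_pairs = sum(n * (n - 1) for n in counts.values())
    # Divide by all ordered pairs, then take the complement to get mismatches.
    return 1.0 - same_group_pairs / (total * (total - 1))


# Hypothetical enrollment counts, not real institutions.
nearly_uniform = {"Group A": 980, "Group B": 10, "Group C": 10}
evenly_mixed = {"Group A": 250, "Group B": 250, "Group C": 250, "Group D": 250}

print(round(simpson_diversity(nearly_uniform), 3))
print(round(simpson_diversity(evenly_mixed), 3))
```

The campus where nearly everyone falls in one group lands close to zero, and the evenly mixed campus lands much higher, which is the pattern described above.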

Of course, it's short-sighted to measure diversity just on race or ethnicity, but it's the thing we have the best data on. We can add other elements into the mix, but since the data are pre-aggregated, we cannot break the groups into subgroups (for instance, wealthy White students vs. poor White students), even though doing so would yield better insight.

Look below. The first view shows all four-year, public and private not-for-profit colleges and universities in the US, and their Simpson's Diversity Index as calculated from total undergraduate enrollment in Fall 2013. On the first view, the bars are colored by freshman admissions rate, with an interesting theory suggesting that if your admit rate is low, you could be more diverse if you really wanted to be. In the tool tip that pops up when you hover over a bar (like in the screenshot right below), you'll see the breakdown of enrollment by ethnicity.


And if you hover over several bars in the same range, you'll see you can get to similar numbers in very different ways. So, even among diverse institutions, there are very different student body mixes in play.

On the second tab, you'll see some element of economic diversity added in: Pell Grant eligibility as a color. The chart is a scatter of Simpson's and Admission rates.

One note: I calculated the index two ways, using as the base number only those with known ethnicity, and then those whose ethnicity was not listed. I think the first number is probably a better tool, but I did include the second in the tool if you're interested.
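To make the two-ways idea concrete, here is a rough sketch with made-up counts. It assumes the second version counts students with unlisted ethnicity as one extra group of their own; that is only one possible reading of the second calculation, and the names and numbers are hypothetical.

```python
def simpson(counts):
    """Chance that two distinct, randomly chosen students are in different groups."""
    total = sum(counts)
    same_group_pairs = sum(n * (n - 1) for n in counts)
    return 1.0 - same_group_pairs / (total * (total - 1))


# Hypothetical counts for a single campus.
known = {"Group A": 600, "Group B": 300, "Group C": 100}
unknown = 50  # students whose ethnicity was not listed

# Version 1: base is only students with known ethnicity.
index_known_only = simpson(list(known.values()))

# Version 2 (assumed): the unknown students are added as their own category.
index_with_unknown = simpson(list(known.values()) + [unknown])

print(round(index_known_only, 3), round(index_with_unknown, 3))
```

Under that assumption, the unknown group behaves like one more "type," so it nudges the number up even though we don't actually know anything about who is in it.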

Do you see anything interesting here? I'd love to hear it.


