Panama-8813

DIY Big Year: A Geeky Look At Data

Meet Our Team

NEWS & UPDATES

Stay up-to-date with new tours, special offers and exciting news. We'll also share some hints and tips for travel, photography and birding. We will NEVER share nor sell your information!

  • Please help us send the information for trip styles in which you are most interested.
  • This field is for validation purposes and should be left unchanged.

Oct 2, 2017 | by Greg Miller
Scenic Bixby Creek Bridge in Big Sur California

Scenic Bixby Creek Bridge in Big Sur California – photo by Greg Miller

If you know me then you know I like numbers. A lot. Actually. I love data. It can be so powerful. But it can also be misleading and confusing.

In an early DIY Big Year post I told you about some eBird data that I have been wrangling with for over a year now. In this post I want to give you a peak at some different ways to look at the data I have collected.

Let’s have some fun with Top 10 lists. I have data rolled up for 299 Counties in the United States. The Counties are found all over. All 50 States are represented. Checklist data is from eBird (http://ebird.org) from 2006-2016 as of September 2016. Area and population data come from census.gov.

10 Largest Counties
RankCountyArea (sq mi)
1San Bernardino County, CA20,057
2Coconino County, AZ18,619
3Nye County, NV18,182
4Kenai Peninsula County, AK16,075
5Mohave County, AZ13,311
6Inyo County, CA10,181
7Maricopa County, AZ9,200
8Pima County, AZ9,187
9Kern County, CA8,132
10Yavapai County, AZ8,124
10 Smallest Counties
RankCountyArea (sq mi)
299New York County, NY23
298San Francisco County, CA47
297Suffolk County, MA58
296Richmond County, NY58
295Kings County, NY71
294Newport County, RI102
293Queens County, NY109
292Los Alamos County, NM109
291Clarke County, GA119
290Philadelphia County, PA134
10 Counties with Largest Population
RankCounty2016 Population Estimate
1Los Angeles County, CA10,137,915
2Cook County, IL5,203,499
3Harris County, TX4,589,928
4Maricopa County, AZ4,242,997
5San Diego County, CA3,317,749
6Orange County, CA3,172,532
7Miami-Dade County, FL2,712,945
8Kings County, NY2,629,150
9Dallas County, TX2,574,984
10Riverside County, CA2,387,741
10 Counties with Smallest Population
RankCounty2016 Population Estimate
299Cameron Parish, LA6,882
298Custer County, SD8,596
297Brewster County, TX9,200
296Mono County, CA13,981
295San Juan County, WA16,339
294Socorro County, NM17,027
293Mariposa County, CA17,410
292Inyo County, CA18,144
291Los Alamos County, NM18,147
290Teton County, WY23,191
10 Most Densely Populated Counties
RankCountyPopulation per sq mi
1New York County, NY71,999
2Kings County, NY37,124
3Queens County, NY21,497
4San Francisco County, CA18,581
5Suffolk County, MA13,486
6Philadelphia County, PA11,692
7Richmond County, NY8,155
8Cook County, IL5,504
9Nassau County, NY4,782
10Bergen County, NJ4,031
10 Least Densely Populated Counties
RankCountyPopulation per sq mi
299Brewster County, TX1.5
298Inyo County, CA1.8
297Nye County, NV2.4
296Socorro County, NM2.6
295Kenai Peninsula County, AK3.6
294Mono County, CA4.6
293Cameron Parish, LA5.4
292Custer County, SD5.5
291Teton County, WY5.8
290Coconino County, AZ7.6
10 Counties with Highest Number of Checklists
RankCountyTotal Checklists
1Los Angeles County, CA124,721
2Cook County, IL110,781
3Pima County, AZ104,968
4Tompkins County, NY89,995
5San Diego County, CA87,942
6Middlesex County, MA75,238
7King County, WA73,768
8Essex County, MA72,725
9Harris County, TX69,955
10St. Louis County, MN69,352
10 Counties with Lowest Number of Checklists
RankCountyTotal Checklists
299Custer County, SD2,034
298Hancock County, MS2,097
297Ward County, ND2,238
296Pulaski County, KY3,218
295Cass County, ND3,324
294Harrison County, MS3,527
293Dodge County, NE4,349
292Nye County, NV4,450
291Benton County, AR4,618
290Washington County, AR4,621
10 Counties with Highest Number of Species
RankCountyTotal Species
1Los Angeles County, CA494
2San Diego County, CA488
3Santa Barbara County, CA448
4Cochise County, AZ440
5San Francisco County, CA439
5Ventura County, CA439
7Cameron County, TX434
8Pima County, AZ431
9Orange County, CA429
10Humboldt County, CA428
10 Counties with Lowest Number of Species
RankCountyTotal Species
299Kauai County, HI141
298Hawaii County, HI155
297Honolulu County, HI171
296Spartanburg County, SC204
295Kanawha County, WV222
294Fulton County, GA231
293Chemung County, NY236
292Anchorage County, AK236
291Greenville County, SC237
290Herkimer County, NY241
10 Counties with Highest Number of Checklists per capita
RankCountyChecklists per capita
1Brewster County, TX1.24
2Cameron Parish, LA1.09
3Los Alamos County, NM1.03
4Mariposa County, CA0.98
5Addison County, VT0.91
6San Juan County, WA0.89
7Tompkins County, NY0.86
8Santa Cruz County, AZ0.85
9Mono County, CA0.76
10Inyo County, CA0.75
10 Counties with Lowest Number of Checklists per capita
RankCountyChecklists per capita
299Clark County, NV0.00602
298Dallas County, TX0.00734
297Shelby County, TN0.00743
296Wayne County, MI0.00849
295Queens County, NY0.00867
294Honolulu County, HI0.00958
293Broward County, FL0.00963
292Tarrant County, TX0.00980
291Tulsa County, OK0.01016
290Providence County, RI0.01097

 

There you have it—a preliminary look at the data I am using. The cross section of data is pretty diverse. It has highly populated areas as well as those that are sparsely populated. Some are large in area. Some are small.

But all of these Counties have one thing in common—they are the most checklists submitted in the United States (or for some States, the most checklists in the State. See my criteria in my previous posts).

Do the most populated Counties have the most checklists? Not always. Do the Counties with the most checklists have the highest number of species? Not always. Is there a significant correlation between population and total checklists submitted? Nope. How about between checklists submitted and number of species? No again.

So how does this all figure into planning a Big Year? I’m glad you asked. Because I am going to tell you anyway. We live in an age with dizzying amounts of data. Having data can be powerful. But only if you are able to harness the information to help you in a way that makes sense.

What am I talking about? Our goal for a Big Year is to see as many unique species in one calendar year as possible. And remember I hope to make this affordable and efficient, too. A bigger population means are larger number of checklists submitted, but only up to a point. And a larger number of checklists means a larger number of species, but again, only up to a point.

All of the data above is interesting. But it does not yet answer the questions about where and when to go birding for the greatest number of unique species in the shortest amount of time.

The total number of species listed above is for the whole period from 2006-2016. It encompasses all seasons. So that number really is not a great indicator of where to go and a poor indicator of when to go.

There are 299 Counties. And each County has 4 weeks of data per month. A month is always 4 weeks in eBird. The first week is the 1st through the 7th. The second week is the 8th through the 14th. The third week is the 15th through the 21st. And the fourth (and last) week is the 22nd through the end of the month. So 299 Counties x 48 weeks of data gives one a large number of possibilities of where to go and when. In fact, that number of possibilities is 14,352. (Have I told you how much I like numbers?)

Now you may be asking, out of 14,352 possibilities where does one even begin to guess where to go and when? Oh, you thought that was complicated. Throw in 984 species into the mix. Yep. You will need more than a hand calculator. You could do it in a spreadsheet if you could do a pivot table of 7 million rows and then sort it. Good luck with that.

A database is a perfect solution. It is unparalleled in its power to perform on problems like this. It can make calculations on mind-boggling amounts of data and retrieve the information in a matter of seconds (ok, minutes for some of our questions). And this is what I did. I took all those spreadsheets of downloaded data and loaded them into a database. (I used Sql Server Express)

In my next post I will tell you how I used this data to arrive at some answers to the questions of the best places to go at the best times of year to maximize the total number of species on each trip. You won’t want to miss that one!

Leave a Comment

You must be logged in to post a comment.