In the last three years, I have learned that I love working with data. This came as a huge surprise to me, but I'm sure my parents could have predicted it all along. I'm not wild about entering data, but my job mainly involves writing programs to manipulate and analyze data that are already in digital form, and I don't even mind spending an hour or so doing entry a couple of times a year if necessary. Yesterday it became necessary -- we needed data from the census on the number of people working in various industries in every county in Minnesota and Iowa in 1970 and 1980. Entering data from census volumes is pretty straightforward. The only trick is making sure to read the same line all the way across the page. One of the variables I entered yesterday was the number of people in mining in each county in 1970. By that point, there weren't so many people in mining, at least not in Minnesota or Iowa. Most counties had numbers in the one or two digits. So when I got to one that was in the thousands, I double checked to make sure I hadn't jumped lines. I hadn't, and I don't think it was a typo either -- it was St. Louis County, Minnesota, which is where my father's parents grew up, and his paternal grandfather had indeed been a miner. I guess they are still mining there. When I'm entering this kind of data, I often think about the names of the counties, and am always surprised to realize how many have Indian names. I also am amazed at how much the format of the census tables influences my perception of these places. The census lists counties in alphabetical order, but that order comes to seem natural, such that when two counties listed next to each other have the same value of a certain variable, I have to remember that this is purely coincidental, as two alphabetically-adjacent counties may be geographically nowhere near each other. Moreover, the same value (say, number of people in mining) might have totally different meanings depending on the context (total labor force). Thinking about all this while I'm entering data makes the project a lot more interesting, but probably explains why my head starts to spin after about an hour of it!
Recent Comments