Did US taxpayers get a good deal? Census 1940 site was built for free

National Archives

The website at 1940census.archives.gov is operated by a private company, for free. In exchange, it can use the free public records on its for-profit site as well. Other companies paid $200,000 for the records.

Who says there's no free lunch?

You may have read over the past week about the release of 1940 Census records on a new U.S. government website, a site that buckled under the huge demand from people looking up details on the lives of their friends and relatives from the Great Depression.

You may not have realized that the site was built for U.S. taxpayers for the price of — not one dime. A company from Silicon Valley built the site, and is operating it, for free. Genealogy buffs have been using the site for a week now to check millions of records. (See our earlier story for tips on searching the 1940 Census, and examples of people who have found relatives.)

Of course, the company, Inflection LLC of Redwood City, Calif., did get something in return for its effort: a free copy of those 3.8 million images of records from the 1940 Census. While other companies paid $200,000 for a set of the public records, Inflection can use those records in its for-profit business, a genealogy site called Archives.com.

It's a barter system for federal records: the public gets a free official U.S. website, and the company gets free data. It's been done before, as when the U.S. Patent and Trademark Office gave data to Google, which since 2006 has hosted the site for free as Google Patents.


Do you approve of the approach that the National Archives took, giving the data away in exchange for the free website? And what stories have you found in the 1940 Census? Add your story in the comments below or on our Open Channel page on Facebook.

Inflection also was hoping to get a boost to its reputation for building websites that could withstand a storm of traffic.

Performance standards in the contract
Both the company and the National Archives and Records Administration (NARA) had anticipated that the site would draw a crowd, as 72-year privacy restrictions expired and the records became available. What happened next lends credence to the boast that genealogy is the country's favorite hobby.

The contract says, "Drawing from NARA's experience in releasing the 1930 Census, and the experience of the National Archives of the United Kingdom when they released their 1901 and 1911 Censuses, NARA anticipates immense interest in the 1940 Census and a tremendous increase in traffic to its www.archives.gov web site." (Here's the contract in a PDF file.)

But how much of a crowd?

Here are the performance standards in the contract:

  • "When browsing from one image to another, each image should be presented to the user in 3 seconds or less."
  • "When moving from the standard rendered image to each zoom level (e.g. zoom 1x, 2x, 3x), the reformatted image should be rendered in 2 seconds or less."
  • "Support up to 10 million hits per day while providing response times of less than three seconds for keyword searches of the descriptive metadata."
  • "Support up to 25,000 concurrent users."

There was one more element in the contract, a somewhat vague requirement that Inflection increase service if demand was greater than anticipated.

  • "Scale on demand in the event that 10 million hits and/or 25,000 concurrent users are exceeded to ensure that the performance requirements ... are still achieved."

The crowd certainly exceeded those levels, as the most old-fashioned sounding search term possible, "1940 Census," became a top "trending topic" on Google and Twitter.

Most people seemed to get little or nothing from the site on the first day, including Census leaders, who were prepared to show off how easy it was to look up their grandparents. When the site stuck on "loading image," as it did for many other users, the officials resorted to showing a PowerPoint presentation with the results from an earlier search.

A 'tsunami'
As Inflection's general manager, Joe Godfrey, told us last week, "We were expecting a flood, but we got a tsunami."

  • On Day One, Monday, an estimated 100 million hits, or requests, with 22.5 million hits in just the first three hours. Though Inflection scrambled to improve service, the site was unusable for many users on the first day. The company added more servers through Amazon Simple Storage Service, its cloud data service provider, and also restricted some features on the site (such as zooming of images), until finally it was able to get on top of the traffic.
  • On Day Two, Tuesday, the numbers haven't been totaled, but it's believed to be higher than on Day One, with an estimated 40.1 million hits in the three-hour peak.
  • By Friday, the site was stable with about 60 million hits per day, and had served up more than 80 million images, or about 61 terabytes of data, the National Archives said. (That's more than the data contained in the first 20 years of astronomical observations by the Hubble Space Telescope.) The service quality was better than called for in the contract, with a load time of about 1.8 seconds per page, according to the Archives.

In other words, this might have been a good project for a "soft launch."

The contract called for extensive load testing before the release. We asked the National Archives for copies of those test results, but its spokeswoman said it wouldn't be able to provide them. But it said the site was tested to handle more than 70,000 simultaneous users — more than the contract called for, and fewer than the level that resulted.

A 'no-cost contract'
No-cost contracts are allowed under Federal Acquisition Regulation competitive procedures. This contract has a one-year base period and options to extend for four more one-year periods.

"NARA provided a copy of the data to Inflection at no cost, copies that were sold to others for $200K," said spokeswoman Laura Diachenko of the National Archives. "Why Inflection agreed to this is a better question for them, but we are very happy to have them as a partner. They have experience with Census data, and managing access to large data sets, the capabilities we were seeking for this project."

She added, "Even though this is called a no-cost contract, the Government did incur costs — in this case, aside from our resources, we also provided a copy of the 1940 Census to Inflection, at no cost.  In this particular case, we provided them data that they wanted in exchange for hosting access to this data.  Their interest was in getting the data (for their archives.com business), and for business development (attracting users to their site and eventually converting them to a subscriber."

Inflection's Godfrey said, "The primary value for us was in building our brand/notoriety, leveraging and expanding our technical expertise/infrastructure and helping to getting this extremely valuable record collection into the hands of as many people as possible.  Also, our engineering team (like all great engineers) are motivated by tackling challenging technical problems, and so the team was very excited to work on this."

Competition
All or most of the 1940 Census is now available free from several other companies, which had to pay for the public records. As a sort of loss leader, other genealogy sites, even the commercial ones, are making the 1940 Census records available for free, to subscribers and non-subscribers alike.

Here's how the race worked: All the commercial sites that chose to buy the data for $200,000 were handed a rack of hard drives full of 20 terabytes of images, taken from 4,745 rolls of microfilm, at 12:01 a.m. on April 2, or 72 years and a day after the Census Day in 1940.

By Thursday, a relatively new genealogy site called myHeritage, was the first to have all the images online. Also making images available for free are Ancestry.com, a commercial site, and FamilySearch.org, owned by the Church of Jesus Christ of Latter-day Saints.

Thousands of volunteers are working on the next step: indexing the records by name, just as previous Census releases have been indexed by volunteers. Until those indexes are finished, searching is done only by address or neighborhood.

Your view
Do you approve of the approach that the National Archives took, giving the data away in exchange for the free website? And what stories have you found in the 1940 Census? Add your story in the comments below or on our Open Channel page on Facebook. See our earlier story for tips on searching the 1940 Census.

Discuss this post

Whether we like it or not is irrelevant. Everyone is going to be doing commerce like this soon as the value of the dollar turns to dust.

  • 2 votes
Reply#1 - Mon Apr 9, 2012 8:15 AM EDT

The bad news is the web site crashed. The good news is the Mormon Church added 100 million new members to the list of those who have been "saved".

  • 1 vote
#1.1 - Mon Apr 9, 2012 8:28 AM EDT
Reply

Right now it only works if you know where your ancestor lived. It takes a long time searching every page in their city or township for them.

    Reply#2 - Mon Apr 9, 2012 8:30 AM EDT

    Well, that is true Teacher. You need to know basic info to do the search. A cousin of mine found the 1940 census for our grandparents as well as our own parents who were all kids at the time. Those on that census are all gone now, but it's great to touch that part of their lives. To bring them back to us.

    • 2 votes
    #2.1 - Mon Apr 9, 2012 8:42 AM EDT

    If your family didn't move and you can find them in the 1930 census it is pretty easy to find them in the 1940 census. I was able to find my grandparents and my great-grandparents. Much to my surprise, my grandmother had a job! I had no idea that she had ever gone to work outside the home.

      #2.2 - Mon Apr 9, 2012 2:12 PM EDT
      Reply

      No taxpayer dollars used and there's still gonna be whining on here

      • 9 votes
      Reply#3 - Mon Apr 9, 2012 8:33 AM EDT

      I wonder how soon politics will get into this. Democrats will say it is a Republican ploy to keep an entire generation from voting and Republicans will say it is taking away our right to privacy -- or some other political bs. I guess I just started it.

      • 1 vote
      Reply#4 - Mon Apr 9, 2012 8:40 AM EDT

      If I could give you a thumbs down I would.

        #4.1 - Mon Apr 9, 2012 1:43 PM EDT
        Reply

        As someone who works in the Archives field, I think what NARA did was great. During a time when archival and library budgets are being slashed across the country, they were still able to perform a major part of their job: Access to the records. It might not have worked right away, but the records are up there and able to be seen. I think the site crashing is great. Maybe it will show the people that are cutting the budgets for these institutions that there is a lot of value there, and the people do want to see items that archivists and librarians are working hard to preserve.

        That being said, after a long week of searching and asking other relatives for information, I have found some of my relatives. I find it fascinating that when my grandfather was young his whole household made less than $4000 a year with 5 of them working. The hunt for the information is the best part of researching, and this has definitely been an adventure.

        • 8 votes
        Reply#5 - Mon Apr 9, 2012 9:17 AM EDT

        As long as you know the town, street and possible side street..you be able to find who you seek.

        Found my mom,she was 15 years old then living with her uncle and aunt with four of her younger siblings..rent was $25 per month and the only employed person in household was her uncle at $600 per year income..very hard times she had back then.

        • 3 votes
        Reply#6 - Mon Apr 9, 2012 9:20 AM EDT

        What NARA did was give the data away in return for Inflection creating and running a website for years! This would have been a very high cost to NARA over time - and as someone else noted, subject to reduction with the budget issues.

        • 1 vote
        Reply#7 - Mon Apr 9, 2012 10:12 AM EDT

        I found my grandparents with their four children including my dad, age 5, in less than 10 minutes. It helped that they lived in a small town in northern Wyoming and there are only 14 pages in the entire enumeration district.

        My grandfather made $720 in 1939 working as a roustabout in the local oil fields. I worked in an oil refinery one summer [1981] with my grandfather, who was then a pipefitter. He encouraged me to go to college. Today I make more than 250 times what he made in 1939, working as a Geophysicist for an oil company. Grandpa was right about college.

        • 1 vote
        Reply#8 - Mon Apr 9, 2012 10:20 AM EDT

        I found the site to be slow especially when displaying the search results. Not only that but when I clicked the associated map of my search area the system came back with an error telling me that the page I requested was not available! What the heck is that about?

        That said I still think having these records available for people is the way to go since paying for "public" documents seems to be wrong.

          Reply#9 - Mon Apr 9, 2012 11:04 AM EDT

          The reason you pay for the same documents on places like ancestry.com is that they index the names to addresses making it much easier to find the person you are looking for.

          • 1 vote
          #9.1 - Mon Apr 9, 2012 11:21 AM EDT
          Reply

          Given that I knew the address where my parents lived in 1940, I found them and my grandparents but given that I was born 15 months after the census, I will have to wait another 10 years to see my name in the 1950 census. All are gone save for one aunt.

            Reply#10 - Mon Apr 9, 2012 11:20 AM EDT

            Considering the census was paid for by the citizens of this country it should be accessible for free directly from the government. So no, I don't think "free" in this case is such a great deal, the price is what it should be.

            • 2 votes
            Reply#11 - Mon Apr 9, 2012 12:02 PM EDT

            Yes, good deal.

              Reply#12 - Mon Apr 9, 2012 2:23 PM EDT

              I'll echo what others have said: of course the information that we paid for as taxpayers should be free to all.

              At one point in time, this applied to research as well, but big business has found ways to circumvent that, by paying for the research after it's been done. So no risk to the business, but a huge loss for the public domain.

                Reply#13 - Mon Apr 9, 2012 3:09 PM EDT

                I found it interesting looking through at the old road names that used to exist where I grew up in rural Indiana. While I grew up at the corner of CR1125 and CR850 I found that New Linden Road and Quigg's Road and John's Road were all places I know, but by utilitarian 911-friendly county denominators of the 21st Century.

                  Reply#14 - Mon Apr 9, 2012 3:17 PM EDT

                  Those of you saying the information should be "free" to us as taxpayers, well, it is. In microfilm form at the National Archives. That and all the OTHER free "public" information we have available to us. The government doesn't have to make that information easily available, for free,on the internet. :) It is very nice of them to find a way to do that, though. Very nice project!

                  • 2 votes
                  Reply#16 - Mon Apr 9, 2012 3:44 PM EDT

                  My search was unsuccessful. Todate, the site has a map available and 0 distribution and 0 schedules. However, I managed to find 68 census pages, covering only a portion of the village. I sent a "contact" message but have heard nothing back. It appears there is still a lot of work that needs to be done by this "non-profit."

                    Reply#17 - Mon Apr 9, 2012 10:29 PM EDT

                    Don't forget, the IRS still expects you to pay tax on the value of your barter

                      Reply#18 - Tue Apr 10, 2012 9:21 AM EDT
                      You're in Easy Mode. If you prefer, you can use XHTML Mode instead.
                      As a new user, you may notice a few temporary content restrictions. Click here for more info.