Tag: GSS Monthly

  • To Twitter or Not to Twitter?

    Ok, I did it. I started to “Twitter”.

    Actually, I created a Twitter account a few months ago but only started actively using it a couple of days ago. Mind you, I don’t change my habits easily. I’m almost never the first person to try something new. I wait for others to play the guinea pig and see if the idea is going to catch on to the mass market before I invest my time. For example, I never created a MySpace page, and I’m glad I didn’t. I do participate in Facebook to keep up with friends and relatives, but that’s a passive thing that requires very little time. The upside is really good if you like staying current with friends/family.

    Twittering and blogging are very different than passive social networking tools like Facebook or MySpace or even a website. The fact is that Twittering and blogging require regular attention in order for them to have value, in my opinion. Twitter more so than blogs.

    What is Twitter?

    Twitter is what I call one-line zingers that people send out. It’s sort of like text messaging in its brevity. Each Twitter message is limited to 140 characters. Twitter messages can be news events (Accenture dumped Tiger), comments on personal status (I’m getting a lobotomy), and so on. People who are interested in what you are doing can decide to follow your Twitter, which means that their Twitter messages will be sent to your Twitter account. Those who are really, really interested in what you are doing can also receive Twitter messages on their phone via text message.

    You can send Twitter messages from your computer or from your phone via text messaging. According to Twitter co-founder Evan Williams, up to 60,000 people (assuming they’ve decided to follow you) receive your Twitter message in a matter of seconds after you send it. Click here for a short presentation on Twitter at the TED conference by Williams.

    In months past, I’ve asked myself what value Twitter could add to my personal life. My conclusion at that time was and still today, is not much. I’m connected with my friends and family, who live all over the continent and in other countries, via Facebook. I don’t need real-time info about what my family and friends are doing and I’m really not interested in providing real-time info to them about what I’m doing. If there’s an urgent matter, they call me or I call them.

    If I didn’t see the value of Twitter in my life, then why did I sign up for it?

    For me, I see Twitter as a potentially valuable tool in my job. However, you have to realize that I’m in the gossip business (self-deprecating definition of a journalism) so its value may be more for me than you depending on the value you place on near-real-time information. From my activity on it these past few days, I see the upside. I’m not sure how good of a Twitter provider I’ll be yet, but I’m following about 64 other Twitter-ers and find the information they are providing useful.

    I’m far from optimizing my use of Twitter. The people I’m following are providing news updates for the most part, not personal updates, which is the way I like it. If I find that people are sending personal updates, I “unfollow” them. I really don’t care to know if someone is at the dentist’s office. To decide who to follow, I searched for terms such as “GIS,” “GPS,” “mapping,” and similar, as well as for particular companies I want to follow. I spent an hour or so doing this. I’m sure I will add and subtract many more Tweeters, but this is a start and I’m already receiving a wide array of Tweets. Following is what I’ve got on my Twitter list right now, at least the first six entries:

    1. trimbleoutdoors

    An easy #cycling tour of the back roads through the West San Francisco Bay foothills http://ow.ly/LBYM #bicycling21 minutes ago from HootSuite

    1. NN4D

    “The Use of Game Engines in GIS” http://bit.ly/4EKchz35 minutes ago from API

    1. thegisforum

    Forum Post: Trimble Office Software and Windows 7: Looking for anyone who has installed either Trimble Pathfinder O… http://bit.ly/4wjcshabout 1 hour ago from twitterfeed

    1. NN4D

    “MapQuest Adds StreetView-type Imagery” http://bit.ly/8fSJeKabout 1 hour ago from API

    1. mikehogan

    ArcSDE 9.3.1 SP1 released: http://bit.ly/8PQvHf ArcIMS 9.3.1 SP1 Released: http://bit.ly/5OgK6V #ESRIabout 1 hour ago from Seesmic

    1. ESRI

    lass=”entry-content”>Want to keep track of the latest discussion on Spatial Roundtable? Check out the #RSS feed for comments: http://tr.im/HBZn #ESRIabout 1 hour ago from TweetDeck

    If you want to follow my Twitter, my Twitter name is GPSGIS_Eric. I promise to keep my Tweets relevant and frequently infrequent.

    What’s this got to do with GIS and Geospatial technology?

    If you’ve followed this column, you’ll recall I’ve mentioned social networking on one or more occasions. Geospatial data and positioning technology (eg. GPS) are a huge part of Location-Based Services (LBS) that will be a significant technology in our lives in the next five years.

    Last August, Twitter announced they were working on a Geotagging API that would provide developers the ability to geotag Tweets. As of now, the release is official and they have announced that Twitter developers such as (quote from Twitter blog):

    BirdfeedSeesmic WebFoursquareGowallaTwidroidTwittelator Pro and others are already supporting this new functionality (go try them out now!) in interesting ways that include geotagging your tweets and displaying the location from where a tweet was posted. The added information provides valuable context when reading your friends tweets and allows you to better focus in on local conversations. Now you can find out what live music is playing right now in your neighborhood or what people visiting Checkpoint Charlie are saying today about the anniversary of the Berlin Wall. These are only the beginning and we are really looking forward to seeing the creative uses emerge from the developer community.”

    I’m not sure where Twitter is going to end up yet. Maybe it will morph into something different. The entire LBS landscape is really wide open right now. A lot of people will make a lot of money off of different LBS applications. Some will stick, most will not. The challenge for me (and you) is to decide which ones to invest our time in.

    Just this morning, I received an email invite from my son to join him on…

    Google Wave

    ….deep sigh.

    Thanks and see you next week.

    P.S. I received an email from Jon Sperling, who contributed an article on TIGER data earlier this Fall, with a link to a paper he co-wrote on using Twitter to “…demonstrate how to use Twitter to automatically obtain breaking news from the tweets posted by Twitter users, and to provide a map interface for reading this news, since the geographic location of the user as well as the geographic terms comprising the tweets play an important role in clustering tweets and establishing clusters’ geographic foci.”

  • Who is Geospatial?

    It used to be relatively easy to compartmentalize what GIS was and who were the major players in the industry. It’s not so easy any longer.

    Take a look at the following graphic published in a market research abstract from Daratech, Inc., a firm from Cambridge, Massachusetts. When one thinks of traditional GIS software providers, this list covers the major ones. As you would surmise, ESRI is the top dog for pure-play GIS software sales.

    Intergraph #2? Naaaah. I don’t believe it. Last decade, Intergraph was trying to compete with ESRI in the pure-play GIS software business. But in 2000 Intergraph made a major strategic shift. It bailed out of the hardware business (thanks, in part, to litigation with Intel) and then began focusing on providing software and consulting for vertical markets. Next, after years of being a publicly traded company, Intergraph was “taken private” by Hellman & Friedman LLCTexas Pacific Group and JMI Equity. In 2010, it nearly reach $1B in revenue, the same revenue number it reached in 1990, 20 years earlier. It’s impressive considering the company significantly shifted its focus and exited the hardware business. But, the company isn’t a factor in the pure-play GIS software business any longer. It’s tough to try to compare it to ESRI, SmallworldMapInfo, and even Autodesk.

    Smallworld (GE Energy), Bentley, MapInfo (PB), and Autodesk offer pure-play GIS software. While the Daratech reports that ESRI has about 30 percent of the GIS software market, it’s a lot higher than that thanks to a grassroots distribution channel that reaches nearly every corner of the globe.

    One of the more interesting thoughts in looking at the list of GIS software vendors above is the GIS vs. CAD approach. Clearly, Autodesk, Intergraph, and Bentley design their software from the CAD perspective while ESRI, Smallworld, and MapInfo design their software from the world of GIS. With that in mind, it’s natural that when these companies think about expansion, the GIS-centric suppliers to want to invade the CAD market and for CAD-centric suppliers to want to invade the GIS market. This battle has been going on for many years with no company having the ability to become a dominant player in both CAD and GIS. This is not for lack of trying. Autodesk, who in my opinion is in the best position to invade the GIS space given its worldwide presence and heavy penetration in the AEC (architectural, engineering, construction) market, has plowed a ton of resource into their GIS offerings, it seems to have reached a plateau in terms of market share.

    Anyway, I digress. On to the next Daratech chart. The following graphic, according to Daratech, encompasses a larger “umbrella” than just GIS to include “data, geo-enabled engineering, GPS, photogrammetry, and remote sensing.”

    After viewing this chart, my first thought was “Where’s GoogleGeoEye/Spot Image/DigitalGlobe/RapidEye?” My intention is not to criticize Daratech, but to ask the question “Who is geospatial?” The answer is not clear.

    Whereas GIS is and will continue to be a highly technical profession in which the geospatial infrastructure is based, geospatial data and services are weaving their way into the lives of the consumer. Obviously, Mapquest/Google Earth/Bing Maps/Yahoo Maps paved the way and then were followed by GPS navigators. However, what will dwarf the MDAs, ESRIs, Autodesks, and Intergraphs of the world are when the wireless service providers (think mobile phones) really kick it in gear. Some already have. A company called TeleNav writes navigation software for mobile phones. TeleNav’s product is what Sprint sells to their customers for GPS navigation on their mobile phones for $10/month. Even in these dire economic times, TeleNav filed for an Initial Public Offering (IPO) last month. They are growing like crazy…~700 employees. Are they a geospatial services company? I think so. The Verizon VZ Navigator application written by Networks In Motion Inc. (NIM) boasts millions of paying subscribers. Let’s see….$10/month * 5 million subscribers = REAL MONEY. Not surprisingly, NIM was acquired just last week by TeleCommunication Systems for US$170M.

    Navigation applications are just the tip of the iceberg for mobile phone users. There is a ton of software development going on in the social networking arena, which is under the LBS (location-based services) umbrella. Social networking applications are going to be huge. Look at Loopt. Their application is available on AT&T, Sprint, Verizon, Metro PCS, T-Mobile, and Boost for $4/month. ABI Research estimates that the value of location-based social network companies like Loopt will be $3.3B by 2013.

    Of course, one can already see the writing on the wall. The $10/month price for a navigation application or $4/month for a social networking application won’t remain that high. As competition heats up, the giveaways and bundling will begin. That’s when the volume will really crank up into the hundreds of millions of users.

    Now,  I’m even confusing myself. Who is geospatial?

  • Future Roles of the RPLS and GISP

    One of the new initiatives at Geospatial Solutions (GSS) that I mentioned a couple of weeks ago is a Surveying Section on the website. This is an area on the GSS website that’s going to be dedicated to surveyors and GISer’s who want to collaborate. By collaborating, I mean sharing ideas, sharing data, resolving conflicts and generally communicating in a positive way.

    From my previous column…

    “A particular area where I want to pay specific attention is what I call the Survey Section. There is no doubt in my mind that land surveying professionals and GIS professionals are going to be close brethren in the geospatial world. The roles of both are evolving and the line of demarcation is not always clear, but the two need each other terribly in order to best serve the public. GIS isn’t always about parcel maps and land surveying isn’t always about coordinates. The Survey Section (or whatever better name I come up with) will be a place for this sort of knowledge exchange and collaboration in a positive way. There is not one person or company that can stop this geospatial train, so what’s left is how best the two professions can work together.”

    In the column title, I refer to RPLS (Registered Professional Land Surveyor) and GISP (Geographic Information System Professional). RPLS is a generic term I use, but actually there are many different titles used depending on which US state or which country you reside in. RPLS roles are defined by each U.S. state statute (or country law) and their role varies widely from state to state and from country to country. For example, their role in California is much more encompassing than in Nevada. Likewise, the RPLS definition from one country to the next can be vastly different. GISP, on the other hand, is a certification of competency. Their role is not defined nor regulated by individual states or countries (that I’m aware of).

    RPLS and GISP are formal titles for the licensed (RPLS) and certified (GISP) people in their respective professions. Beyond those who are licensed and certified, there are orders of magnitude more practioners, technicians and specialists who utilize measurement, data collection and data processing technology (hardware and software) in order to complete their tasks at hand. As technology advances and becomes more accessible and more powerful, the growth rate of practioners, technicians and specialists is going to far exceed the growth rate of RPLS and GISP personnel.

    How can I make that statement?

    The fact is that data collection (hardware/software) and analysis (software) technology is becoming cheaper and easier to use every year. The established guard might think of it as “dumbing down” the profession, but I don’t think it is. Think back 60 years ago when data entry was accomplished via punch cards. Today, it is widely accepted and expected that everyone uses a keyboard to enter data…and they are responsible for their data entry errors.

    The result of advancing technology and a tightening economy is that there are many more organizations purchasing the technology and performing certain tasks themselves. So, not only are the roles between the RPLS and GISP getting fuzzier, but also the line of demarcation between the client and the RPLS/GISP.

    One example is wetland delineation mapping. In the past, it was common for a wetland scientist to flag the boundary of a wetland and then hire a third party to measure the locations of the flags and create the wetland delineation map. However, it is becoming common for the wetland scientist to carry a GPS mapping receiver with them and map the flags as they are set. This saves the client time by not having to wait for someone else to return to the site and map it. It also saves money by being able to keep the activity in-house. The risk of bringing the activity in-house is having potentially lower quality data if the wetland scientist isn’t well-versed in how to utilize the GPS mapping technology.

    One can argue that the line of demarcation is easy to draw…just refer to the local statute. Maybe, but I can see two problems with that:

    1. Advancing technology and expertise have allowed RPLS to extend their scope of services in their businesses. For example, an RPLS firm offers mapping services that aren’t inside the bounds of the local statute (eg. wetland surveys). GPS and GIS technology advancements have stimulated this growth.
    2. In the areas of expertise that fall outside of the local statutes, there ends up being a “turf battle.” In the wetland mapping example above, the wetland scientist may not be an in-house employee. He/she may be a contractor to the client just like the RPLS firm. In essence, they are competitors. As technology tools become more accessible, powerful and friendlier, this type of “turf war” is inevitable.

    Does this mean the RPLS firm is destined to only be competitive within the confines of the local statutes (eg. boundary surveys, etc.)?

    I think it depends on the RPLS firm. The answer is “probably” if the RPLS firm is not willing to update its technology toolbox and technology expertise. That would be a shame, though.

    From an expertise perspective, the GISP need the RPLS, badly…but that can only work if the RPLS can speak and operate in terms that make sense to the GISP. The fact is that it’s a bigger step for a GISP to learn RPLS-speak than it is for an RPLS to learn GISP-speak. The GISP speaks coordinates and metadata. To an RPLS, a boundary survey does not a coordinate make.

    From a “turf war” perspective, it’s more difficult. When egos and money are involved, people predictably become sensitive and defensive. GISP are a threat to the way RPLS firms have been doing business for decades. There have been two typical attitudes by the RPLS firm in response to this threat:

    1. Disengage and view the GISP as an adversary rather than a complementary colleague
    2. Expand its “turf” by effecting change in the local statutes to increase the scope in which the RPLS firm operates, thus excluding/restricting the GISP from practicing in certain areas (e.g., GIS mapping).

    The former isn’t going to accomplish much. Change is inevitable and firms must adapt to change in order to survive. Sticking one’s head in the sand isn’t going to make the problem go away.

    The latter is a short-term solution. One can “hide behind a stamp” for only so long. Maybe that’s a rough way to state it, but it’s a proven fact that competition promotes excellence. One only has to look as far as the automobile industry to understand this. Had not Japan entered the automobile industry, the quality of automobiles would not be near what they are today. Yes, it has shaken the core of the US economy, but it was something that was needed. It was a “wake-up call” of sorts.

    This column was not meant to provide answers or even attempt to paint the entire picture. It’s an opening statement on what will be a core part of the Geospatial Solution’s coverage in the future.

    See you next week.

  • Where Are We Going?

    It occurred to me that I haven’t discussed my plans for GeoSpatial Solutions (GSS) since I assumed the editorship of GSS a couple of months ago. Some of you have told me that you thought GSS had “gone away.” True, it was somewhat dormant for awhile, but we’ve got some fantastic initiatives underway for 2010 that will renew your interest in GSS.

    First of all, GeoSpatial Weekly will have an opinion column (mine or a guest whom I coordinate) every issue. I’ll strive to provide something interesting to read that’s relevant every week. Sometimes there is big news to cover and sometimes there’s not, but I’ll always strive to make it interesting. We’ll also continue to produce the monthly GeoIntelligence Insider Newsletter which is focused on news and analysis of spatial technologies in the homeland security and defense segment.

    As some of you may know, I am the founding editor (and continue to be) of GPS World magazine’s Survey Scene newsletter that began more than three years ago. Target marketed e-mail newsletters were a new concept for GPS World at that time, and over the past several years we’ve proven it’s the right formula for delivering valuable information to your e-mail inbox in a timely manner. Furthermore, our highly successful webinar series has drawn a tremendous amount of response from our readers.

    Fortunately, GeoSpatial Solutions is a sister publication of GPS World magazine. Both are owned by Questex Media, which operates a fair number of print and digital magazines in addition to other products and services. They have a very capable IT department which can administer a number of powerful technologies like webinars and video hosting.

    This gives GSS a firm leg in which to leverage from.

    Some initiatives I’m working on now:

    Website, 98.27.162.175/gpsworld.com:

    Currently, you may have noticed that we are piggy-backed on the GPS World website as a temporary home. Over the next couple of months, we will be redesigning the website with its own “look and feel.” We will be adding a number of sections that will make resources available to you such as archived webinars (GIS-oriented), videos, white papers and others.

    A particular area where I want to pay specific attention is what I call the Survey Section. There is no doubt in my mind that land surveying professionals and GIS professionals are going to be close brethren in the geospatial world. The roles of both are evolving and the line of demarcation is not always clear, but the two need each other terribly in order to best serve the public. GIS isn’t always about parcel maps and land surveying isn’t always about coordinates. The Survey Section (or whatever better name I come up with) will be a place for this sort of knowledge exchange and collaboration in a positive way. There is not one person or company that can stop this geospatial train, so what’s left is how best the two professions can work together.

    Webinars:

    As I mentioned above, webinars are a powerful communication tool. Also consider that travel budgets and industry conference budgets have been chopped considerably in the “new economy,” webinars are a natural fit. Last year during the weeks before the ESRI User Conference, I dedicated a GPS World webinar to GIS. I plan to do the same this before prior to the ESRI UC in July as well as the INTERGEO conference in Europe in October. These are the two largest geospatial events.

    Guest/Industry perspectives:

    I tell my wife I’d hate to be married to me. Thank goodness for our family that she doesn’t always listen to me :-)

    I think it’s invaluable to hear perspectives from industry folks, even if they don’t agree with me. I’ve started reaching out to those whom I think would bring an interesting perspective to GeoSpatial Solutions. Starting in 2010, I’d like to have at least one per month on varying topics from GIS database technology to trend analysis to data collection methods to new computer hardware developments that effect geospatial professionals.

    Multimedia content:

    The next best thing to being there is viewing a video of an event, an interview, a process, or experience of some sort.

    Youtube, Google Earth and the internet in general have transformed the way we interact in our world and specifically our geospatial world. Those technologies have brought us closer. During my last webinar, I had questions from several people who lived on different continents…Asia, Australia, Europe and Africa. I was interacting with other geospatial professionals who lived in completely different cultures, spoke different languages and lived in significantly different time zones. I cannot begin to imagine kind the room he or she sat in while attending the webinar anymore than he or she could picture what my little office space looked like, but our common geospatial connection brought us together.

    Multimedia content is a tremendously important technology that allows us to grow closer in the geospatial community even though our geographic coordinates are significantly different.

    Industry Conference Live Coverage:

    Attending major industry conferences is important to me because that’s where a lot of industry buzz is taking place and where I get a chance to meet up with a lot of people with whom I don’t see on a regular basis. I also tend to present at these conferences too. The next one is the ACSM/GITA conference next Spring in which I’m leading a half-day GPS workshop along with Pamela Fromhertz of the National Geodetic Survey.

    Live coverage of conferences is a great way of bringing you closer to the conference buzz from your desktop at work or home especially when combined with blogging (such as what we’ve done at GPS World) and video coverage.

    KSA (keyword search) Service:

    Soon, we will give you the ability to sign up for KSA free of charge. Essentially, you select from a list of keywords (such as web-mapping, WAAS, GeoPDF). Once signed up, we will automatically send you an e-mail every time new content (news stories, columns, webinars, etc.) is published that include your keywords.

    Blog/Twitter/Discussion Forum:

    There is a notion of utilizing too much technology. I want to be careful of that. Blog’s make sense if they are relevant and insightful. Twitter is the fast-food of the blog world.

    Discussion forums can be very useful, but they are only powerful if there is participation from a lot of users. That can only happen if there is a foundation of relevant and useful content.

    I can’t tell you if we will use these technologies, but I can tell you that if we do, we will do it right. I respect your time and attention enough to not want to waste it.

     

    Thanks, and see you next week.

  • 3D Geospatial Data

    The usage of three dimensional data in the geospatial industry is in its infancy. It makes sense to me. Sometimes, it’s hard enough for folks to obtain and maintain accurate two dimensional data, not to mention elevation! However, as geospatial technology continues to evolve, the availability of 3D geospatial data will evolve. I’m pretty sure that in ten years we will look back and be amazed at how little we used 3D geospatial data.

    But for now, what the heck are Mean Sea Level, ellipsoidal height, orthometric height, geoid height?

    Sources of accurate elevation data are difficult to find. Typically, you’re going to find elevation data from aerial photogrammetry projects, LiDAR missions or from GPS data collection projects. Since availability of this sort of data on the world-wide web isn’t as prevalent as 2D geospatial data, 3D geospatial data utilization isn’t main stream yet.

    There’s also the issue of the definition of elevation. Yes, just like there are differential horizontal datums, there are a variety of elevation datums. On legacy paper maps, elevations are typically displayed with respect to Mean Sea Level (MSL). MSL is an the elevation reference for local areas, but the Earth is not like a bathtub where gravity has an equal impact on the water in the bathtub that forms a smooth surface. MSL around the world varies tremendously. 2 meters MSL in New York is orders of magnitude different than 2 meters MSL in Hong Kong.

    MSL is a complicated subject in itself. Check out this web page on the National Geodetic Survey’s web site that provides definitions related to MSL. The Earth is not a perfect sphere and gravity influences vary by region. For centuries until recently, elevations were stated with respect to sea level because that was the most reliable and widely known reference.

    GPS has changed that. GPS uses an elevation model called the geoid which was intended to somewhat approximates MSL. There are a couple of good references that provide much more detail. They are worth reading. One is from ESRI written back in 2003 by Witold Fraczek. The other is from an NGS presentation given in 2007 by Daniel Roman.

    In fact, following are a couple of graphics from the NGS as well as one from Dr. Roman’s presentation that draws a clear picture of how GPS heights are related to MSL.

    H = Orthometric height (Mean Sea Level), h = Ellipsoidal height, N = Geoid height

    Note that the height determined by GPS is the ellipsoidal height, not Mean Sea level. The difference between the two can be tens of meters.

    Most GPS receivers have a rough model of the Geoid height built into it. However, it’s very rough and can be a few meters in error. To resolve this, significant efforts have been made in the two decades to create high resolution geoid models. Creating a high resolution geoid model (for a country) is a relatively large effort that requires very skilled people and specific equipment.

    Following is a similar graphic illustrating North American Datum of 1983 and GEOID03, which was the most recent geoid model of the United States (GEOID09 was just released).

     

    Finally, following is a graphic from Dr. Fraczek that depicts the relationship between the ellipsoid, MSL and the Earth’s surface. You can see here that at some points, the ellipsoid is actually above the geoid and at some points, it’s below the geoid.

     

     

    The purpose of this column is to point out that when you receive 3D geospatial data, you should inquire what about the elevation data is referenced to. Are they ellipsoidal elevations? Are they MSL elevations? If MSL, what was the resolution of the geoid model used?

    Flushing out horizontal datum inconsistencies in your GIS is, for the most part, pretty straight-forward. The 2D view is the norm and once you bring data into your GIS, you can compare the imported features to the existing features and identify fairly quickly if there’s a problem with the 2D data. The problem is that most GIS folks aren’t used to working in a 3D world. I speculate that most people figure that if the 2D data is reasonable, then the elevation (if it exists in the database at all) must be accurate. It would be interesting to hear from folks who are making a concerted effort in quality checking the heights used in their GIS.

    Even though GIS horizontal data is still far from perfect with respect to accuracy, at least I can see the road to success. The quality of horizontal data in the past ten years has improved significantly thanks to widespread availability of data collected via remote sensing and GPS data. I think that trend will continue as the widespread availability of accurate horizontal data continues to improve. The roadmap for 3D data isn’t so clear. Not only is there a lack of accurate 3D data, but also the models (eg. geoid model) for generating accurate 3D data continue to evolve.

    Applications for 3D data are expanding and are going to continue to expand. People, both inside the geospatial industry as well as the general public, still have a hard time visualizing 3D data. For example, a land development plan for a site can be communicated much more effectively if there’s a 3D visualization (either still image or animated video) that accompanies the engineering drawings. Following is a visualization of a particular golf course hole where the architect was trying to convey the design change to the golf course owner. The image on the top is the existing golf course. The image on the bottom is the proposed design. The data used to create the terrain model in these images was high quality 3D geospatial data.

     

    Thanks and see you next week.

  • TIGER Database Historical Perspective

    In my last two columns, I’ve made several references to geospatial data. Jon Sperling, Ph.D. GISP, wrote in and commented on the omission of the importance of TIGER data in the history of geospatial data development and commercialization. He made reference to a document he wrote that was published in 1995 regarding the development and maintenance of the TIGER database. I’ve decided to reprint his work, with his permission, as this week’s column. It gives keen insight into the early stages of TIGER. Albeit quite in-depth, it’s a fascinating read for gaining a historical perspective on geospatial data development.

    Keep in mind that this document was written in 1992 so there are references to initiatives, etc. that were subsequently developed and that you enjoy today.

    Sperling argues that our public investment in TIGER need not be just history but a pro-active means to leverage partnerships and new advances and innovations (e.g. synergistic links with national parcel data initiatives, local/state/federal data sharing and a national road network). Census still remains a pioneer in supporting and furthering geospatial science at all levels of our society for the betterment of our Nation’s communities.

    Dr. Sperling is currently a Senior Researcher of Geographic Information and Analysis at the Office of Policy, Development and Research for the U.S. Department of Housing and Urban Development. He has designed and led many innovative geospatial, addressing, and data integration efforts in coordination with local and state governments and the academic research community. Dr. Sperling was involved in the initial development of the Nation’s TIGER/Master Address File databases for the 1990 and 2000 Census, enabling digital spatial data sharing capabilities, and efforts to enhance its coordinate accuracy and data quality. Currently, he is working with university partners on a number of innovative research applications to enable sophisticated querying of unstructured text and tables using textual spatial references in the data.


    Jonathan Sperling, “Development and Maintenance of the TIGER Database: Experiences in Spatial Data Sharing at the U.S. Bureau of the Census,” in Harlan J. Onsrud and Gerard Rushton, eds., SHARING GEOGRAPHIC INFORMATION (New Brunswick, NJ: Center for Urban Policy Research). Copyright 1995 by Rutgers, The State University of New Jersey. Reprinted with permission.

     

    Development and Maintenance of the TIGER Database: Experiences in Spatial Data Sharing at the U.S. Bureau of the Census (1992)

    The U.S. Census Bureau has played, and will continue to play, a vital role in the development, maintenance, and sharing of spatial and attribute data for Geographic Information Systems (GISs) on the local, regional, national, and international levels. The Census Bureau’s development of shareable geographic data files, the GBF/DIME (Geographic Base File/Dual Independent Map Encoding) Files for the 1970 and 1980 censuses, and the TIGER (Topologically Integrated Geographic Encoding and Referencing) database for the 1990 census, have provided a major impetus to the rapid growth and diffusion of GIS technology. This chapter discusses the Census Bureau’s experiences in the spatial data sharing during these two file-building projects as well as from ongoing experiences in developing Memoranda of Understanding with federal and state agencies to update and improve the spatial and attribute data in TIGER. On the basis of these experiences, preliminary generalizations are made concerning the organizational issues that may facilitate or impede the future digital interchange of spatial data.

    Introduction

    The entrance and persistence of the nation’s lead statistical agency as a primary producer and user of both geographic and attribute data have grown out of practical but critical concerns (Tomasi 1990). The mandate to conduct a population and housing census of the entire nation and its territories every ten years, and then disseminate the resulting information  quickly and accurately, has led to the need to devise innovative ways of improving the collection, processing, and tabulation of data.  These improvements have aimed at four goals: increasing efficiency and timeliness, improving data quality, lowering costs, and providing new products for the data user.

    Over the past half century, the Census Bureau has pursued innovations and led the government and private sector in computerization, statistical sampling and interviewing techniques, data processing, quality control, and cartographic techniques, to name but a few (Anderson 1989).  The Census Bureau pioneered the first large-scale commercial user of the computer with UNIVAC 1 following the 1950 census and the development of the Film Optical Sensing Device for Input to Computers for the 1960 census.  The broad use of computer mapping by local governments and private firms was strongly influenced by the Census Bureau when it developed the GBF/DIME-Files for the 1970 and 1980 censuses and the TIGER System for the 1990 census.

    In addition to being one of the nation’s largest digital geographic databases — currently sixteen gigabytes — TIGER enables the automated spatial manipulation of census data for all legal and statistical entities in the United States and its territories.  Every state and local government now has the capability to create rudimentary GIS using small-area census data, publicly available extracts of the TIGER database – TIGER/Line  files – and the appropriate hardware and software.  More than 130 private vendors currently have the capability of processing TIGER/Line files, and the number is growing.

    In this respect, the development of the TIGER database may be the most important “data” file from the 1990 Census of Population and Housing.  The accessibility and widespread potential application of this innovation make it qualitatively different from earlier technological developments at the Census Bureau.  The TIGER database will be used not only internally to administer future censuses and surveys (see Marx 1986 for a basic rationale of the TIGER system) but externally to provide a major impetus to the development and sharing of integrated special information systems in the public and private sectors as well as the academic community.

    A common theme in the GIS literature, generated both within and outside the Census Bureau, has been the notion that although the Census Bureau developed its GBF/DIME-Files and TIGER database to meet internal Census Bureau needs, their existence has facilitated applications well beyond the scope of census-taking.  Perhaps analogous to the many commercial spin-offs that have been generated from wartime military innovations or the nation’s space program, the decennial census operation, conducted by the government’s largest non-military assemblage of people and resources, has been responsible for a number of innovations, some already outlined, that have had a major effect on the private sector and the academic research community.

    The Census Bureau’s recognition of these “consumer spin-offs” traditionally has never been well articulated or coordinated because most innovations have resulted from an internal production standpoint rather than a customer-oriented design.  Recent efforts toward building a national spatial data infrastructure and the Census Bureau’s adoption of the principles of Total Quality Management and Strategic Planning create the external and internal incentives, respectively, to stimulate a change in that design.  Paradoxically, however, the fiscal constraints of the 1990s and their impact on congressional funding, while often considered an incentive for data sharing, may also function as an impediment in the early developmental stages of new technologies.

    THE DEVELOPMENT OF SHAREABLE GEOGRAPHIC DATA FILES – GBF/DIME-File Development 1966-1982

    Many local agencies have been introduced to GIS by the Census Bureau through is various geographic base-building projects over the past twenty-five years.  The development of the Address Coding Guides (ACG) for the 1970 census and the GBF/DIME-Files for the final stages of the 1970 census and for large-scale use in the 1980 census were major steps toward full automation of the Census Bureau’s geographic support programs.  Althought the ACGs provided the building blocks for the later development of the GBF/DIME-Files, they lacked geographic coordinates and topological structure (Marx 1986).

    The design and development of these computer-readable files involved the active participation of federal, state, local, private and academic organizations.  The Census Use Study, a small-area data research group sponsored by the Census Bureau from 1966 to 1969, was instrumental in creating and diffusing knowledge about a system that represented map features numerically for processing by a computer to create a geographic base file (U.S. Bureau of the Census 1973).  The enhancement of this body of knowledge was of critical importance to the Census Bureau because it enabled the adoption of census-by-mail enumeration methodologies for the major urban centers of the United States.

    The change in enumeration procedures was a response to the increasing costs and difficulty of conducting a traditional door-to-door canvassing of the population, a growing and increasingly urban population living within areas having mail delivery by house-number/street-name address, and technical feasibility of linking such addresses with the geographic units used for data tabulations due to improvements in computer capabilities.  For the 1980 census, the GBF/DIME-Files were enhanced to cover 287 of the nation’s largest urban centers, representing more than 60 percent of the population but less than 2 percent of the nation’s land area (Carbaugh and Marx 1990).

    Data sharing during the 1970s.  The development of the ACGs and GBF/DIME-Files involved data sharing with more than 300 local planning agencies.  Sharing primarily took the form of converting analog data into a computer-readable format rather than digital exchange per se.  The long-term nature of these relationships provided the Census Bureau with intimate knowledge of the resources available to local agencies, the limitations of available data, and the willingess and ability of agencies to share data.  At the same time, local planning agencies and others, through their association with the Census Bureau, became increasingly aware of the potential computer mapping, automated address matching, and spatial data analysis to meet local needs (Sobel 1978).  These relationships played an important role in the later development of the TIGER system.

    Monetary and other incentives to the local agencies played a major role in the “data sharing” development of the ACGs and GBF/DIME-Files (Silver 1977).  In many cases, local agencies completed work under contract with the Census Bureau or with funding provided by the Department of Housing and Urban Development (HUD), and Federal Highway Administration (FHWA), and other federal, state, and local planning sources.  Sensitive to the fact that the development of the GBF/DIME-Files provided a practical solution to many needs of local governments, the Census Bureau offered to provide the computer programs, processing methodology, and clerical procedures for creating  and updating the file, as well as a free copy of the completed file, to each participating local agency.  The Census Bureau also offered to do the data keying and processing if agency resources were limited.  The promise of higher quality data when these files were used for taking and tabulating the 1970 and 1980 census provided a further incentive for data sharing.

    As a result of this process, the Census Bureau and local officials learned that the quality and currency of existing data used by planning agencies could not be taken for granted.  For example, many local communities relied on the knowledge of a few people for the location of dwellings, address-numbering systems often were not systematic, tax assessor sources sometimes did not meet the bureau’s quality standards, and data from different agencies often were inconsistent.  Because of these situations, the initial transition to address assignment via automated processes was difficult.

    The efforts of working with so many local agencies also challenged the Census Bureau.  During the file-building projects, there was turnover in project personnel and key decision makers in the local agencies as well as at the Census Bureau.  Some “champions” of the new methods were not reelected or moved on to other jobs.  The long-term benefits of building such files were not always apparent to public administrators.  Some administrators viewed the new computer technology as a threat to their role in the agency.  Also, during the 1970s, there was a general lack of understanding (personal computers had not yet been invented) and/or trust in the new computer technology and the ever-present fear of “big brother” mainframe computers held by a few large government agencies.  All these factors tended to hamper development activities and to result in considerable variation in the quality, time expended, and funds necessary to complete the GBF/DIME-Files.

    Data sharing during the 1980s.  Census Bureau funding of data collection and coding by local agencies to create the GBF/DIME-Files created precedents that would later affect the pre-1990 TIGER-building efforts.  The overall reductions in federal aid to cities and states during the 1980s provided an atmosphere of “less than cooperative initial attitudes” among many local agencies struggling with small staffs and fewer resources than they had in the 1970s.  Many agencies expected to be paid for their efforts.  These situations, and the perceived difficulties inherent in working with several hundred local agencies, contributed to the Census Bureau’s decision to do most of the map and address updates for the 1990 census in-house, albeit often based on materials supplied by local agencies.

    Changes in the original GBF/DIME-File format by local agencies exacerbated some early data-sharing efforts.  Many of the larger files that had been maintained by local agencies (e.g., New York City) had been adapted and enhanced to meet local planning and administrative needs.  Fiscal difficulties at the local level, time constraints on the Census Bureau, and the lack of additional programming staff in all agencies often precluded efforts to recreate the original file structure without losing the map and address updates.  In these cases, the feature and address range update work had to be redone manually by the Census Bureau from digital plots and databases.

    One of the general weaknesses of the GBF/DIME-Files was that they were limited geographically and, therefore, the ability to use the files on an ongoing basis was limited to large-scale users with on-line access to mainframe computers.  Relatively few of the original GBF/DIME-Files given to local governments were updated and maintained during the 1980’s. Even fewer of the locally updated files met Census Bureau standards for direct incorporation into the TIGER database. In practice, however, these updated GBF/DIME-Files provided the Census Bureau with its first challenge in the digital spatial data exchange at the local level.

    Digitized GBF/DIME-Files, attribute-rich but of mediocre
    coordinate accuracy, formed the cartographic base for 345 urban centers in the 1990 TIGER database. Absolute coordinate accuracy was not a primary concern in the development of the GBF/DIME-Files, as they were used primarily by the Census Bureau for geocoding rather than mapping purposes. Although their coordinate accuracy was well below that of the U.S. Geological Survey’s (USGS’s) Digital Line Graph (DLG) files derived from their 1:100,000-scale maps, these files represented features with all their respective feature names, address ranges, and 1980 geographic area codes in their correct relative location – sufficiently accurate for taking a census (Sobel 1986).

    Although the Census Bureau would have liked to provide an enhanced cartographic quality, the deadline pressures of an upcoming decennial census forced management to abandon initial plans to align this information to the USGS’s DLG files. Also, because of staff and time constraints, the files were sent to four private-sector contractors for digitizing of feature updates using Census Bureau-supplied updated 1980 census maps. The results from these arrangements were of mixed quality.

    TIGER Database Development, 1983-1990

    The institutional knowledge and experience gained from the development of the GBF/DIME-Files, further theoretical and conceptual advances in the field of mathematics (Corbett 1979 and White 1984), and the “enabling” availability of new and affordable technology, provided fertile ground for the Census Bureau’s next challenge: the development of the TIGER database for use in the 1990 census. Whereas the GBF/DIME-Files covered small noncontiguous portions of the United States and were developed initially without spatial or geographic references in their design, the TIGER database covered the entire nation and its territories and was grounded in a more rigorous conceptual model of topology and space (see Boundriault 1987; Kinnear 1987; Broome and Meixler 1990).

    Census Bureau/USGS cooperative agreement. In addition to data-sharing arrangements on the local level, the ability to complete the building of the TIGER database in time for the 1990 census was directly dependent on a landmark 1983 data-sharing agreement with the USGS. The USGS provided the Census Bureau with computer files of scanned versions of its 1:100,000-scale maps for the lower forty-eight states. In return, the Census Bureau assigned cartographic classification codes to the roads in these files. The resultant product formed the cartographic base for all areas outside the large urban centers covered by the GBF/DIME-Files, thereby enabling the Census Bureau to complete a coast-to-coast digital map base in time for the 1990 census (McKenzie and LaMacchia 1987).

    Interagency cooperation with the USGS was an experience that provided valuable lessons to the Census Bureau in particular, and a model for future cooperation between federal agencies in general. Following a successful Florida pilot project, high-level management in both agencies perceived that cooperation would result in a win-win situation. Each agency would be able to accelerate its individual map production programs and, in the process, they could develop the first “large-scale” digital map file of the United States. Early negotiations ensured that neither agency would feel it was bearing an unfair burden. A schedule of meetings on a regular basis ensured communications during all phases of the cooperative agreement. The challenge of meeting the Census Bureau’s decennial deadlines provided an added measure of incentive to “get the job done” and an ongoing requirement to measure progress.

    The U.S. Census Bureau/USGS cooperative agreement minimized duplication of effort in federal map automation activities and provided immediate short-term benefits to both agencies. The success of this cooperative venture demonstrated to the Department of Commerce, the Federal Office of Management and Budget (OMB), and the Congress that there were significant benefits to be derived from such activities. The success of this cooperative effort also resonated in the later development of the Federal Geographic Data Committee (FGDC) (OMB 1990), the growing impetus for further cooperative efforts in spatial data sharing, and the increasingly articulated vision of a national spatial data infrastructure (Marx 1992).

    Other success factors. The success of the TIGER System and the ability of the Census Bureau to overcome organizational inertia both within and outside the Census Bureau were due to a number of factors. There was a shared sense both within the Census Bureau and by the data-user public that change was needed in the geographic support process. A primary incentive for developing the TIGER database was the large number of inconsistencies between the statistical and geographic data products in the 1980 and earlier censuses, a product of the complex and clerically intensive preparation of maps, ACGs and GBF/DIME-Files, and geographic reference files (Marx 1986).

    The enormous political and economic ramifications of the decennial census made everyone a stakeholder in the process. The growing importance of the information sector of the economy and growing public demands for more accurate, cost-efficient, timely, and accessible data products helped to promote an environment receptive to the exchange of data, expertise, and experience with other governmental agencies at all levels, the private sector, and the academic community.

    Bureaucratic inertia was further overcome by staff commitment, expertise, and initiative with the Census Bureau’s Geography Division. The transition from traditional to automated mapping for the 1990 census required changes in the organization and planning of the Census Bureau’s mapping activities as well as in the requirements of staff in developing and incorporating new cartographic techniques and computer skills. Motivated by a decennial environment of schedules and fixed deadlines with no alternative to full automation for product delivery and a cooperative agreement with the USGS, the Census Bureau’s staff produced significant results in a relatively short time (Trainor 1990). The resources and skills gained from this experience, combined with a history of successful technical innovations at the Census Bureau, promoted a willingness to reach out and explore the potential for digital data sharing.

    Some criticisms. Countering these positive aspects are criticisms of the TIGER database, such as the relative poor coordinate accuracy of the roads in the major urban centers (the roads that came from the GBF/DIME-files used in lieu of USGS DLG file), lack of address range and ZIP Code improvements or expansion beyond the 345 GBF/DIME-File areas, and inconsistencies in the names and classifications of streets. Public complaints about the quality of the data in the TIGER database provided the Census Bureau and its parent agency, the Department of Commerce, with further verification of the wide applicability and importance of this database beyond the internal needs of the Census Bureau.

    Although valid, most of these situations were not critical for the taking of the 1990 census, the primary mission of the Census Bureau. In fact, many private consulting firms have taken advantage of these “problems” to repackage “new and improved” versions of the bureau’s publicly available extracts from the TIGER database: the TIGER/Line files. The Census Bureau is correcting many of these situations and will release future TIGER extract products with these updates and corrections.

    In general, the TIGER database continues the GBD/DIME-File tradition of being attribute-rich and current but with limited coordinate accuracy in the major urban centers. The USGS’s DLG files, on the other hand, have high “ground truth” accuracy for the features they show but have few attributes (DLG-Enhanced Files, once released, will improve on the latter). In addition, the DLG Files do not contain current information and the USGS has not been provided with the financial resources need to perform frequent, nationwide, and systematic updates. As stated previously, alternative methodologies for updating the files are being evaluated, including an initiative to accelerate the collection of base cartographic data using graphic or digital orthophoto quadrangles or aerial photography (FGDC 1992).

    ENHANCING THE TIGER DATABASE

    Since the completion of the TIGER database for the 1990 census, the Census Bureau has become increasingly aware of its vast potential as well as its current weaknesses. If viewed as a process rather than a product, the TIGER database provides an opportunity to improve statistical accuracy and data quality significantly. The updated address and geographic information systems in a growing number of public and private agencies and the databases of the U.S Postal Service (USPS) provide important means of enhancing the collection, processing, and tabulation of census data.

    The planned release of a new TIGER/Line extract by the Census Bureau containing extended address ranges and ZIP Codes for all areas with city-style mail delivery is significant, not only for the inherent value of the added data (there will be an increase in address range coverage from approximately 55 percent to 85 percent of all housing units in the United States), but because it represents new capabilities that never existed before. For the first time, the Census Bureau will be releasing value-added files for data users after the census that are not directly tied to the decennial statistical data products (however, they will be used for geocoding establishments in the intervening economic census). This precedent also is important in that it reflects a subtle change in the traditional once-a-decade data dissemination paradigm and opens new possibilities for future digital data exchanges. Perhaps, it also is indicative of the more customer-oriented approach mentioned earlier in this chapter.

    Because the Census Bureau was not intended to be the nation’s preeminent mapping agency, any data-sharing agreements to improve the TIGER database must be perceived by the overall organization as primarily benefitting the census-taking process. At this stage of planning for the 1992 and 1997 economic censuses and the 2000 decennial census, the Census Bureau has made an organizational commitment to the integration of the Census Bureau’s related Address Control File with the intercensal update and improvement of the TIGER database. This commitment, however, is dependent on the availability of resources.

    In the absence of a national updated map system, the U.S. Census Bureau has pursued a relatively high-cost mapping compilation strategy for the 1990 and previous censuses (Rhind 1991). Labor-intensive comparisons of reference sources, often of varying scale and quality, have been the primary means for updating census maps. In this respect, the Census Bureau has pursued and maintained close contacts and relationships with other federal agencies, state, regional, and local planning and transportation agencies, engineering firms, aerial survey companies, tax departments, utility firms, and a host of other public and private firms with current map and address reference source materials.

    The development of the TIGER System and other digital spatial and attribute databases, coupled with the proliferation of more powerful computer hardware and GIS software, allows data exchange to occur in a more sophisticated, more timely, and potentially more accurate and less costly manner. From a technical perspective, there appear to be few limits to the advance of this new mode of digital data exchange. According to Cooke (1995), the technical problems of data sharing have mostly been, or are in the process of being, solved. The non-technical components of data interchange may prove more daunting.

    Current Data-Sharing Plans

    Current data-sharing plans at the Census Bureau are twofold. On the one hand, the Census Bureau is investigating the possibility of national sources, predominantly federal agencies, that could provide the information to keep the feature and address-range information in the TIGER database up to date. Based on a report by the Government Accounting Office (GAO 1991), which stated that federal agencies increased their planned expenditures on GIS by about 60 percent between fiscal years 1990 and 1992, the likelihood of such exchanges is promising.

    At the national level, the Census Bureau has entered into or proposed data-sharing agreements with several large governmental or quasi-governmental agencies including the USGS, USPS, the Environmental Protection Agency (EPA), the Soil Conservation Service, the Federal Railroad Administration, the U.S. Army Corps. of Engineers, and the Federal Emergency Management Agency (FEMA). These efforts to improve the spatial and attribute data in the TIGER database are ongoing and likely will include several other agencies by mid-decade. As the coordination of GIS activities improves in the federal sector, bilateral agreements between agencies to improve the TIGER database will likely evolve into multi-agency agreements, as has already occurred at the state and local levels (see Murakami and Greenleaf 1992).

    The Census Bureau also is investigating possible mechanisms for the electronic interchange of updated geographic information with state, local, private, and academic organizations. The current preference of the Census Bureau is to coordinate these efforts and control quality at the state level rather than having to deal with conflicting data and different file formats from several thousand local governments, private agencies, universities, and other sources. Experience indicates, however, that this will not always be possible.

    Role of the FGDC in Data Sharing

    Increased GIS use by federal agencies as well as state and local agencies has led to renewed efforts to coordinate development, sharing, and dissemination of spatial data, primarily through the Federal Geographic Data Committee (FGDC). The FGDC, formed in late 1990 at the direction of the OMB in its Circular A-16 (OMB 1990), includes representatives of fourteen departments and independent agencies but has no direct authority, responsibility, or resources. Participation is voluntary, with decisions based on consensual agreement among its members. Given the current limits on availability of fiscal resources, several agencies have agreed to pursue cooperation in accelerating the 1:12,000-scale digital orthophoto program and the 1:24,000-scale digital quadrangle program. The Census Bureau is working on individual Memoranda of Understanding (MOU) with other federal agencies in the spirit of the FGDC and has agreed to cooperate with the USGS in devising a still more powerful data structure.

    In the long run, the FGDC may provide the vehicle necessary to transform institutional relationships within the federal government as well as with state and local governments, the private sector, and the academic community. The FGDC is making an increased effort to involve the non-federal community in its coordination work. Although the goals are lofty and the potential long-term benefits extraordinary, effective leadership at the highest levels and a concomitant commitment to the development of compatible standards will be necessary. In the absence of these developments, only significant short-term and real cost-saving benefits gained from data sharing by particular agencies will move the process forward toward a national digital spatial database.

    Significantly, in July 1992 the Census Bureau and the USGS signed an amendment to the original 1981 Memorandum of Understanding that commits both agencies to merging the current DLG and TIGER databases, including information resulting from partnerships with other agencies. The development of a shared database that combines the essential geographic information needed by both agencies to carry out their respective institutional mandates will have a profound effect in further stimulating the development of a national spatial data infrastructure in the United States.

    Intercensal Data-Sharing Projects, Post-1990

    Current data-sharing experiences at the Census Bureau have been exploratory, and the actual mechanisms and standards for digital data interchange are still in the process of being developed.  The Census Bureau is engaged in a number of activities to help promote digital data sharing during the 1990s.  These activities include conferences, participation in the FGDC initiative, MOUs with other federal, state, and local agencies, pilot projects, and the planning and imminent release of the prototype version of the TIGER/SDTS (Spatial Data Transfer Standard) file.

    Based on these forums and activities, the Census Bureau has begun to receive initial feedback on the non-technical impediments to and incentives for digital spatial data interchange.  The Census Bureau has provided its digital geographic and statistical data sets to the public at the cost of dissemination and as a public resource (OMB 1992).  Other public and private agencies, however, have different perspectives and regulations pertaining to their own data sets and on the updates they may perform.  Profit is a major concern of private companies, and public agencies also are looking to their products as a source of revenue.

    States and local agencies consistently have reported one or more of the following problems in trying to coordinate a GIS:

    •    Agencies wanting proprietary control of internal data

    •    Lack of resources in one department or institution affecting data requirements of another

    •    Archaic systems

    •    Managers and commissioner-level officials who know, or care, little about GIS

    •    Staff turnover

    •    Lack of commitment

    Once a state or local GIS is operational, the lack of overt incentives to expend the additional time and expense required to feed local updates into the TIGER/Line or SDTS formats for interchange become more apparent.  Ensuring the currency, accuracy, and quality of the TIGER database is an integral part of ensuring the accuracy and quality of the associated census data, which are used for reapportionment, redistricting, the distribution of federal funds, and innumerable planning and development programs – but this may be a long-term and nebulous consideration for some agencies.  The lack of a clearly defined formal process for data interchange by the Census Bureau could provide an even greater impediment to institutions willing to share data.

    The existence of multiple geographic data file formats also have inhibited the interchange of data.  Future geographic data files at the Census Bureau will be released in accordance with a recent Federal Information Processing Standard (FIPS) – the SDTS.  Adoption of the SDTS involved cooperation of federal, state, and local officials, the academy community, and private sector over an extended period of time.  The Census Bureau was an active participant in the development of this new federal standard for data exchange and, to that end, released its first prototype TIGER/SDTS file (Davis et al. 1992).  However, the effect of this rather complex format on data interchange is not yet clear.

    Each data-sharing agreement brings forth new possibilities, new arrangements between agencies, and the potential for new products.  Working with a variety of agencies and soliciting recommendations for improving TIGER, Census Bureau staff have noted similarities among the interests of many agencies.  For instance, conversations with the USPS and the U.S. Department of Transportation have revealed similar interests in enhancing attributes for streets (e.g., turn and directional restrictions).  In many cases, the Census Bureau would be able to use the enhancements needed by other agencies for improving the quality and cost-effectiveness of its own internal operations (e.g., routing of enumerators).

    Proposed Census/USPS cooperative program.  The USPS and the Census Bureau have been working together for more than thirty years in the delivery and return of questionnaires for the decennial, agriculture, and the economic censuses and surveys.  The increasing use of mail-out/mail-back procedures throughout the nation has made the USPS an indispensable partner in the Census Bureau’s data-collection activities.  The Census Bureau traditionally has paid the USPS to verify the completeness and accuracy of its decennial census address list, which was purchased from private vendors and enhanced through in-house programs, prior to the mailout of questionnaires.  The Census Bureau also worked with the USPS on the development of computer algorithms to match the ZIP+4 files to GBF/DIME-File records during the 1980s.  ZIP+4 files, also known as the Address Management System (AMS) Files, contain potential address ranges for all areas where the USPS delivers mail.

    In an effort to enhance this cooperation to the benefit of both agencies, the Census Bureau proposed a formal Memorandum of Understanding (MOU) with the USPS similar to the one it had with USGS in the 1980s. In 1990, the Census Bureau and the USPS, with the participation of the USGS, began cooperating on a pilot project to provide a better geographic database for all three agencies.  If signed, this MOU will have broad implications, not only for these three agencies, but for the GIS community as a whole during the 1990s.

    The Census Bureau’s proposal envisions a four-year file update/enhancement plan (fiscal years 1994-1997) and subsequent ongoing cooperative efforts to update a Post/TIGER database.  As with the earlier USGS agreement, the USPS/Census Bureau MOU would formalize a seemingly well-suited alliance between two agencies.  Cooperative database-building activities could reduce duplication of efforts, thereby reducing overall costs, as well as improve the geographic and attribute accuracy of the information available to each agency.

    The Census Bureau’s objectives for the proposed joint program during the intercensal years are (1) to obtain updated information on the location of streets with their names, address ranges, and ZIP codes as well as the location of group quarters, office building or other locations of economic activity, and (2) to improve the error-prone decennial census address lists development operations.  According to a recent GAO report (1992), enumerator follow-up to vacant and nonexistent units alone resulted in an added expenditure of approximately $317 million to the 1990 census operation.   Accomplishing the above objectives will enable the Census Bureau to reduce its critical dependence on a large, temporary clerical workforce before and during each decennial census, thereby reducing costs, and to improve the overall quality and consistency of decennial census data products.

    In the overall proposal, the joint venture envisions updating the TIGER database to permit automated analysis of carrier routes and the production of carrier route maps for use by the USPS.  In the pilot study, the effort to improve and maintain the positional accuracy and completeness of the TIGER database involved the use of Global Positioning System (GPS) technology with receivers mounted on USPS-supplied vehicles.  In order to improve and update the geocoding capability of the TIGER system, the pilot study also updated the address ranges in the TIGER database and added ZIP+4 Codes and other USPS information.  The joint venture envisioned would extend similar geocoding improvements to all parts of the United States with city-style address systems.  The Census Bureau would provide the USPS with its technical and geographic expertise.

    Potential impediments.  Although this data-sharing scenario appears to satisfy the needs of both agencies, there are a number of organizational, behavioral, and institutional impediments that will need to be overcome.  Some of these impediments are specific to these two agencies while other are generic to any data-sharing milieu.  One specific impediment is that the USPS is not part of the FGDC.  The USPS also is a quasi-federal agency, which means that it must justify its participation in an agreement on a benefit/cost basis including expected revenue or savings from potential products.  There also are questions about what information will be shared and what will be the property of each organization.

    A more general institutional impediment is the difficult of building a single database to serve the needs of different agencies.  Federal agencies have worked independently of each other for a long period of time collecting and structuring their data according to geographic units based on their own unique criteria and naming conventions.  Data sharing presupposes a strong, long-term, funded commitment to reconcile what really constitutes different versions of the same reality.  In order to reach agreement, a shared database initially may involve compromises that could make it less effective than two separate databases.  Each agency has its own self-interest and mission that must be met first.  Short-term objectives may become more pronounced and inhibit progress.  The need to understand each other’s terminology, organizational structure, and needs represents another potential impediment.

    Another important issue, not only for the Census Bureau but for all potential data partners, is the potentially differing perception of the benefits from data sharing.  One agency may believe that it is giving up more than the other agency and is bearing an unfair burden, causing a negative effect on data-sharing plans.  Some groups within each agency may either be opposed to or less than committed to the idea of data sharing.  While this issue was always in the background during the joint Census Bureau/USGS Cooperative Mapping Project, it never came to the forefront as a major issue.

    Another potentially critical impediment common to bureaucracies is the inertia effect; that is, it is easier to stick with the tried and reasonably true methods than to try something new.  Change can be intimidating and can upset a delicate balance of power in an organization.

    OTHER ONGOING DATA-SHARING ACTIVITIES

    In a broad sense, the Census Bureau traditionally has been in the business of sharing geographic data and has a long history of partnerships with state and local governments.  The Census Bureau receives updated governmental unit boundaries from local officials on a periodic basis through its Boundary and Annexation Survey.  The Census Bureau also works with local Census Statistical Area Committees, composed of representatives of the public, private, and academic communities, in delineating the boundaries of census statistical areas such as census tracts, block groups, and census-designated places.

    The Census Bureau also has worked closely with state election officials to provide the information they need for redistricting and reapportionment; with the U.S. Department of Education to incorporate the boundaries of school districts as a means to produce data for school districts; and with metropolitan planning organizations to improve the quality of the Census Bureau’s address reference files, which improves the quality of the usefulness of the census journey-to-work and place-of-work data, and to define traffic analysis zones in terms of census blocks to facilitate the tabulation of decennial census data for those areas.  The Census Bureau has built its massive geographic database primarily from source materials acquired from state and local agencies.  In a new program based on 1990 census data, data users may independently aggregate census blocks to define their own statistical areas and receive maps and data profiles of these user-defined areas from the Census Bureau on a cost-reimbursable basis.

    In addition, the Census Bureau maintains and is constantly improving its huge, ongoing institutional data-sharing apparatus.  Each of the Census Bureau’s twelve regional offices supports information services and geographic programs that coordinate activities with state data centers and their affiliates throughout the United States.  The Census Bureau’s Data User Services Division combines educational functions (e.g., ongoing workshops on TIGER, census maps, and data products) with its data distribution functions.  The Census Bureau participates in the kindergarten through twelfth grade (K-12) geographic literacy campaign in the United States a means to address the important issues of access to new information technologies.  Public access issues (Emergency Planning and Community Right-to-Know Act 1986) also have provided the impetus for an ongoing cooperative project with the EPA and the National Oceanic and Atmospheric Administration (NOAA) that supports the development of public domain software that links and displays environmental, socioeconomic, and demographic data using an extract of the TIGER database.

    From a more global perspective, the Census Bureau shares its technological and statistical know-how with a number of participating countries through its in-house and overseas training programs.  The Census Bureau also is cooperating with Statistics Canada (Haythornthwaite 1992) and pursuing talks with Mexico’s statistical agency, Instituto Nacional de Estadistica, Geografia e Informatica, to create a North American Common Borders Database.  In summary, the Census Bureau maintains a huge ongoing apparatus that supports and complements its current efforts at spatial data interchange.

    CONCLUSIONS

    As the Census Bureau prepares for the 2000 census, change, once again, appears imminent.  According to a recent GAO report (1992,4), “the current approach to taking the census appears to have exhausted its potential for counting the population cost-effectively.”  Similar statements were made in reference to the Census Bureau’s geographic support program following the 1980 census (Tomasi 1990).  These statements have recurred with periodic frequency in the recent history of census-taking.  In large part, the innovations that followed were a response to intense public scrutiny, a result of a process that has significant and far-reaching political (reapportionment and redistricting) and economic (distribution of government funds) consequences.  Over the past fifty years, the Census Bureau has met not only the challenges brought about by massive social, demographic, and economic change, but has provided effective leadership in applying and diffusing a number of new technologies to both the governmental and non-governmental sectors.

    The application and use of new technologies on a vast scale by public organizations are reshaping the internal organization and relationships within the public sector as well as among the public and private sectors and the academic and research communities.  The dominant values of narrow, functionally separate governmental agencies and departments are increasingly being replaced by a system of greater complexity and interrelatedness and a growing trend toward public and private sector cooperation.  The Census Bureau’s TIGER system is an important example of this phenomenon and may be a primary motivator for such changes.

    The Census Bureau’s geographic and statistical products – inexpensive and ubiquitous spatial and attribute raw material – have helped transform GIS from a highly technical field dominated by large agencies, private firms, and universities, to one that is becoming increasingly accessible to many data users.  Advances in our technological infrastructure thus far have been the primary factor enabling institutions to develop and share digital geographic data.  Personal computers, workstations, CD-ROMS, and databases available for automation were relatively nonexistent until the 1980s.  The incipient use of new telecommunication technologies, such as the Internet, may have an equally profound effect on data sharing and online services in the latter part of this decade.

    As computer hardware and software continue to become less expensive and more powerful, the vast potential of GIS will depend increasingly on the behavioral, organization, and institutional issues acting as impediments and incentives to the sharing of geographic data.  The integration of various multi-media technologies and the growing capability to link a wide variety of public and private databases also raise a number of privacy issues.  Deriving the full benefits of GIS and related information systems will depend, to a significant degree, on how society approaches and resolves these issues (Onsrud 1992).

    Based on past experience, it will require extraordinary leadership, communication and flexibility among agencies to facilitate the process of spatial data interchange.  Data sharing will be most successful when such ventures can be justified by short-term results, verified cost reductions, improved operations, and minimal problems of data ownership.  Data sharing will be enhanced to the degree that these ventures can be built around joint development projects such as the Census Bureau/USPS and the Census Bureau/USGS experiences in the 1990s.  Awareness of the need for such coordination is growing and is best reflected in the increasing number of statewide GIS committees and partnerships between the federal and state communities.

    Geographic databases have been built thus far to support the mandates of single institutions or parts of an institution.  All who collect and manage data for activities related to their own responsibilities will need to understand and appreciate the value of those data to others and to collect and structure their data accordingly.  In order to take full advantage of the opportunities offered to these new technologies, business, government, and academic will need to develop, support, and fund data exchange on a systematic and ongoing basis as well as promote accessibility of GIS capabilities to all sectors of our society.

     

    REFERENCES

    Anderson, M.J. 1988. The American census: a social history. New Haven: Yale University Press.

    Boudriault, G. 1987. Topology in the TIGER file. Eighth International Symposium on Computer-Assisted Cartography, Proceedings. Baltimore, Maryland, 258-263.

    Broome, F.R. and D.B. Meixler. 1990. The TIGER database structure. Cartography and Geographic Information Systems 17, 1:39-47.

    Carbaugh, L.W. and R.W. Marx. 1990. The TIGER system: a Census Bureau innovation serving data analysts. Government Information Quarterly 7, 3:285-306.

    Cooke, D. 1995. Sharing street centerline spatial databases. In H.J. Onsrud and G. Rushton, eds., Sharing Geographic Information. New Brunswick, NJ: Center for Urban Policy Research, Rutgers University.

    Corbett, J.P. 1979. Topological principles in cartography. Technical Paper 48. U.S. Bureau of Census, Washington, D.C.

    Davis, B.A., J.R. George, and R. W. Marx. 1992. TIGER/SDTS: standardizing an innovation. Cartography and Geographic Information Systems 19, 5:321-327.

    Emergency Planning and Community Right-To-Know Act: Title 3 of the Superfund Amendments and Reauthorization Act of 1986. PL 99-499, 17 October 1986. United States Statuates at Large 100. pp. 1728-1758.

    Federal Geographic Data Committee. 1992. Multi-agency initiative to meet high priority requirements for base cartographic data. FGDC Subcommittee of Base Cartographic Data. Washington, D.C.

    Government Accounting Office. 1991. Geographic Information Systems: information on federal use and coordination. IMTEC 91-72-FS. Washington, D.C.

    ________. 1992. Decennial Census: 1990 results show need for fundamental reform. GAO/GGD-92-94. Washington, D.C.

    Haythornwaite, T. 1992. Development of the United States-Canada Common Border Database. The Operational Geographer 10, 1:28-30.

    Kinnear, C. 1987. The TIGER Structure. Eighth International Symposium on Computer-Assisted Cartography, Proceedings. Baltimore, Maryland, 249-257.

    Marx, R.W. 1986. The TIGER System: automating the geographic structure of the United States census. Government Publications Review 13, 181-201.

    ________. 1992. Building the National Spatial Data Infrastructure: the data integrity challenge. Paper presented at the Regional Surveying Engineering Conference, Hartford, Connecticut.

    McKenzie, B.Y., and R.A. LaMacchia. 1987. The U.S. Geological Survey-U.S. Bureau of Census Cooperative Digital Mapping Project: a unique success story. Paper presented at American Congress on Surveying and Mapping meeting, Reno, Nevada. Fall.

    Murakami, E., and K. Greenleaf. 1992. Multi-agency TIGER file updating. URISA Proceedings 2:25-35.

    Office of Management and Budget. 1990. Coordination of surveying, mapping, and related spatial data activities. OMB Circular A-16 (Revised). Washington, D.C.

    _______. 1992. Management of federal information resources. OMB Circular A-130. Washington, D.C.

    Onsrud, H. 1992. Privacy and spatial databases. Technical Program Abstracts., 27th International Geographical Congress, Washington, D.C., 480-481.

    Rhind, D.W. 1991. Counting the people: the role of GIS. In D.J. Maguire, M.F. Goodchild, and D.W. Rhind, eds., Geographic information systems: principles and applications. Longman Scientific and Technical, Essex, 2:127-137.

    Silver, J. 1977. The GBF/DIME system: development, design and use. Paper presented at 1977 Joint Annual Meeting of the American Society of Photogrammetry and the American Congress on Surveying and Mapping. U.S. Government Printing Office, 1977-240-869/1102.

    Sobel, J. 1978. GBF/DIME system – development and reference source problems. Applied Geography Conference, SUNY – University Center at Binghamton, 1:112:121.

    _______. 1986. Principal components of the Census Bureau’s TIGER file. Research in contemporary and applied geography: a discussion series. SUNY at Binghamton, 10, 3:1-17.

    Tomasi, S.G. 1990. Why the nation needs a TIGER system. Cartography and Geographic Information Systems 17, 1:21-26.

    Trainor, T.F. 1990. Fully automated cartography: a major transition at the Census Bureau. Cartography and Geographic Information Systems 17, 1:27-28.

    U.S. Bureau of Census. 1973. Census Use Study. International DIME Colloquium. Confernce Proceedings, Washington, D.C., August 27-29, 1972.

    White, M. 1984. Technical requirements and standards for a multipurpose geographic data system. The American Cartographer 11, 1:15-26.

  • TIGER Database Historical Perspective

    In my last two columns, I’ve made several references to geospatial data. Jon Sperling, Ph.D. GISP, wrote in and commented on the omission of the importance of TIGER data in the history of geospatial data development and commercialization. He made reference to a document he wrote that was published in 1995 regarding the development and maintenance of the TIGER database. I’ve decided to reprint his work, with his permission, as this week’s column. It gives keen insight into the early stages of TIGER. Albeit quite in-depth, it’s a fascinating read for gaining a historical perspective on geospatial data development.

    Keep in mind that this document was written in 1992 so there are references to initiatives, etc. that were subsequently developed and that you enjoy today.

    Sperling argues that our public investment in TIGER need not be just history but a pro-active means to leverage partnerships and new advances and innovations (e.g. synergistic links with national parcel data initiatives, local/state/federal data sharing and a national road network). Census still remains a pioneer in supporting and furthering geospatial science at all levels of our society for the betterment of our Nation’s communities.

    Dr. Sperling is currently a Senior Researcher of Geographic Information and Analysis at the Office of Policy, Development and Research for the U.S. Department of Housing and Urban Development. He has designed and led many innovative geospatial, addressing, and data integration efforts in coordination with local and state governments and the academic research community. Dr. Sperling was involved in the initial development of the Nation’s TIGER/Master Address File databases for the 1990 and 2000 Census, enabling digital spatial data sharing capabilities, and efforts to enhance its coordinate accuracy and data quality. Currently, he is working with university partners on a number of innovative research applications to enable sophisticated querying of unstructured text and tables using textual spatial references in the data.


    Jonathan Sperling, “Development and Maintenance of the TIGER Database: Experiences in Spatial Data Sharing at the U.S. Bureau of the Census,” in Harlan J. Onsrud and Gerard Rushton, eds., SHARING GEOGRAPHIC INFORMATION (New Brunswick, NJ: Center for Urban Policy Research). Copyright 1995 by Rutgers, The State University of New Jersey. Reprinted with permission.

     

    Development and Maintenance of the TIGER Database: Experiences in Spatial Data Sharing at the U.S. Bureau of the Census (1992)

    The U.S. Census Bureau has played, and will continue to play, a vital role in the development, maintenance, and sharing of spatial and attribute data for Geographic Information Systems (GISs) on the local, regional, national, and international levels. The Census Bureau’s development of shareable geographic data files, the GBF/DIME (Geographic Base File/Dual Independent Map Encoding) Files for the 1970 and 1980 censuses, and the TIGER (Topologically Integrated Geographic Encoding and Referencing) database for the 1990 census, have provided a major impetus to the rapid growth and diffusion of GIS technology. This chapter discusses the Census Bureau’s experiences in the spatial data sharing during these two file-building projects as well as from ongoing experiences in developing Memoranda of Understanding with federal and state agencies to update and improve the spatial and attribute data in TIGER. On the basis of these experiences, preliminary generalizations are made concerning the organizational issues that may facilitate or impede the future digital interchange of spatial data.

    Introduction

    The entrance and persistence of the nation’s lead statistical agency as a primary producer and user of both geographic and attribute data have grown out of practical but critical concerns (Tomasi 1990). The mandate to conduct a population and housing census of the entire nation and its territories every ten years, and then disseminate the resulting information  quickly and accurately, has led to the need to devise innovative ways of improving the collection, processing, and tabulation of data.  These improvements have aimed at four goals: increasing efficiency and timeliness, improving data quality, lowering costs, and providing new products for the data user.

    Over the past half century, the Census Bureau has pursued innovations and led the government and private sector in computerization, statistical sampling and interviewing techniques, data processing, quality control, and cartographic techniques, to name but a few (Anderson 1989).  The Census Bureau pioneered the first large-scale commercial user of the computer with UNIVAC 1 following the 1950 census and the development of the Film Optical Sensing Device for Input to Computers for the 1960 census.  The broad use of computer mapping by local governments and private firms was strongly influenced by the Census Bureau when it developed the GBF/DIME-Files for the 1970 and 1980 censuses and the TIGER System for the 1990 census.

    In addition to being one of the nation’s largest digital geographic databases — currently sixteen gigabytes — TIGER enables the automated spatial manipulation of census data for all legal and statistical entities in the United States and its territories.  Every state and local government now has the capability to create rudimentary GIS using small-area census data, publicly available extracts of the TIGER database – TIGER/Line  files – and the appropriate hardware and software.  More than 130 private vendors currently have the capability of processing TIGER/Line files, and the number is growing.

    In this respect, the development of the TIGER database may be the most important “data” file from the 1990 Census of Population and Housing.  The accessibility and widespread potential application of this innovation make it qualitatively different from earlier technological developments at the Census Bureau.  The TIGER database will be used not only internally to administer future censuses and surveys (see Marx 1986 for a basic rationale of the TIGER system) but externally to provide a major impetus to the development and sharing of integrated special information systems in the public and private sectors as well as the academic community.

    A common theme in the GIS literature, generated both within and outside the Census Bureau, has been the notion that although the Census Bureau developed its GBF/DIME-Files and TIGER database to meet internal Census Bureau needs, their existence has facilitated applications well beyond the scope of census-taking.  Perhaps analogous to the many commercial spin-offs that have been generated from wartime military innovations or the nation’s space program, the decennial census operation, conducted by the government’s largest non-military assemblage of people and resources, has been responsible for a number of innovations, some already outlined, that have had a major effect on the private sector and the academic research community.

    The Census Bureau’s recognition of these “consumer spin-offs” traditionally has never been well articulated or coordinated because most innovations have resulted from an internal production standpoint rather than a customer-oriented design.  Recent efforts toward building a national spatial data infrastructure and the Census Bureau’s adoption of the principles of Total Quality Management and Strategic Planning create the external and internal incentives, respectively, to stimulate a change in that design.  Paradoxically, however, the fiscal constraints of the 1990s and their impact on congressional funding, while often considered an incentive for data sharing, may also function as an impediment in the early developmental stages of new technologies.

    THE DEVELOPMENT OF SHAREABLE GEOGRAPHIC DATA FILES – GBF/DIME-File Development 1966-1982

    Many local agencies have been introduced to GIS by the Census Bureau through is various geographic base-building projects over the past twenty-five years.  The development of the Address Coding Guides (ACG) for the 1970 census and the GBF/DIME-Files for the final stages of the 1970 census and for large-scale use in the 1980 census were major steps toward full automation of the Census Bureau’s geographic support programs.  Althought the ACGs provided the building blocks for the later development of the GBF/DIME-Files, they lacked geographic coordinates and topological structure (Marx 1986).

    The design and development of these computer-readable files involved the active participation of federal, state, local, private and academic organizations.  The Census Use Study, a small-area data research group sponsored by the Census Bureau from 1966 to 1969, was instrumental in creating and diffusing knowledge about a system that represented map features numerically for processing by a computer to create a geographic base file (U.S. Bureau of the Census 1973).  The enhancement of this body of knowledge was of critical importance to the Census Bureau because it enabled the adoption of census-by-mail enumeration methodologies for the major urban centers of the United States.

    The change in enumeration procedures was a response to the increasing costs and difficulty of conducting a traditional door-to-door canvassing of the population, a growing and increasingly urban population living within areas having mail delivery by house-number/street-name address, and technical feasibility of linking such addresses with the geographic units used for data tabulations due to improvements in computer capabilities.  For the 1980 census, the GBF/DIME-Files were enhanced to cover 287 of the nation’s largest urban centers, representing more than 60 percent of the population but less than 2 percent of the nation’s land area (Carbaugh and Marx 1990).

    Data sharing during the 1970s.  The development of the ACGs and GBF/DIME-Files involved data sharing with more than 300 local planning agencies.  Sharing primarily took the form of converting analog data into a computer-readable format rather than digital exchange per se.  The long-term nature of these relationships provided the Census Bureau with intimate knowledge of the resources available to local agencies, the limitations of available data, and the willingess and ability of agencies to share data.  At the same time, local planning agencies and others, through their association with the Census Bureau, became increasingly aware of the potential computer mapping, automated address matching, and spatial data analysis to meet local needs (Sobel 1978).  These relationships played an important role in the later development of the TIGER system.

    Monetary and other incentives to the local agencies played a major role in the “data sharing” development of the ACGs and GBF/DIME-Files (Silver 1977).  In many cases, local agencies completed work under contract with the Census Bureau or with funding provided by the Department of Housing and Urban Development (HUD), and Federal Highway Administration (FHWA), and other federal, state, and local planning sources.  Sensitive to the fact that the development of the GBF/DIME-Files provided a practical solution to many needs of local governments, the Census Bureau offered to provide the computer programs, processing methodology, and clerical procedures for creating  and updating the file, as well as a free copy of the completed file, to each participating local agency.  The Census Bureau also offered to do the data keying and processing if agency resources were limited.  The promise of higher quality data when these files were used for taking and tabulating the 1970 and 1980 census provided a further incentive for data sharing.

    As a result of this process, the Census Bureau and local officials learned that the quality and currency of existing data used by planning agencies could not be taken for granted.  For example, many local communities relied on the knowledge of a few people for the location of dwellings, address-numbering systems often were not systematic, tax assessor sources sometimes did not meet the bureau’s quality standards, and data from different agencies often were inconsistent.  Because of these situations, the initial transition to address assignment via automated processes was difficult.

    The efforts of working with so many local agencies also challenged the Census Bureau.  During the file-building projects, there was turnover in project personnel and key decision makers in the local agencies as well as at the Census Bureau.  Some “champions” of the new methods were not reelected or moved on to other jobs.  The long-term benefits of building such files were not always apparent to public administrators.  Some administrators viewed the new computer technology as a threat to their role in the agency.  Also, during the 1970s, there was a general lack of understanding (personal computers had not yet been invented) and/or trust in the new computer technology and the ever-present fear of “big brother” mainframe computers held by a few large government agencies.  All these factors tended to hamper development activities and to result in considerable variation in the quality, time expended, and funds necessary to complete the GBF/DIME-Files.

    Data sharing during the 1980s.  Census Bureau funding of data collection and coding by local agencies to create the GBF/DIME-Files created precedents that would later affect the pre-1990 TIGER-building efforts.  The overall reductions in federal aid to cities and states during the 1980s provided an atmosphere of “less than cooperative initial attitudes” among many local agencies struggling with small staffs and fewer resources than they had in the 1970s.  Many agencies expected to be paid for their efforts.  These situations, and the perceived difficulties inherent in working with several hundred local agencies, contributed to the Census Bureau’s decision to do most of the map and address updates for the 1990 census in-house, albeit often based on materials supplied by local agencies.

    Changes in the original GBF/DIME-File format by local agencies exacerbated some early data-sharing efforts.  Many of the larger files that had been maintained by local agencies (e.g., New York City) had been adapted and enhanced to meet local planning and administrative needs.  Fiscal difficulties at the local level, time constraints on the Census Bureau, and the lack of additional programming staff in all agencies often precluded efforts to recreate the original file structure without losing the map and address updates.  In these cases, the feature and address range update work had to be redone manually by the Census Bureau from digital plots and databases.

    One of the general weaknesses of the GBF/DIME-Files was that they were limited geographically and, therefore, the ability to use the files on an ongoing basis was limited to large-scale users with on-line access to mainframe computers.  Relatively few of the original GBF/DIME-Files given to local governments were updated and maintained during the 1980’s. Even fewer of the locally updated files met Census Bureau standards for direct incorporation into the TIGER database. In practice, however, these updated GBF/DIME-Files provided the Census Bureau with its first challenge in the digital spatial data exchange at the local level.

    Digitized GBF/DIME-Files, attribute-rich but of mediocre
    coordinate accuracy, formed the cartographic base for 345 urban centers in the 1990 TIGER database. Absolute coordinate accuracy was not a primary concern in the development of the GBF/DIME-Files, as they were used primarily by the Census Bureau for geocoding rather than mapping purposes. Although their coordinate accuracy was well below that of the U.S. Geological Survey’s (USGS’s) Digital Line Graph (DLG) files derived from their 1:100,000-scale maps, these files represented features with all their respective feature names, address ranges, and 1980 geographic area codes in their correct relative location – sufficiently accurate for taking a census (Sobel 1986).

    Although the Census Bureau would have liked to provide an enhanced cartographic quality, the deadline pressures of an upcoming decennial census forced management to abandon initial plans to align this information to the USGS’s DLG files. Also, because of staff and time constraints, the files were sent to four private-sector contractors for digitizing of feature updates using Census Bureau-supplied updated 1980 census maps. The results from these arrangements were of mixed quality.

    TIGER Database Development, 1983-1990

    The institutional knowledge and experience gained from the development of the GBF/DIME-Files, further theoretical and conceptual advances in the field of mathematics (Corbett 1979 and White 1984), and the “enabling” availability of new and affordable technology, provided fertile ground for the Census Bureau’s next challenge: the development of the TIGER database for use in the 1990 census. Whereas the GBF/DIME-Files covered small noncontiguous portions of the United States and were developed initially without spatial or geographic references in their design, the TIGER database covered the entire nation and its territories and was grounded in a more rigorous conceptual model of topology and space (see Boundriault 1987; Kinnear 1987; Broome and Meixler 1990).

    Census Bureau/USGS cooperative agreement. In addition to data-sharing arrangements on the local level, the ability to complete the building of the TIGER database in time for the 1990 census was directly dependent on a landmark 1983 data-sharing agreement with the USGS. The USGS provided the Census Bureau with computer files of scanned versions of its 1:100,000-scale maps for the lower forty-eight states. In return, the Census Bureau assigned cartographic classification codes to the roads in these files. The resultant product formed the cartographic base for all areas outside the large urban centers covered by the GBF/DIME-Files, thereby enabling the Census Bureau to complete a coast-to-coast digital map base in time for the 1990 census (McKenzie and LaMacchia 1987).

    Interagency cooperation with the USGS was an experience that provided valuable lessons to the Census Bureau in particular, and a model for future cooperation between federal agencies in general. Following a successful Florida pilot project, high-level management in both agencies perceived that cooperation would result in a win-win situation. Each agency would be able to accelerate its individual map production programs and, in the process, they could develop the first “large-scale” digital map file of the United States. Early negotiations ensured that neither agency would feel it was bearing an unfair burden. A schedule of meetings on a regular basis ensured communications during all phases of the cooperative agreement. The challenge of meeting the Census Bureau’s decennial deadlines provided an added measure of incentive to “get the job done” and an ongoing requirement to measure progress.

    The U.S. Census Bureau/USGS cooperative agreement minimized duplication of effort in federal map automation activities and provided immediate short-term benefits to both agencies. The success of this cooperative venture demonstrated to the Department of Commerce, the Federal Office of Management and Budget (OMB), and the Congress that there were significant benefits to be derived from such activities. The success of this cooperative effort also resonated in the later development of the Federal Geographic Data Committee (FGDC) (OMB 1990), the growing impetus for further cooperative efforts in spatial data sharing, and the increasingly articulated vision of a national spatial data infrastructure (Marx 1992).

    Other success factors. The success of the TIGER System and the ability of the Census Bureau to overcome organizational inertia both within and outside the Census Bureau were due to a number of factors. There was a shared sense both within the Census Bureau and by the data-user public that change was needed in the geographic support process. A primary incentive for developing the TIGER database was the large number of inconsistencies between the statistical and geographic data products in the 1980 and earlier censuses, a product of the complex and clerically intensive preparation of maps, ACGs and GBF/DIME-Files, and geographic reference files (Marx 1986).

    The enormous political and economic ramifications of the decennial census made everyone a stakeholder in the process. The growing importance of the information sector of the economy and growing public demands for more accurate, cost-efficient, timely, and accessible data products helped to promote an environment receptive to the exchange of data, expertise, and experience with other governmental agencies at all levels, the private sector, and the academic community.

    Bureaucratic inertia was further overcome by staff commitment, expertise, and initiative with the Census Bureau’s Geography Division. The transition from traditional to automated mapping for the 1990 census required changes in the organization and planning of the Census Bureau’s mapping activities as well as in the requirements of staff in developing and incorporating new cartographic techniques and computer skills. Motivated by a decennial environment of schedules and fixed deadlines with no alternative to full automation for product delivery and a cooperative agreement with the USGS, the Census Bureau’s staff produced significant results in a relatively short time (Trainor 1990). The resources and skills gained from this experience, combined with a history of successful technical innovations at the Census Bureau, promoted a willingness to reach out and explore the potential for digital data sharing.

    Some criticisms. Countering these positive aspects are criticisms of the TIGER database, such as the relative poor coordinate accuracy of the roads in the major urban centers (the roads that came from the GBF/DIME-files used in lieu of USGS DLG file), lack of address range and ZIP Code improvements or expansion beyond the 345 GBF/DIME-File areas, and inconsistencies in the names and classifications of streets. Public complaints about the quality of the data in the TIGER database provided the Census Bureau and its parent agency, the Department of Commerce, with further verification of the wide applicability and importance of this database beyond the internal needs of the Census Bureau.

    Although valid, most of these situations were not critical for the taking of the 1990 census, the primary mission of the Census Bureau. In fact, many private consulting firms have taken advantage of these “problems” to repackage “new and improved” versions of the bureau’s publicly available extracts from the TIGER database: the TIGER/Line files. The Census Bureau is correcting many of these situations and will release future TIGER extract products with these updates and corrections.

    In general, the TIGER database continues the GBD/DIME-File tradition of being attribute-rich and current but with limited coordinate accuracy in the major urban centers. The USGS’s DLG files, on the other hand, have high “ground truth” accuracy for the features they show but have few attributes (DLG-Enhanced Files, once released, will improve on the latter). In addition, the DLG Files do not contain current information and the USGS has not been provided with the financial resources need to perform frequent, nationwide, and systematic updates. As stated previously, alternative methodologies for updating the files are being evaluated, including an initiative to accelerate the collection of base cartographic data using graphic or digital orthophoto quadrangles or aerial photography (FGDC 1992).

    ENHANCING THE TIGER DATABASE

    Since the completion of the TIGER database for the 1990 census, the Census Bureau has become increasingly aware of its vast potential as well as its current weaknesses. If viewed as a process rather than a product, the TIGER database provides an opportunity to improve statistical accuracy and data quality significantly. The updated address and geographic information systems in a growing number of public and private agencies and the databases of the U.S Postal Service (USPS) provide important means of enhancing the collection, processing, and tabulation of census data.

    The planned release of a new TIGER/Line extract by the Census Bureau containing extended address ranges and ZIP Codes for all areas with city-style mail delivery is significant, not only for the inherent value of the added data (there will be an increase in address range coverage from approximately 55 percent to 85 percent of all housing units in the United States), but because it represents new capabilities that never existed before. For the first time, the Census Bureau will be releasing value-added files for data users after the census that are not directly tied to the decennial statistical data products (however, they will be used for geocoding establishments in the intervening economic census). This precedent also is important in that it reflects a subtle change in the traditional once-a-decade data dissemination paradigm and opens new possibilities for future digital data exchanges. Perhaps, it also is indicative of the more customer-oriented approach mentioned earlier in this chapter.

    Because the Census Bureau was not intended to be the nation’s preeminent mapping agency, any data-sharing agreements to improve the TIGER database must be perceived by the overall organization as primarily benefitting the census-taking process. At this stage of planning for the 1992 and 1997 economic censuses and the 2000 decennial census, the Census Bureau has made an organizational commitment to the integration of the Census Bureau’s related Address Control File with the intercensal update and improvement of the TIGER database. This commitment, however, is dependent on the availability of resources.

    In the absence of a national updated map system, the U.S. Census Bureau has pursued a relatively high-cost mapping compilation strategy for the 1990 and previous censuses (Rhind 1991). Labor-intensive comparisons of reference sources, often of varying scale and quality, have been the primary means for updating census maps. In this respect, the Census Bureau has pursued and maintained close contacts and relationships with other federal agencies, state, regional, and local planning and transportation agencies, engineering firms, aerial survey companies, tax departments, utility firms, and a host of other public and private firms with current map and address reference source materials.

    The development of the TIGER System and other digital spatial and attribute databases, coupled with the proliferation of more powerful computer hardware and GIS software, allows data exchange to occur in a more sophisticated, more timely, and potentially more accurate and less costly manner. From a technical perspective, there appear to be few limits to the advance of this new mode of digital data exchange. According to Cooke (1995), the technical problems of data sharing have mostly been, or are in the process of being, solved. The non-technical components of data interchange may prove more daunting.

    Current Data-Sharing Plans

    Current data-sharing plans at the Census Bureau are twofold. On the one hand, the Census Bureau is investigating the possibility of national sources, predominantly federal agencies, that could provide the information to keep the feature and address-range information in the TIGER database up to date. Based on a report by the Government Accounting Office (GAO 1991), which stated that federal agencies increased their planned expenditures on GIS by about 60 percent between fiscal years 1990 and 1992, the likelihood of such exchanges is promising.

    At the national level, the Census Bureau has entered into or proposed data-sharing agreements with several large governmental or quasi-governmental agencies including the USGS, USPS, the Environmental Protection Agency (EPA), the Soil Conservation Service, the Federal Railroad Administration, the U.S. Army Corps. of Engineers, and the Federal Emergency Management Agency (FEMA). These efforts to improve the spatial and attribute data in the TIGER database are ongoing and likely will include several other agencies by mid-decade. As the coordination of GIS activities improves in the federal sector, bilateral agreements between agencies to improve the TIGER database will likely evolve into multi-agency agreements, as has already occurred at the state and local levels (see Murakami and Greenleaf 1992).

    The Census Bureau also is investigating possible mechanisms for the electronic interchange of updated geographic information with state, local, private, and academic organizations. The current preference of the Census Bureau is to coordinate these efforts and control quality at the state level rather than having to deal with conflicting data and different file formats from several thousand local governments, private agencies, universities, and other sources. Experience indicates, however, that this will not always be possible.

    Role of the FGDC in Data Sharing

    Increased GIS use by federal agencies as well as state and local agencies has led to renewed efforts to coordinate development, sharing, and dissemination of spatial data, primarily through the Federal Geographic Data Committee (FGDC). The FGDC, formed in late 1990 at the direction of the OMB in its Circular A-16 (OMB 1990), includes representatives of fourteen departments and independent agencies but has no direct authority, responsibility, or resources. Participation is voluntary, with decisions based on consensual agreement among its members. Given the current limits on availability of fiscal resources, several agencies have agreed to pursue cooperation in accelerating the 1:12,000-scale digital orthophoto program and the 1:24,000-scale digital quadrangle program. The Census Bureau is working on individual Memoranda of Understanding (MOU) with other federal agencies in the spirit of the FGDC and has agreed to cooperate with the USGS in devising a still more powerful data structure.

    In the long run, the FGDC may provide the vehicle necessary to transform institutional relationships within the federal government as well as with state and local governments, the private sector, and the academic community. The FGDC is making an increased effort to involve the non-federal community in its coordination work. Although the goals are lofty and the potential long-term benefits extraordinary, effective leadership at the highest levels and a concomitant commitment to the development of compatible standards will be necessary. In the absence of these developments, only significant short-term and real cost-saving benefits gained from data sharing by particular agencies will move the process forward toward a national digital spatial database.

    Significantly, in July 1992 the Census Bureau and the USGS signed an amendment to the original 1981 Memorandum of Understanding that commits both agencies to merging the current DLG and TIGER databases, including information resulting from partnerships with other agencies. The development of a shared database that combines the essential geographic information needed by both agencies to carry out their respective institutional mandates will have a profound effect in further stimulating the development of a national spatial data infrastructure in the United States.

    Intercensal Data-Sharing Projects, Post-1990

    Current data-sharing experiences at the Census Bureau have been exploratory, and the actual mechanisms and standards for digital data interchange are still in the process of being developed.  The Census Bureau is engaged in a number of activities to help promote digital data sharing during the 1990s.  These activities include conferences, participation in the FGDC initiative, MOUs with other federal, state, and local agencies, pilot projects, and the planning and imminent release of the prototype version of the TIGER/SDTS (Spatial Data Transfer Standard) file.

    Based on these forums and activities, the Census Bureau has begun to receive initial feedback on the non-technical impediments to and incentives for digital spatial data interchange.  The Census Bureau has provided its digital geographic and statistical data sets to the public at the cost of dissemination and as a public resource (OMB 1992).  Other public and private agencies, however, have different perspectives and regulations pertaining to their own data sets and on the updates they may perform.  Profit is a major concern of private companies, and public agencies also are looking to their products as a source of revenue.

    States and local agencies consistently have reported one or more of the following problems in trying to coordinate a GIS:

    •    Agencies wanting proprietary control of internal data

    •    Lack of resources in one department or institution affecting data requirements of another

    •    Archaic systems

    •    Managers and commissioner-level officials who know, or care, little about GIS

    •    Staff turnover

    •    Lack of commitment

    Once a state or local GIS is operational, the lack of overt incentives to expend the additional time and expense required to feed local updates into the TIGER/Line or SDTS formats for interchange become more apparent.  Ensuring the currency, accuracy, and quality of the TIGER database is an integral part of ensuring the accuracy and quality of the associated census data, which are used for reapportionment, redistricting, the distribution of federal funds, and innumerable planning and development programs – but this may be a long-term and nebulous consideration for some agencies.  The lack of a clearly defined formal process for data interchange by the Census Bureau could provide an even greater impediment to institutions willing to share data.

    The existence of multiple geographic data file formats also have inhibited the interchange of data.  Future geographic data files at the Census Bureau will be released in accordance with a recent Federal Information Processing Standard (FIPS) – the SDTS.  Adoption of the SDTS involved cooperation of federal, state, and local officials, the academy community, and private sector over an extended period of time.  The Census Bureau was an active participant in the development of this new federal standard for data exchange and, to that end, released its first prototype TIGER/SDTS file (Davis et al. 1992).  However, the effect of this rather complex format on data interchange is not yet clear.

    Each data-sharing agreement brings forth new possibilities, new arrangements between agencies, and the potential for new products.  Working with a variety of agencies and soliciting recommendations for improving TIGER, Census Bureau staff have noted similarities among the interests of many agencies.  For instance, conversations with the USPS and the U.S. Department of Transportation have revealed similar interests in enhancing attributes for streets (e.g., turn and directional restrictions).  In many cases, the Census Bureau would be able to use the enhancements needed by other agencies for improving the quality and cost-effectiveness of its own internal operations (e.g., routing of enumerators).

    Proposed Census/USPS cooperative program.  The USPS and the Census Bureau have been working together for more than thirty years in the delivery and return of questionnaires for the decennial, agriculture, and the economic censuses and surveys.  The increasing use of mail-out/mail-back procedures throughout the nation has made the USPS an indispensable partner in the Census Bureau’s data-collection activities.  The Census Bureau traditionally has paid the USPS to verify the completeness and accuracy of its decennial census address list, which was purchased from private vendors and enhanced through in-house programs, prior to the mailout of questionnaires.  The Census Bureau also worked with the USPS on the development of computer algorithms to match the ZIP+4 files to GBF/DIME-File records during the 1980s.  ZIP+4 files, also known as the Address Management System (AMS) Files, contain potential address ranges for all areas where the USPS delivers mail.

    In an effort to enhance this cooperation to the benefit of both agencies, the Census Bureau proposed a formal Memorandum of Understanding (MOU) with the USPS similar to the one it had with USGS in the 1980s. In 1990, the Census Bureau and the USPS, with the participation of the USGS, began cooperating on a pilot project to provide a better geographic database for all three agencies.  If signed, this MOU will have broad implications, not only for these three agencies, but for the GIS community as a whole during the 1990s.

    The Census Bureau’s proposal envisions a four-year file update/enhancement plan (fiscal years 1994-1997) and subsequent ongoing cooperative efforts to update a Post/TIGER database.  As with the earlier USGS agreement, the USPS/Census Bureau MOU would formalize a seemingly well-suited alliance between two agencies.  Cooperative database-building activities could reduce duplication of efforts, thereby reducing overall costs, as well as improve the geographic and attribute accuracy of the information available to each agency.

    The Census Bureau’s objectives for the proposed joint program during the intercensal years are (1) to obtain updated information on the location of streets with their names, address ranges, and ZIP codes as well as the location of group quarters, office building or other locations of economic activity, and (2) to improve the error-prone decennial census address lists development operations.  According to a recent GAO report (1992), enumerator follow-up to vacant and nonexistent units alone resulted in an added expenditure of approximately $317 million to the 1990 census operation.   Accomplishing the above objectives will enable the Census Bureau to reduce its critical dependence on a large, temporary clerical workforce before and during each decennial census, thereby reducing costs, and to improve the overall quality and consistency of decennial census data products.

    In the overall proposal, the joint venture envisions updating the TIGER database to permit automated analysis of carrier routes and the production of carrier route maps for use by the USPS.  In the pilot study, the effort to improve and maintain the positional accuracy and completeness of the TIGER database involved the use of Global Positioning System (GPS) technology with receivers mounted on USPS-supplied vehicles.  In order to improve and update the geocoding capability of the TIGER system, the pilot study also updated the address ranges in the TIGER database and added ZIP+4 Codes and other USPS information.  The joint venture envisioned would extend similar geocoding improvements to all parts of the United States with city-style address systems.  The Census Bureau would provide the USPS with its technical and geographic expertise.

    Potential impediments.  Although this data-sharing scenario appears to satisfy the needs of both agencies, there are a number of organizational, behavioral, and institutional impediments that will need to be overcome.  Some of these impediments are specific to these two agencies while other are generic to any data-sharing milieu.  One specific impediment is that the USPS is not part of the FGDC.  The USPS also is a quasi-federal agency, which means that it must justify its participation in an agreement on a benefit/cost basis including expected revenue or savings from potential products.  There also are questions about what information will be shared and what will be the property of each organization.

    A more general institutional impediment is the difficult of building a single database to serve the needs of different agencies.  Federal agencies have worked independently of each other for a long period of time collecting and structuring their data according to geographic units based on their own unique criteria and naming conventions.  Data sharing presupposes a strong, long-term, funded commitment to reconcile what really constitutes different versions of the same reality.  In order to reach agreement, a shared database initially may involve compromises that could make it less effective than two separate databases.  Each agency has its own self-interest and mission that must be met first.  Short-term objectives may become more pronounced and inhibit progress.  The need to understand each other’s terminology, organizational structure, and needs represents another potential impediment.

    Another important issue, not only for the Census Bureau but for all potential data partners, is the potentially differing perception of the benefits from data sharing.  One agency may believe that it is giving up more than the other agency and is bearing an unfair burden, causing a negative effect on data-sharing plans.  Some groups within each agency may either be opposed to or less than committed to the idea of data sharing.  While this issue was always in the background during the joint Census Bureau/USGS Cooperative Mapping Project, it never came to the forefront as a major issue.

    Another potentially critical impediment common to bureaucracies is the inertia effect; that is, it is easier to stick with the tried and reasonably true methods than to try something new.  Change can be intimidating and can upset a delicate balance of power in an organization.

    OTHER ONGOING DATA-SHARING ACTIVITIES

    In a broad sense, the Census Bureau traditionally has been in the business of sharing geographic data and has a long history of partnerships with state and local governments.  The Census Bureau receives updated governmental unit boundaries from local officials on a periodic basis through its Boundary and Annexation Survey.  The Census Bureau also works with local Census Statistical Area Committees, composed of representatives of the public, private, and academic communities, in delineating the boundaries of census statistical areas such as census tracts, block groups, and census-designated places.

    The Census Bureau also has worked closely with state election officials to provide the information they need for redistricting and reapportionment; with the U.S. Department of Education to incorporate the boundaries of school districts as a means to produce data for school districts; and with metropolitan planning organizations to improve the quality of the Census Bureau’s address reference files, which improves the quality of the usefulness of the census journey-to-work and place-of-work data, and to define traffic analysis zones in terms of census blocks to facilitate the tabulation of decennial census data for those areas.  The Census Bureau has built its massive geographic database primarily from source materials acquired from state and local agencies.  In a new program based on 1990 census data, data users may independently aggregate census blocks to define their own statistical areas and receive maps and data profiles of these user-defined areas from the Census Bureau on a cost-reimbursable basis.

    In addition, the Census Bureau maintains and is constantly improving its huge, ongoing institutional data-sharing apparatus.  Each of the Census Bureau’s twelve regional offices supports information services and geographic programs that coordinate activities with state data centers and their affiliates throughout the United States.  The Census Bureau’s Data User Services Division combines educational functions (e.g., ongoing workshops on TIGER, census maps, and data products) with its data distribution functions.  The Census Bureau participates in the kindergarten through twelfth grade (K-12) geographic literacy campaign in the United States a means to address the important issues of access to new information technologies.  Public access issues (Emergency Planning and Community Right-to-Know Act 1986) also have provided the impetus for an ongoing cooperative project with the EPA and the National Oceanic and Atmospheric Administration (NOAA) that supports the development of public domain software that links and displays environmental, socioeconomic, and demographic data using an extract of the TIGER database.

    From a more global perspective, the Census Bureau shares its technological and statistical know-how with a number of participating countries through its in-house and overseas training programs.  The Census Bureau also is cooperating with Statistics Canada (Haythornthwaite 1992) and pursuing talks with Mexico’s statistical agency, Instituto Nacional de Estadistica, Geografia e Informatica, to create a North American Common Borders Database.  In summary, the Census Bureau maintains a huge ongoing apparatus that supports and complements its current efforts at spatial data interchange.

    CONCLUSIONS

    As the Census Bureau prepares for the 2000 census, change, once again, appears imminent.  According to a recent GAO report (1992,4), “the current approach to taking the census appears to have exhausted its potential for counting the population cost-effectively.”  Similar statements were made in reference to the Census Bureau’s geographic support program following the 1980 census (Tomasi 1990).  These statements have recurred with periodic frequency in the recent history of census-taking.  In large part, the innovations that followed were a response to intense public scrutiny, a result of a process that has significant and far-reaching political (reapportionment and redistricting) and economic (distribution of government funds) consequences.  Over the past fifty years, the Census Bureau has met not only the challenges brought about by massive social, demographic, and economic change, but has provided effective leadership in applying and diffusing a number of new technologies to both the governmental and non-governmental sectors.

    The application and use of new technologies on a vast scale by public organizations are reshaping the internal organization and relationships within the public sector as well as among the public and private sectors and the academic and research communities.  The dominant values of narrow, functionally separate governmental agencies and departments are increasingly being replaced by a system of greater complexity and interrelatedness and a growing trend toward public and private sector cooperation.  The Census Bureau’s TIGER system is an important example of this phenomenon and may be a primary motivator for such changes.

    The Census Bureau’s geographic and statistical products – inexpensive and ubiquitous spatial and attribute raw material – have helped transform GIS from a highly technical field dominated by large agencies, private firms, and universities, to one that is becoming increasingly accessible to many data users.  Advances in our technological infrastructure thus far have been the primary factor enabling institutions to develop and share digital geographic data.  Personal computers, workstations, CD-ROMS, and databases available for automation were relatively nonexistent until the 1980s.  The incipient use of new telecommunication technologies, such as the Internet, may have an equally profound effect on data sharing and online services in the latter part of this decade.

    As computer hardware and software continue to become less expensive and more powerful, the vast potential of GIS will depend increasingly on the behavioral, organization, and institutional issues acting as impediments and incentives to the sharing of geographic data.  The integration of various multi-media technologies and the growing capability to link a wide variety of public and private databases also raise a number of privacy issues.  Deriving the full benefits of GIS and related information systems will depend, to a significant degree, on how society approaches and resolves these issues (Onsrud 1992).

    Based on past experience, it will require extraordinary leadership, communication and flexibility among agencies to facilitate the process of spatial data interchange.  Data sharing will be most successful when such ventures can be justified by short-term results, verified cost reductions, improved operations, and minimal problems of data ownership.  Data sharing will be enhanced to the degree that these ventures can be built around joint development projects such as the Census Bureau/USPS and the Census Bureau/USGS experiences in the 1990s.  Awareness of the need for such coordination is growing and is best reflected in the increasing number of statewide GIS committees and partnerships between the federal and state communities.

    Geographic databases have been built thus far to support the mandates of single institutions or parts of an institution.  All who collect and manage data for activities related to their own responsibilities will need to understand and appreciate the value of those data to others and to collect and structure their data accordingly.  In order to take full advantage of the opportunities offered to these new technologies, business, government, and academic will need to develop, support, and fund data exchange on a systematic and ongoing basis as well as promote accessibility of GIS capabilities to all sectors of our society.

     

    REFERENCES

    Anderson, M.J. 1988. The American census: a social history. New Haven: Yale University Press.

    Boudriault, G. 1987. Topology in the TIGER file. Eighth International Symposium on Computer-Assisted Cartography, Proceedings. Baltimore, Maryland, 258-263.

    Broome, F.R. and D.B. Meixler. 1990. The TIGER database structure. Cartography and Geographic Information Systems 17, 1:39-47.

    Carbaugh, L.W. and R.W. Marx. 1990. The TIGER system: a Census Bureau innovation serving data analysts. Government Information Quarterly 7, 3:285-306.

    Cooke, D. 1995. Sharing street centerline spatial databases. In H.J. Onsrud and G. Rushton, eds., Sharing Geographic Information. New Brunswick, NJ: Center for Urban Policy Research, Rutgers University.

    Corbett, J.P. 1979. Topological principles in cartography. Technical Paper 48. U.S. Bureau of Census, Washington, D.C.

    Davis, B.A., J.R. George, and R. W. Marx. 1992. TIGER/SDTS: standardizing an innovation. Cartography and Geographic Information Systems 19, 5:321-327.

    Emergency Planning and Community Right-To-Know Act: Title 3 of the Superfund Amendments and Reauthorization Act of 1986. PL 99-499, 17 October 1986. United States Statuates at Large 100. pp. 1728-1758.

    Federal Geographic Data Committee. 1992. Multi-agency initiative to meet high priority requirements for base cartographic data. FGDC Subcommittee of Base Cartographic Data. Washington, D.C.

    Government Accounting Office. 1991. Geographic Information Systems: information on federal use and coordination. IMTEC 91-72-FS. Washington, D.C.

    ________. 1992. Decennial Census: 1990 results show need for fundamental reform. GAO/GGD-92-94. Washington, D.C.

    Haythornwaite, T. 1992. Development of the United States-Canada Common Border Database. The Operational Geographer 10, 1:28-30.

    Kinnear, C. 1987. The TIGER Structure. Eighth International Symposium on Computer-Assisted Cartography, Proceedings. Baltimore, Maryland, 249-257.

    Marx, R.W. 1986. The TIGER System: automating the geographic structure of the United States census. Government Publications Review 13, 181-201.

    ________. 1992. Building the National Spatial Data Infrastructure: the data integrity challenge. Paper presented at the Regional Surveying Engineering Conference, Hartford, Connecticut.

    McKenzie, B.Y., and R.A. LaMacchia. 1987. The U.S. Geological Survey-U.S. Bureau of Census Cooperative Digital Mapping Project: a unique success story. Paper presented at American Congress on Surveying and Mapping meeting, Reno, Nevada. Fall.

    Murakami, E., and K. Greenleaf. 1992. Multi-agency TIGER file updating. URISA Proceedings 2:25-35.

    Office of Management and Budget. 1990. Coordination of surveying, mapping, and related spatial data activities. OMB Circular A-16 (Revised). Washington, D.C.

    _______. 1992. Management of federal information resources. OMB Circular A-130. Washington, D.C.

    Onsrud, H. 1992. Privacy and spatial databases. Technical Program Abstracts., 27th International Geographical Congress, Washington, D.C., 480-481.

    Rhind, D.W. 1991. Counting the people: the role of GIS. In D.J. Maguire, M.F. Goodchild, and D.W. Rhind, eds., Geographic information systems: principles and applications. Longman Scientific and Technical, Essex, 2:127-137.

    Silver, J. 1977. The GBF/DIME system: development, design and use. Paper presented at 1977 Joint Annual Meeting of the American Society of Photogrammetry and the American Congress on Surveying and Mapping. U.S. Government Printing Office, 1977-240-869/1102.

    Sobel, J. 1978. GBF/DIME system – development and reference source problems. Applied Geography Conference, SUNY – University Center at Binghamton, 1:112:121.

    _______. 1986. Principal components of the Census Bureau’s TIGER file. Research in contemporary and applied geography: a discussion series. SUNY at Binghamton, 10, 3:1-17.

    Tomasi, S.G. 1990. Why the nation needs a TIGER system. Cartography and Geographic Information Systems 17, 1:21-26.

    Trainor, T.F. 1990. Fully automated cartography: a major transition at the Census Bureau. Cartography and Geographic Information Systems 17, 1:27-28.

    U.S. Bureau of Census. 1973. Census Use Study. International DIME Colloquium. Confernce Proceedings, Washington, D.C., August 27-29, 1972.

    White, M. 1984. Technical requirements and standards for a multipurpose geographic data system. The American Cartographer 11, 1:15-26.

  • 3D Visualization Software and…Introducing a New Contributing Author

    A couple of weeks ago, I dedicated a column to discussing the emergence of 3D geospatial data. This week, I was navigating around the U.S. Army Geospatial Center’s (AGC) website, which was formerly known as the Engineer Research and Development Center’s Topographic Engineering Center (TEC). As of October 1, 2009, AGC is operating as a Major Subordinate Command Center under the U.S. Army Corps. of Engineers. Anyway, I was navigating the website and stumbled upon the most comprehensive list of commercial 3D visualization software programs I’ve ever seen.

    I’ve been involved with 3D visualization software (mostly on the data side) since about 2001. In my experience, it has always been a labor intensive process to develop high quality 3D visualizations. Still images are easier than animations, but still a chore to do if you desire high quality rendering in the images.

    Following are two images. One was rendered using medium quality resolution/textures vs. high quality resolution/textures:

    It’s a big step in time, both development time and rendering time, to upgrade from medium quality to high quality renderings. Most 3D visualization software programs work at the medium level or lower. This is primarily because they produce 3D visualizations that are “good enough” for the task at hand.

    3D visualizations have become much more common as compared to nearly a decade ago when I first started experimenting with them. Software has become more powerful and easier to use. Computing power has become exponentially more powerful. One of today’s computers can render as fast as small “server farm” back in the year 2000.

    Back to the AGC

    While surfing the AGC website, I found the most comprehensive listing of commercial 3D visualization software as I’ve seen anywhere. You can view it here. Be aware that some of the links might be obsolete, but certainly all of the 3D visualization softwares I’ve experienced are included in the listing.

    Introducing our new Contributing Author

    If you recall in my column a couple of weeks ago, I presented the many initiatives I plan for Geospatial Solutions in 2010. One of the initiatives was to enlist a number of industry specialists who could offer a different perspective from a very specific part of the geospatial industry. Well, I’m pleased to announce that Craig Greenwald is joining our team as our Contributing Author for Mobile GIS.

    Although I’ll ask him to formally introduce himself in his first contribution, I’ve known Craig for well over ten years. In the mid-90’s, he and I worked together at the same company…his first job out of graduate school at Oregon State University. Craig then spent a number of years at ESRI on the ArcPad Team, interrupted by a brief stint at Bradshaw Consulting. Many of you may have run into Craig at the ESRI User Conference where he conducted numerous basic and advanced ArcPad workshops and briefings. Craig is now a principal at the GIS firm GeoMobile Innovations.

    I’ll be publishing Craig’s first contribution in just a few weeks. I’ve asked him to provide us with a look forward into 2010 with respect to Mobile GIS. Will there be any disruptive technologies or will it just be the status quo? What kind of new productivity tools can we expect to see? What will be the trends in the industry?

    I look forward to his answers and I hope you do too.

    See you next week.

  • Geospatial Data Accuracy – Better and Better

    Circa. 1995. I walked into the GIS office of a major forest products company in northern Arkansas (or was it northern Louisiana…not sure). At the time, I was a product manager for a GPS company and field testing one of our newer GPS mapping hand-held products.

    We decided to go out and map the perimeter of a timber tract they owned to compare the area (acres) that the GPS calculated vs. what was in their GIS. This was nothing new as I’d done it many times for other companies. We went to the tract (adjacent to a road) and walked the boundary (~40 acres). Afterwards, we returned to their office and I post-processed the GPS data.

    I forget what the final area calculation was, but it was believable and the company didn’t challenge the result. Also, the shape of the polygon seemed reasonable. However, when the GIS manager inserted the GPS data into his GIS, it was offset a significant amount. I forget exactly, but something on the order of 50-100 feet. I immediately began considering if my data was the problem, but concluded the chance was low. For GPS post-processing, I’d tied into a local US Forest Service GPS base station so my GPS data was referenced to NAD83/86 (if I recall correctly). In the end, we agreed that my data was most likely positioned correctly.

    “So what?”, he said, “Do you think I’m going to adjust my entire GIS because it doesn’t agree with your GPS?” (I’m paraphrasing based on my recollection). I understood that I had won the battle, but lost the war. It didn’t matter that I was right, at least at that moment in time. However, he did agree with me that eventually he was going to have to reconcile the difference because GPS was destined to be the technology that defined the national spatial framework.

    I had many more experiences similar to the above during the mid-90’s. People would swear by the accuracy of USGS 1:24,000 quad sheets because that’s what they were used to. If the GPS data didn’t agree with the quad sheet, they’d dismiss the accuracy of GPS because it didn’t fit. This was particular true with utility companies too, that were some of the early adopters of CAD for mapping.

    Fifteen years later, this problem is not going away. The accuracy of Geospatial data continues to get better and better. Think back fifteen years and ask yourself about the quality/availability/price of orthophotography back then. I remember we were ecstatic to have access to free 1-meter, black/white DOQQs. Today, I can easily find 1’ pixel resolution orthophotography, commonly find 6” and occasionally run into 3” pixel resolution orthophotos free of charge. This allows one to digitize manholes and other infrastructure without leaving the seat at your GIS workstation.

    During the same period, the cost of accurate GPS measurements has reduced considerably. Whereas fifteen years ago, achieving sub-meter accuracy with a $12,000 mapping receiver was on the bleeding edge of technology. Today, a $2,000 mapping receiver can give you sub-meter results and a $6,000 receiver can achieve sub-foot accuracy. Looking way forward, the cost and availability of GPS accuracy is going to change significantly in the next 10 years. Obtaining one foot accuracy will be achievable with a very inexpensive GPS receiver.

    We all know that data drives a GIS. The better quality data we have, the more accurately and precisely and completely the GIS can answer our queries. Along these lines, I think it’s worth mentioning again the outcome of the litigation in California involving Santa Clara County and the ownership of GIS data.

    Santa Clara County GIS lawsuit

    Santa Clara County (California) was charging significant fees (potentially several hundred thousand dollars) to organizations who wanted to utilitize its full suite of GIS data including orthophotography, parcel, planning, streets, boundary, etc. A lawsuit was filed in 2006 by the First Amendment Coalition arguing that the GIS data should be released under the California Public Records Act. Santa Clara County argued that the GIS data was sensitive enough to be excluded due to homeland security issues (eg. making known the locations of critical infrastructure such as utilities). Santa Clara County lost the argument and was ordered by the court to hand over the GIS data. The 6th District Court of Appeal gave the final word last February.

    The outcome of the court case establishes a significant precedent in the geospatial industry. For as long as I can remember, this issue has been solidly ambiguous among state and local governments. One entity would email (or make available via FTP) GIS data at a moment’s notice. Another entity would have you sign away your first-born child. Even another would not entertain the thought of releasing “our data” to anyone. I think the attorney for Santa Clara County was accurate in stating “It was one of those cases that needed to be tried and for which we needed guidance from the court”. Normally, I have an anti-litigious attitude, but I’m happy to see a precedent has been established and publicized.

    Go on…be a TIGER

    After last week’s column about Google’s step forward in using their own base map for Google Maps/Earth in the US, I was admonished by a reader, and rightfully so, in not mentioning the value of Census data as an important part of the history of base map evolution in the US.

    Jon Sperling, Ph.D., GISP wrote:

    “It is quite disconcerting, from an historical and current perspective, that your article made no mention of the “pre-internet” Census TIGER database, the first topologically integrated national digital street centerline for the US or even the newly updated and positionally accurate TIGER files (with an associated but confidential file of GPS address points collected for every housing unit in the Nation). These files, newly updated for the 2010 Census, are still a major source for accurate and easily accessible public domain street level data for every community in the US, including Puerto Rico, the Virgin Islands, and other territories. More importantly, it was the innovation that spurred the GIS revolution across government, private industry, and academia by enabling every local agency, entity and person in the United States to build their own geographic information system by combining census data with TIGER. Prior to this development, the Census was also a leader in the development of the GBF/DIME Files which enabled address geocoding, a capability that led to the later success of Mapquest, Google, and others. Like the development of the internet by DARPA and the Global Positioning System also by the federal government, the widespread development of intelligent national street level mapping was also led by the government.  GDT/TeleAtlas began as a company that offered “enhanced” TIGER files and Navteq often used TIGER for the more rural areas. Not only has the Census pioneered but it remains a key catalyst and building block for delivering a cost-effective and truly integrated national spatial data infrastructure.”

    Attached is an article I wrote back in 1992 on the history/development of TIGER as well as a 2002 proposal for creating a shared national road network (geometry and basic attributes such as address range to enable consistent and shareable geocoding across
    domains). OpenStreetMap is a nice expression of a way to move forward.”

    A copy of Jon Sperling’s 2002 proposal can be read here (scroll down to Page 16).

    Thanks and see you next week.

  • The Struggles of a City GIS Manager

    This is real. The names have been omitted, but this is happening as I write at one city and I’m willing to bet many, many more cities around the world. The city is typical in the US. Its population is ~23,000. Geographic area is ~8 square miles. There are 430 acres of parkland, over 150 acres of designated openspace and 110 miles of sewer pipe pumping 2.3 million gallons per day.

    The issue at hand? These economic times are tight and the city is considering cutting back the GIS department.

    To me, an interesting fact is that this is not a city that’s behind the technology curve. In fact, I think they’re ahead of it. Has the GIS Manager (current and previous) done such a good job that they’ve worked their way out of a job? They’re using state-of-the-art GIS software products such as ArcGIS Server, ArcGIS desktop, ArcPad and even developed their own custom app using MapObjects that’s in use on 100+ computers throughout the city departments. They’re also using high performance GPS/GIS receivers to keep their GIS up-to-date.

    To give you an idea, following is a graphic illustrating the layout of their GIS:

     

    They serve up and make available data to the public much more than other municipalities that I’ve dealt with. In addition to their internal users, they serve this data up to the public 24/7 via an online, interactive web interface. Their data layers include:

    Utilities – Sewer, storm, water, streets, street signs.
    Land use – city-owned land, parks, open space.
    Environmental – Contours, slope, wetlands, streams.
    Planning – Zoning, comprehensive plan, buildable land.
    Parcel mapping – Taxlots, easements, property info, plat info.
    Boundaries – City limits, neighborhood assoc, special districts.
    Site Addresses – Master address file, geocoding.
    Digital imagery – Orthophotography, LiDAR, DEMs.

    They also develop and support applications for other city departments. Users of the custom mapping application developed in MapObjects include the police (in patrol cars on rugged laptop computers), EOC (Emergency Operations Center), public works, parks, planning, engineering in addition to managers and office staff who are able to print their own maps instead of relying on other city personnel.

    Earlier this year, the city conducted a survey to measure GIS usage. Following are the results:

    GIS as a business tool image
    How does this compare to your GIS user base?

    Do you know how many people are utilizing your GIS and understand what they are using it for?
    Does the city management/city council understand the benefits the GIS provides?

    In a conversation I had with the GIS Manager, I think it was summarized best in the following statement:

    “How do you put a price on instantaneous information?”

    An example was used regarding utility infrastructure. How would one, without a GIS, communicate the status of the utility infrastructure system for a maintenance or development project? It would involve finding, organizing and collating paper maps (probably from different departments and maybe from different agencies, including utility companies) in a manner that would effectively and efficiently serve the requestor. That process would take several “man-days” and painfully slow interdepartmental/interagency coordination. And, at the end of the day, the product would most likely be substandard to a GIS-derived product.

    I equate it to, if I may be so bold and over-simplistic, to maintaining ones vehicle. You can choose to spend the time and money to change the oil, maintain the brakes, change the transmission fluid, change the windshield wipers, wax the exterior, vacuum the interior, etc. and the vehicle will run smoothly and reliably and serve you well. On the other hand, if one does none of the above maintenance, there is a high probability that you’ll have several catastrophic vehicle failures that will consume time, money and add undue stress in dealing with ongoing problems. Dealing with emergency situations is always orders of magnitude more expensive than regular maintenance.

    To me, that’s the issue.

    So, while you’re focused on building your GIS, it’s easy to get caught up in the technology and forget about the economics behind it. Someone is paying the bills and those folks need to understand the benefits of maintaining an up-to-date GIS if you expect them to continue to provide funding.

    Thanks and see you next week.

  • GIS + GPS+ GSM = LBS

    Location-Based Services (LBS)…

    Make no mistake about it, LBS is a monster and it’s not even started to ramp up yet. The pieces are there…GIS for the map database, GPS for positioning, and wireless networks for communicating.

    It’s a super-dynamic scenario where all three technologies are changing, if not structurally, at least at the content level. For example, the trend in the GIS component isn’t necessarily structural (eg. database technology), but the content is evolving rapidly. Remote sensing data is more accurate, has greater coverage and is readily available. Digital map data in general is much more available and much more accurate.

    The current scenario takes me back to the 1980’s when the Personal Computer was in its infancy. I remember a friend of mine purchased a new PC in 1986. I had some experience on a TRS80 (remember that one?) via a college course. We unpacked his new toy, plugged it in, and turned it on. After booting up, no Windows interface (not invented yet), no friendly prompt guiding us to the next step. Just this…

    C:

    It took me awhile (months) to figure out that computer hardware and computer software were two completely different animals. In fact, not until I started working for a computer manufacturer did I really understand the importance of computer software. Without application software, a PC is just an expensive box that takes up space and eats electricity.

    Do you see where I’m going with this?

    Today, we have all the components to make a good PC box (GIS+GPS). What we are sorely lacking, and I believe very early (similar to where we were with the PC in 1986) is LBS application software.  The PC has transformed our lives in the last 20 years…and LBS will transform our lives in the next 20 years.

    As my compatriot Kevin Dennehy reported in his GPS World LBS newsletter this month, Nokia held their Nokia World 09 conference.
    At the conference, Nokia held a worldwide “Calling All Innovators” developer contest to promote the technology.

    “For all the noise about its dominant market share, Nokia could not shake the public feeling that it was losing ground in the hearts and minds of developers and the public in general. It is in part rooted in its poor performance in the United States, where most of the social networking application have the roots, and also the largest user communities,” Babcock said. “With its current 5 percent to 7 percent market share, Nokia has its work cut out for it.”

    Why the focus on Nokia? If you recall, in 2007 Nokia acquired Navteq for US$8.1B. Navteq is one of two street-level map database companies (the other being TeleAtlas) in a duopoly magnified by the explosive growth of automobile GPS navigation devices. Nokia are also the largest mobile phone producer in the world. They understand the GIS+GPS+GSM=LBS formula and they’re betting the farm on it (or at least the livestock).

    The technology and infrastructure are set, now it’s a matter of creating applications. Nokia is smart enough to create some of the obvious ones. But, they (and the consumers) need unorthodox and creative start-up companies to start offering LBS products/services to see what sticks. Clearly, the next LBS-style Facebook, Myspace, etc. is out there waiting to be plucked and brought to market. It’s not Loopt, it’s not Google Latitude.

    But make no mistake about it, knowing where you are, where your assets are and where your family members are in relation to everything around us is going to be as ubiquitous as the mobile phone you carry today. You will know where your kids, your spouse and your friends are (if include them in your personal network), at anytime. You will know where your vehicles are, at any time. And if you allow it, a coupon will pop-up on your phone display giving you a few bucks off your lunch order at the McDonalds you’ll be passing in a mile.

    And, believe it or not, GIS is the foundation on which all of it is built.

    Thanks and see you next week.