tv Key Capitol Hill Hearings CSPAN September 25, 2015 4:00am-6:01am EDT
that -- it gives us, frankly, the envy of other countries. when they talk about the benefits and the values of america, one of the things you will hear when you travel outside this country is that frankly, their awe at the fact that we can have the peaceful transition of power that we have every four or eight years. and that is because we invest in this democracy. why do we want to do anything to curtail anyone's participation in what has been an example to the world and has to be the beacon that we use to ensure freedom in this country. the message from the department we will not stop in these efforts. we will not be deterred and we will not rest until we've secured the right to vote for every eligible american. [ applause ]
and ochk that extends beyond the courtroom and the actions that we bring. working with many members who are sponsoring this wonderful weekend and other members of congress as well, we've promoted legislative proposals to resore the voting rights act to its full and proerp. we've profoesed legislation that would expand access to polling places for those living on indian villages and other tribal lands. we cannot have a situation in this country where the original americans are kept out of the participation in the bounties of this land. we cannot have that. [ applause ] we do this also through our monitoring program, we monitor federal elections and we've actively enforced the national voter registration act to protect those who are registering to vote, as well as the rights of our uniformed
members of the military. and overseas citizens who seek to vote as well. keeping on to what makes them quintessentially american. we will also protect their rights as well. and of course, the right to vote follows, from one of our nation's most fundamental promises, that one should have to endure indiscrimination or unfair treatment based on who they are, where they live or what they look like. the justice department is proud to stand on the front lines of the fight against hatred and intolerance and we're working aggressively to combat the violence. we have tools that are effective. the matthew shepherd crimes prevention act, signed into law by our president, president [ applause ] with this law we are enhanced our ability to hold accountable those who victimize their fellow
american because of who they are. and we've worked with our state and local partners to make sure that hate crimes are identified and investigated and we have continued to bring and will continue to bring federal hate crime charges, including our current prosecution of dylan roof for the murders of nine, nine people of faith, nine people of god. >> lord have mercy. >> at mother emanuel church in charleston, south carolina just a few months ago. for many of us, as we sat and watched that event, and watched r us back to a time that we thought was over. >> yes. >> this is a new day. look who's in the white house. look who's at the department of justice. we thought we had moved past those stark reminders that there are people there who live in a world of hate and will seek to act on it. we thought we had moved past this history of bigotry and brutality.
we thought we had left behind the pure intimidation and cruelty of the night riders, those who come in the night and try to keep you. we thought we had moved away from that. and for many of us it took us back to another time we thought we had erased away forever. a time where just 52 years ago this week four little girls went to church one morning, they went to sunday school one weekend, and think war there attending a sermon entitled "the love that forgives" and they didn't come home that day. they didn't come home to the four families who live on with the loss of their children who suffered the bomb at the 16th street baptist church in birmingham. now just in the days after that bombing 52 years ago, i was four years old. and my father like all parents looked at me and my two brother es and wondered, how do i
protect my children. how do i keep them safe. not just from the enemy next door but from the world that wants to tell them that they're less than. a world that wants to tell them that they're different. a world that wants toyáx/ñ tell that they don't matter and that they are simply cannon fodder. and he like all parents who were committed to the cause decided what he had to do was keep working, keep ming, keep pushing, keep advancing. and there were no guarantees 52 years ago when four little bodies did not come home. people did not know if we were going to get a voting rights ablth. didn't know if we were going to get a civil rights act. nothing was guaranteed. but with a deep faith and commitment, people pushed forward. we're at that same point again. in the days just after that bombing more than 8,000 people, people of all colors, people of all creeds and backgrounds, races and religions, attended a memorial service for those young victims.
and one of the individuals who gave one of the many stirring eulogies and remembrances of the day was the reverend martin luther king jr. and of course he was familiar, not just with the town but with the church. and not just with the church but with the families. and not just the families, but the four little girls themselves. and in his address, at a time of great tragedy and great challenge, he urged his fellow citizens to channel their grief, to harnness their energy saying we have to look passionately and unrelentingly for the american dream. the people sitting in the pews on the dark day 52 years ago, as my father looked at his children and wonder how he could keep us safe, could hardly have imagined the progress we've made thanks to their efforts and the work that would follow. they could hardly have imagine thd group, the congressional
black kau sus itself in size and strength. they couldn't have imagined this weekend, over 40 years of comments, of thoughts, of philosophy and teaching. they couldn't have seen who would be sitting in the white house today, sitting in a meeting with an attorney general who was that little girl whose father looked at her 52 years ago and said i have to protect. but they knew that there were better days coming. they knew that if they pushed forward they could move past the pain of a bond that tore apart a church. and they knew that their work wasn't over just as our is not also. and we have more work to do and we're here today to get started. and by that i mean people who have been here work rg going to continue. those people who where younger and new to the cause will join in and we will keep pushing ahead. every american has the right to grow up in a community and a world that offers not just responsibilities to uphold but
also opportunities to succeed. because every american has the right to live in a country that will support them and that will protect them no matter where they live, what they look like or who they are. and every american, every american has the right to a justice system that gives them a fair opportunity to grow, to learn, to improve. [ applause ] and to contribute. and every american has the right to make his or her voice heard. and this isn't just what i believe or what you believe. it is what this country believes. it is what this country means. it is what this society believes. and it is what america has always promised to every man, woman and child in every community across this nation. and i'm here to pledge to you today that neither i nor the department that i am so proud to lead will ever abandon or work
to make that promise real. but we need your help and your partnership. just as we have in decades past, to bring our country closer to its highest ideals. and we do look out and we see dark days at the times. as people did 52 years 0 ago. but just as they did then, they looked around and saw strength. they saw support. they saw fellowship. they saw commitment. they saw what i see when i look out over this extraordinary gathering today, and they saw what i see which is a people that will not be stopped. >> come on, now. >> the people who will not be silenced. [ applause ] a people that will not be held back and a people that will always, always reach back and lend a hand and pull someone along with them. that is what we do. that is how we've made america great today. that is how we make america live up to its promises to all of us
and that is how we will go forward in all of the challenges that we have to face. thank you for your time. thank you for your attention. thank you for your commitment to this important work. [ applause ] >> there is nothing that should follow that on this panel. i am glad that on a panel dominated by men we had a woman to lead us out. i'm glad to make a doctor to doctor introduction in transition. dr. melissa harris perry will be coming up to lead the next section. michael rogers told capitol hill lawmakers that if foreign ministers were using a private e-mail server, it woulde an intelligence opportunity.
that is after senator tom cotton asked him about about hillary clinton eats use of a private e-mail server. >> the nsa is in cherj of information insurance operations for the federal government, meaning the nsa is in charge of our national security systems. am i correct that nsa from time to time will also help federal agencies protect their unclassified systems? >> yes, when they request assistance. >> i realize this is before your time, but to your knowledge did the state department ever ask nsa about the wisdom of setting up a private server for hillary clinton? >> i'm not aware if they did or did not. >> what would be your response if the current secretary of state said i want to set up a private nongovernment server and use that to conduct personal business. >> you walereally want to drag into this one? >> i would simply like your
professional opinion. >> my comment would be you need to make sure you're complying with the applicable regulations of your department. i'm not smart about what the rules and regulations are for every element across the federal government. >> are the communications of the senior most adviser to the president of the eyes, even those that may be unclassified, a top priority for foreign intelligent services in your opinion in. >> yes. >> if an nsa employee came to you and said we have a reason to believe that russian foreign minister or iranian foreign minister is conducting official business on a private server, how would you respond? >> from a foreign intelligence perspective, that represents opportunity. >> are you aware of any nsa officials who e-mailed secretary clinton at her private account. >> no, i have no knowledge. >> aware of any nsa officials who were aware that secretary clinton had a private e-mail
account and server? >> you're talking about before my time. i just don't know the answer. >> can i ask you to check your records? >> yes. it's a good question for the record. >> thank you. you can see all of nsathbr director rogers' testimony about cybersecurity before the senate intelligence committee at c-span.org. rer nbc 4. this is marion berry's place? y he comes in all the time. i went back to the office and called him up, said, mr. mayor, i've just been to club 55. don't you realize people are watching what you do and where you go. they say you sit there all the time and watch naked dancing girls. there was a pause on the phone and he said, it's nice, isn't it? >> this sunday night, paul sherwood on the political corruption in d.c., maryland and
virginia. >> i think 44 attorneys general from around the country signed a letter saying they agreed with governor mcdonald, that what he did was politics, not bribery and that these gifts were -- he should have reported the gifts. $15,000 for a child's wedding, a 50,000, 70,000 dollars loan. the problem was bob mcdonald has had been considered potentially a vice presidential candidate was in over his head. this is another case where you're a public figure and you let your messy private life combine together. >> sunday night at 8:00 eastern and pacific on c-span's q&a. some federal agencies like the census bureau and cities like chicago are using data information to connect with communities. we heard about that at a data transparency conference this week. this is 45 minutes.
good morning. i'm herschel chandler. i'm here today representing act-iac. a private/public partnership dedicated to improving government. act-iac provides an objective and trusted forum where government executives collaborate to address key issues facing our government. it's been my pleasure to be a leader in the act-iac transparency and federal funding project where we have volunteers from dozens of companies in a vendor neutral, practitioner focused, trusted forum. white papers, panels,
conferences, and workshops have been produced and made available to the public. check out actiac.org/dataact for our data outputs. it's thus with great pleasure i'm able to introduce dr. mark doms. dr. doms' career has centered around data. for the past three years he's served as undersecretary for academic affairs in the department of commerce. in that role, he had three main responsibilities. first, mark led the economics and statistics administration, which includes two of the nation's preeminent data organizations, the census bureau, and the bureau of economic analysis. these agencies collect and produce information on the united states dynamic population and economy. publishing vital data to our nation's citizens, businesses and leaders. the census bureau and the bureau of economic analysis combined have over 10,000 employees and have a budget of over $1 billion. his second responsibility was
being a top economic adviser. he contributed to and spoke on a wide variety of subjects including open data, trade, manufacturing, taxation, innovation, immigration, and education. his contribution was often abou what data can be used to better understand the issue at hand. his third pillar of responsibility was leading the commerce department's strategic plan for data transformation. he detailed the strategic plan for the department making sure federal data are optimized to benefit american businesses, policymakers and people. prior to becoming undersecretary, dr. doms served as chief economist. he met with business leaders listening to concerns and insights and providing overviews of the u.s. economy. prior to joining commerce, dr. doms spent most of his career helping to guide monetary policy in the federal reserve system. he is a leading researcher, an expert in the areas of innovation, productivity, wages, manufacturing, and price measurement.
dr. doms received a bachelors in mathematics and economics from the university of maryland, baltimore county and ph.d. in economics from the university of wisconsin-madison. basically throughout his career mark has either used data to answer questions or made data available so others can do likewise. he's happily known as a fellow data geek. if that wasn't enough to convince you of that, he has three separate computers, is further evidence. join me from welcoming dr. mark doms. [ applause ] >> herschel, thank you very much. thank you for inviting me. so we're going to sit here for just a second and see if this works. there we go. okay. so i'll just say next slide. the first slide, please. what i'd like to do is talk for about 20-25 minutes and then let's open it up to
conversations. and the main things i want to talk about today is data and what's happening in our country and what role the data transparency coalition fits into that and what role you fit into that. we live in quite exciting times. if you look at this map, this is a map of street closures in downtown d.c. because of the pope's visit today. the reason i'm showing this is that ten years ago producing this map would have been really hard. and this map exemplifies a lot of the points that our previous speaker spoke about. there's a lot of data now that is in standard geospatial format, so making maps like this is a lot easier than it used to be. you see a lot more maps. there's a huge demand for geospatial information. people will always want to know what's happening where and how that relates to other points, geographically speaking. so if we think about this industry, we've also seen a huge explosion of the tools to make
these maps. we've also seen a huge explosion in the people who have the skills to make these maps. so it's basically this trifecta. you have people with the right skills. you have the right software tools, and you have the data. with all of that combined, you produce better outcomes. and so in this case you can now, in today's world, produce more and more maps to get the information that people really need. so when you look at these maps today, just always keep in mind that ten years ago making these things were really hard. today they're a lot easier. and, again, the concepts behind making these maps is very similar to the concepts that you heard our previous speakers talk about. so let's go to the next slide. so the summary -- next slide. the picture of my cat, you really want to see it.
oh, great, okay. so i think we really are at this tipping point if we think about the data revolution. we're really at the tipping point as a society of really beginning to benefit from this. think about what's happening over the past couple of decades, what we've really seen is a huge explosion in computer technology and software technology and communications technology. now with those key components in place, we can really take advantage of the huge explosion of data that's occurring. but how quickly is this going to happen? how quickly are we really going to see the benefits from this? that depends on a couple things. first, which is the theme of 1sv this conference, is it depends on how quickly we make really important data accessible. and when i say the word accessible, you heard hudson speak different concepts about the data being in standard format, sometimes you hear interoperability. there are buzz word bingo you can play with this. it's basically making data accessible and also usable.
and then, also, what i've just seen repeatedly in application after application across just a wide variety of data fields, is not only do you just need the data, you actually need people who know how to analyze data. and that's something that i think we're in relatively short supply. i'll talk about that a little bit more. so, one reason that people are getting so excited about data and you hear about it all the time, you always see these graphs, okay, and these graphs always have a certain flavor to them.ufjk so these graphs always have, on the vertical access, some measure of data volume. and sometimes it's a word you've never heard of before like petabytes or something like that, bigger than terabytes, some huge amount of daz7xd it's always increasing. the horizontal axis you have time, what's happened in the past and what's projected into the future. then you actually see a line that shows, you know, how much data there is going to be. what these lines always show they're sharply curving up. that means the amount of data that's accessible and usable by people is accelerating.
okay? and usually the arguments you hear for this acceleration and the amount of data that's accessible, you have this open government data efforts, which you've heard a lot about today. two, if you think about the private sector, the private sector is gathering and processing more information than ever. and then what gets people really excited about the future is just the internet of things, just how much data we'll be able to gather from censuses and so forth. those volumes will be really huge. so there's this huge amount of information, whether it's about information on governments, whether it's information on the private sector, whether it's information from somewhere else, from the scientific community, for instance. so the question that i would like to talk to you about is, why do we care? why is this so important? and basically when we look at data, we want something from it.
so the previous speaker spoke about how we want more sunlight into how government works. we want better information about how government is spending its money, for instance. okay? so i'm an economist. that's my background. so i'm also asking these questions not just can we have more insight into how our government works but how is this really going to help our country? so if we're thinking about all of this data and then getting these outcomes, whether it's better knowledge of our citizens or a better gdp growth or something like that, i would like to present the simple model. so i'm going to go through three simple steps. i'm going to start with the last one first. so it's going to be this kind of data to outcome model. so that's how do we go from data to get the outcomes that we want? i'm going to simplify it and you'll notice the acronym is doms, which is pretty cool.
at the end we want better outcomes. ok? so we want those outcomes to be. they usually fall into three buckets. the first one is what i call smarter governments. so we heard, again, the previous speakers talk about, you know, government being better, being more efficient, better able to meet its mission, to do that with less resources. so that is a huge, laudable goal, especially given how big governments are. so we have the federal government, which is literally trillions of dollars. it's double digit percentage of gdp. state and local governments. and working in commerce, we do a census of state and local governments and i want everyone to think about a number really quick. how many state and local governments are there? there's 50 states. there's about 3,000 counties. and then you have a lot more after that. everyone think of a number. the number is 91,000. there are 91,000 local governments out there. okay? there's one federal government.
50,000 state governments, 3,000 county governments. there's 91,000 state and local governments. so that's just really huge. so we want smarter government. it's not just the federal government we're talking about but, again, the previous speakers were talking about it's also the state and local level. secondly, from a macro economist level, we want our businesses to benefit from this data and they can benefit in two ways. one, they can use data themselves to be more efficient, so they can be more competitive. two, as represented by a lot of the people in this room, there are businesses in the data business, as is herschel, a lot of the companies that are represented here. and so this is an industry that's really important. it's growing, it's something the u.s. has a comparative advantage, something we want to trade surplus and these are jobs that pay really well. so this is an industry that we really want to support. and then, finally, as also we heard earlier, more informed citizens.
when we think about the benefits of data, think about it until it falls into the three buckets, we want better government, we want more competitive businesses, we want a stronger business community, because that's where our economic growth and welfare comes from, or we want kind of more informed citizens so we have a better idea of what's happening in the world, how our governments are working, so on and so forth. so that's what we're really striving for. that's the outcome that all of us are working towards. so how do we get there? well, we get there, the preceding step is the analysis of data. so there's all this data out there so how do we analyze it? well, one, we have the software tools. again, if you look at the map that we presented at the beginning, there's a company that has a lion's share of the market, just has done a tremendous amount of work to make geospatial data standardized and using these tools, gis, for instance, it's really easy to make these maps. and there's a lot of other software tools out there.
on the side now that i'm unemployed, i'm trying to learn. the first 30 minutes was kind of fun. after that got a little frustrating. i think i'm just going to hire somebody to do that. and then, second, you think about computer hardware, and if you think about, say, storage capacity, cloud capacity and what not, these things are now a commodity. ten years ago the stuff was a lot more expensive and was a real inhibitor, but the prices have really fallen. and then finally the point i was making before, human capital. and so human capital, that's an economics phrase. what it means is the skills of our workforce. we need not just data scientists and programmers, but we need people who really understand what's going on. so if we got all the financial data across all the government agencies in standardized format you need to know people who know how government actually works, for instance, to really make sense of that data. if you c'2!1áqp&ly large data sets -- and i'll put on my
statistician's hat for a moment. the bigger the data sets, the more correlations you'll find just by chance. okay? so as we get more and more and more data, you're going to have more and more correlations. how do you filter those out to really figure out what's going on? that really requires subject matter expertise. when i talk to people in health care, when i talk to people in the private sector, when you talk to people who are working criminal justice, for instance, you can have the best data scientist in world, but that has to be coupled with knowledge of what's happening in the industry. and, you know, an example of this is where you just need that kind of -- when you need that common sense. i'm going to tell a data joke. there's not a lot of data jokes out there so forgive me. it's a low bar. so there's three statisticians and they're out hunting, right? so they're out one morning, it's a beautiful day, a beautiful fall day like today. and they see this buck 100 yards away from it. the first statistician gets out his rifle, lines up a shot, squeezes the trigger. bullet goes five feet to the left of the deer.
the second statistician goes, huh, lines up his rifle, takes a shot, bullet goes five feet to the right of the deer. the third statistician packs up his rifle and goes, looks like we hit it."hyz so that's just an example of where you need not just people to look at data and understand data but they actually have to understand what's really kind of going on to make the right inferences. if anyone has any other data jokes, let me know. i know just one other that i can tell you afterwards. okay. so, again, the ultimate goal here is that we get better outcomes, whether we're talking about better governments or making our businesses better or citizens more informed, we have to analyze data and have some constraints there. what we also need and the data transparency coalition has been great at this, is we need the data itself, the building
blocks. and so you hear about data and i use the word kind of integrity often. you actually have to know where the data comes from. there's a lot of junkie data out there. the agencies i used to oversee, they prided themselves on producing high-quality data about our people and our population. and as this data is exploding, there are real questions about data, what do we actually know about it? and often when you're looking at complicated questions, you have to get those types of answers. and then we've all talked about common formats and standards because you want to reduce the cost of combining all these data sets. so it's one thing to -- another thing is you have to make your data sets easy to find. so if you go to data.gov, for instance, depending on the day you're looking at, last time i looked about 114,000 different data sets there. it's hard to find information in this kind of data revolution where data sets are just exploding both in terms of size and in terms of number. it's like how can you make these things easy to find? that's something we've been working on.
and then the ability to merge data. this is related to the standards. because combining data is where you get the real value so you can have the single data set and i'll go through a bunch of examples. it's when you combine data from here to here and put those things together. think about the map we just showed, right? there's information about the map, the city maps of d.c. it's combining those things, presents a good visual representation to everybody about how this is going to affect their commute and how it's going to affect their day. okay. so that's the simplified models. when i think about all the data stuff, because there are so many words out there, people talk about data, these are the buckets i put things in. let's go into these just a little bit more. now in the first bucket in terms of data accessibility, let me tell you about what we've actually done at the department of commerce and, you know, why you may actually care about that.
a lot of you probably don't know. it's like this big holding company. let me go through a couple of examples. we have noaa. they're the folks who monitor our climate. they're the folks who monitor our ocean. they monitor our fisheries. they monitor solar activity. just their weather data alone is about 30 terabytes a day. and they have this problem of how do you get 30 terabytes of a data a day out the door? so they're working with the private sector in new and creative ways in doing that, but that's just a huge, physical challenge that they face. and how do you make this data accessible? it's a huge problem and we can talk more about that later. then we have bea. they're the good folks who produce the number of gdp. i think a lot of you have heard about that, the current account deficit, what are our interactions with the rest of the world? how does that affect the u.s. economy?
so when you think about gdp, that's a relatively simple number. when they release their annual revisions, they go back in time, they literally release 5 billion data points on the u.s. economy. they produce a lot of detail and do that over time. it's a tremendous amount of information and it's really hard to get to. so how can we make that information easy to get? you may have a question about consumer spending in a specific category. how can you as a data customer quickly find that information without wading through just hundreds of pages of documentation? that's a big challenge. okay. and then the census bureau. so the census bureau is the definitive source of information about our people. and this is where the good data comes from again. so when census collects data on people, they care about everybody in our country. when you see data from a lot of these kind of private sector sources, you always have to question how representative is that data of the country?
and sometimes you see this in public polling, right? so if you have a pollster who doesn't have access or can't use cell phones, which is a big issue, those numbers can be quite skewed. you think about the last election, there were a lot of polls that were really off. so when you're looking at data about people and you're looking at it from these private sector sources, there are these huge questions about the quality of that information. so the census does a great job of that, when you go to the census website, the data is so what can we do to make it much easier to find? if you want to know what's happening to your community, what does your community look like? let's say you're moving to the d.c. area, as i imagine most of you live here. it's like what does falls church look like relative to, say, bethesda? how does that compare? the people in those communities, do they have characteristics you're looking for? you should be able to find that easily. right now it's pretty hard. and then we have pto, the patent and trademark office. when you're an inventor and 8,qd you're making inventions and all sorts of things which is so
hugely important to the growth of the u.s. economy, you have to look at the patent database. and right now a lot of that data is very unstructured. it's not machine readable, and not all the data the patent office has is out to the public. so i think for a lot of the data that hudson and company are asking for in terms of financial conditions, in terms of financial interactions of the federal government and state and local as well, there's a strong analogy where they're sitting on a bunch of data that hasn't been opened up yet. that's exciting. these are some of the efforts px we're doing, the department of commerce, but they all have very common themes. we want to get the data so people can find it and use it. so let's talk about the analysis of data. so, again, i mentioned one of the biggest constraints we have as a country, and for those of you in the private sector you probably have a hard time hiring people who have these skills of kind of looking at this data. and when we looked at this and there's a lot of people in our
society, over 10 million, who really are data insensitive in their day-to-day jobs. so there's about 150 some odd million people in the workforce who work for employers and then about another 20 million to 30 million who are self-employed. so 10 million are very data intensive. we expect that to increase over time. we need to develop more people with these skills who can look at the data and make the right differences. looking at the big data sets it goes beyond excel. at the department of commerce, finding people who had the ability to take large data sets and do something intelligent with them was hard to do especially on the federal pay scale. when i talk to my friends in the private sector, salaries are really high for this. we have to do a better job of educating people to get them into the pipeline to be able to do this type of stuff. so better outcomes.
so, as i said, smarter government. now we could talk about smarter governments all day. if i'm thinking from an economics point of view about where we're really going to move the needle a lot in our country, where data can really help, data often really helps where there's a lot of uncertainty, where we don't know stuff. and we've all been -- we've all had experiences, say, in the health care sector for ourselves. right now sharing information across the health care industry is very difficult. precision medicine is impeded by the ability to share information about our dna, for instance. so this is an area that is ripe for huge improvements because of data. health care is about 20% of gdp. this is huge. so if we can improve health care just a little bit, we can make it more efficient. that could have huge benefits for society.
again, a good friend of mine works in the criminal justice area. there is just so much we don't know about the criminal justice system. we think about all these financial records across state ' and local governments and all these government agencies and how they don't talk to each other. they're in different formats. the criminal justice system is much worse. so if you think about all your state and local law enforcement agencies even within those agencies data doesn't talk to one another. so my car was stolen july 3rd. my car was stolen in front of my house and so i called the police and they come and -- has anyone had a car stolen before? so the police assume that you just forgot where you parked it. okay? so once you get over that and you tell them you weren't drinking too much the night before and they actually drive around your neighborhood to see if they can find it, you know, my car was actually stolen. and then they're like, well, it was either sold at a chop shop or they're out joy riding and it will pop up in a few weeks. fine.
i'm talking to my neighbor, she said, well, my car was stolen, too. what i did, i went to the d.c. government website. and the one part of d.c. government that works really, really well, is parking tickets. right? everyone knows this, right? and they're really efficient. what you should do is go on the website a couple times a day and see if your car gets a parking ticket and you'll see -- and if it got a parking ticket, it wasn't at a chop shop and, sure enough, it did. and so it got a parking ticket for a license plate removed and the ticket told me exactly where the car was. so i went there. the car had been moved, but the data systems of the parking folks in d.c. government did not talk to the police department, right? so the person issuing the ticket had no idea that the ticket they were issuing to was a car that was stolen. so that's an example of the
criminal justice system where you have these data systems that aren't talking to each other at all and there's so much room for improvement and then if we think about we have all these big questions today about incarceration, what people -- what we should do, what our laws should be for certain violations. what are the effects of all of those laws? we really don't know. it's amazing that we're making such profound decisions about people's lives in an area we just don't know very much at all. but we're making big steps for that. one other example of merging data to really understand things. one of the last things i was able to do before i left office was to start this process of merging data on our veterans with data on their employment outcomes. so why do we care about that? we really want to know what happens to veterans when they enter the workforce.
we want to know how that varies depending on how many tours of duty you did, how long you were in the service, what you did in the service, which service you were in. we want to know what the relationships between all these things and what veterans programs you received. we don't know how efficient these -- we don't nope the outcomes of all these veterans programs. the budget is $163 billion last year. and we just don't know very much about the efficacies of these programs. we just don't know. but by merging these data sets together, we can figure this out. and then as i mentioned more competitive businesses. if we look at the u.s. economy, the economy has been growing 2%, 2.5% the past couple of years. one of the really big questions we have out there is productivity growth. most of you probably don't think about productivity growth that
much, how fast the economy grows, it's how fast our labor force is growing plus how efficient our economy is becoming. if you add those two up, gdp growth. so what we're seeing the last four years productivity growth averaging less than 1%. historically that's low in the united states. that's kind of really retarding u.s. growth. how are we -- why aren't we growing faster in this kind of data revolution where we hear all these great things about data? there's this big conundrum there. maybe what it is we have all these businesses gathering all this information and they really haven't yet materialized the benefits from all this data. but from a macro perspective that's a huge, huge question. so how much all this data stuff we're talking about, how much can this improve the u.s. economy, okay? so i'm just going to throw out a couple real rough numbers here. there's lots of studies out there who always talk about trillions of dollars and billions of dollars and so on
and so forth. i always find those numbers really hard to understand. so i'm an economist. i've been studying the economy for most of my professional career. i can't understand $1 trillion. maybe if i was warren buffett i could understand $1 trillion. he has yet to adopt me. you have all these numbers. let me put them into context. if this could improve the economy by just 1%, think about the improvements in government that we could get from this. think about the improvements in the private sector. 1%. that's not very much. is it? okay. so 1% of gdp is $175 billion, that's hard to relate to. it's such a big number. that's $543 a person. so that's about $1,300 for a typical american household. that's a lot. the median household income is about $52,000. so that would be a nice, big bump. that's if we could improve the economy one percentage point
from this data revolution. let's be a little more optimistic. over the next couple years this data revolution can improve our economy by, say, 5%, which i still think is actually a conservative estimate. it improves it by 5% and you get close to $1 trillion, a concept hard to understand, but that's over $2,700 per person. and, again, there's about 2.4 people in the typical american household, so now all of a sudden you're talking over $6,000 per household increase. so that's why everything that you're doing, everything that everybody out there is doing in this data space is just so important because the better information that we have, the better way we can analyze it, the better decisions we can make and get better outcomes and get those better outcomes because we get better government, because our businesses become more productive. we have businesses who actually thrive in the data space.
and then, more importantly, what's not quantified here is that our citizens become more informed. so i'm not sure what dollar value put on that but that's important as well. so let me repeat the main takeaway here which is i think we really are at this tipping point. we have more and more data. we have more and more groups like this who are advocating to make data accessible, to make it usable and make it more actionable. but how quickly we reap these benefits and these benefits are actually huge. the numbers i gave you just a moment ago, i think those are actually somewhat kind of conservative. that's not much of a stretch to get there but they could make a huge improvement in the quality of lives of just so many hundreds of millions of people who live here. we have to make our data more and more accessible and then we also have to -- one of the biggest constraints i think we're facing and, again, when i talked to the people in the lcñ private sector, we have to invest as a country to the
skills so we can really take advantage, so we can really leverage this kind of data revolution. so with that, thank you very much. [ applause ] all right, thank you, mark. we have time for about ten minutes of questions. and i think i'll start out with the first one. so you just finished six years in government. you've been advocating for fact-based decisions. you've been advocating for the release of high-quality data. what's next?'(!8÷ >> so i'm single, so i'm looking to marry an heiress. this is on tv, that's unfortunate. i don't work for government anymore, so i don't care. but more seriously, one thing i'm thinking about doing is writing a book. and what i'd like to do with this book is talk about all the different areas where data can really improve the quality of
our lives and really improve our country. what i notice across all these different areas whether you're talking about accessibility of data, the federal government on the spending side, whether they're talking about the health care data, the veterans data i've talked about, there are all these common challenges and how do you get all this data together while maintaining privacy? so on the one hand, we have this ability to take all this data -- many different aspects of our lives, combine it so we can answer these important questions. again, just think about the veterans example, but how can we do that while maintaining privacy and also the perception of privacy? the american public is getting very concerned about information that the government has on them and what the private sector has on them. and we want to use this data on people for good, okay. so if i got data on veterans,
for instance, i could combine that with the labor market outcomes. if i could look at their credit scores, i could look at how much debt they have, are they making their mortgage payments, for instance? i could better design veterans policies. i could better design programs while they're within the department of defense so that they have better outcomes when they leave the defense department. but to do that, i would have to combine data from lots of different sources, and our society, i think, is grappling with this big question about how do you do that while maintaining privacy and also just kind of this perception of privacy? people give a lot of their private data, their personal data to private companies, so facebook knows a lot about me. facebook provides me a service in return for that. but when it comes to the government doing this or even
the private sector getting more and more data on us, there's this real fear. i think it's a balance. the more data we have, the better decisions we can make. but the more data we have, the heightened anxiety of people. so how do we have this conversation with folks in order to do this? so this is something that i'd like to really work on in some capacity at some point because, again, i really do think that if we can really leverage all the information out there, we can really move the social needle quite a bit. >> so if you have a question, raise your hand. we have a couple mikes. right there is the first one that i saw. >> hi, i'm from the organization of leading excellence. we go back to your d.o.m.s model. one of the things i noticed absent from that model was the very beginning divisioning or hypothesis making before the data is examined or collected. so to the extent that data analysis and data decision making is a science, to what extent does there have to be active manipulation in terms of
experimentation to get the type of data you need to make the decisions you want and make sure those decisions are the right decisions? >> yeah, it's an excellent question. when it comes to hypothesis testing, that's what's really hard. and so when i was talking about having experts in the field, that's where you really, you know, need that expertise because, again, you get the big data sets, and statistically speaking you'll find lots of correlations, spurious patterns. there's this phrase we've heard which is correlation and causation and lots of examples of that. and i think what's going to happen then is you've seen this before in slow motion, it's entered a process that goes back and forth. we have data. you look at the data and say what hypotheses can i test with that data? what data do i want? and you collect that data. and as anyone who has done big data science before, wow, that was really cool. but then the number of questions begins to multiply even more
than the data sets themselves. and so i think as a country we have to be more adept at saying, okay, this is the information we have. this is what we can glean from it. based on the hypotheses we can't answer what data should we be gathering? and so we have to make sure that the causality goes both ways from a hypothesis to data and data back to the hypotheses. >> thank you. >> this side of the room. way over there. run, hudson, run. >> hi. i think you had a lot of very good points. you said 91,000 local governments and the potential to impact every citizen in the country. usually when we talk about open data we talk the national or a national level, at very broad scales. in your opinion what are ways to make this part of the vernacular of the way every citizen or business thinks, acts, and the smallest local government to consider how they change their processes, their policies in terms of making data and open data analysis to improve local outcomes?
>> so when it comes to local governments, this is fascinating. i think we have some groups here that present local governments. we traveled around and spoke to lots of different local government organizations and as one of the previous speakers said, some of the big cities, for instance, released data in a pretty good way, right, and there's a company or two here, one of your sponsors, they work a lot with kind of local governments in making their data accessible and i think what they are still in the early stage is what data do people want? my assistant and i went to chicago. they're at the vanguard at the local government level. and they have this active community of, what do you call them, warren? hackathons. they would have them once a
month where they would bring in these people -- the city of chicago would say, here's our data, do something with it. i think what they found is sometimes there were data sets they were releasing, yeah, there's not much use for that. sometimes there are data sets that they found really interesting. and so, for instance, one example of local governments doing good local government stuff in chicago, you take a picture of a pothole, you send it in to the public works department of chicago. it is then posted on the department of chicago's website so then they are accountable for filling in that pothole in a timely manner because everybody knows when that pothole was posted onto the website. so i just thout that was like a great example of making information available that then made the government accountable to addressing these concerns of the citizenry. the challenge we face, as i mentioned, we have so many local governments. a lot of these local governments
don't have the resources to do this type of stuff. when you talk about data science, making data open, analyzing, making sure it's out and what not, a lot of the local governments are just really hamstrung. if you look at the most recent recession, one of the biggest drags we had on our economy coming out of the recession was a state and local government sector. so employment just plummeted in the state and local sector because they were really hurt in part because of the housing crisis.
the property values went down. their tax revenues went down. and so i think what's happening at the state and local government level is that you have some that are really kind of out, the vanguard, some that aren't. the ones that aren't, are often, you know, they aren't because they're hamstrung. so maybe what we really need here are kind of standards, suggested best practices, across all these local governments and now that we've been doing this pretty well for a couple years for some of these larger entities, they're beginning to learn what these best practices are. but we need more -- i've spoken to many of these local government agencies. they get this. the state and local government level, again, i think it's that human capital constraint they're really facing. >> ellen? in front. >> i thank you so much. i'm helena sims, director of intergovernmental relations for the association of government accountants. one of the questions that i have in the human capital aspect that came up in the last question is relevant to this. all this data stuff, as you mentioned, is taking place at the same time where people are getting frustrated with the cost of higher education, and at the
same time we need people trained in this data stuff. what implications do you think that has for educational system in terms of -- what's the best way to educate people who are knowledgeable on the data stuff? >> one thing -- so i worked in the administration for six years, and one thing the administration really pushed on at the federal level is not a strong lever at all is the community college system. so if you look at community colleges across the country, i think they're just doing a better and better job, better working with local business to better match the skills of workers with those businesses. when you survey businesses, what you often see is that there's this skills mismatch issue. so businesses say we're not hiring because we can't find people with the right skills, right? and so how many millions of people are in the mismatched category? those estimates vary quite a bit. it's literally in the millions. when we're thinking about getting these skills to do the data stuff, we have to think, i
think, outside the traditional kind of four-year college, you know, degree. and also this is the big concept society always pushes is lifetime learning. so how can you learn these skills? and so literally i'm teaching myself a data processing language. what i'm surprised at is there are any number of online courses that basically don't cost anything for me to learn this. now that requires a certain amount of dedication and also i happen to know a lot of people who know a lot about this stuff, so i'm pointed in the right direction. but fundamentally we know that the cost of higher education, the rate of inflation for the past couple of decades has far outstripped the rate of the economy since it's very high. but i think one of the big answers to your question would be kind of the community college system is just really huge. and then also you see even colleges, carnegie melon is a case in point, in your first year now you take a computer programming course. okay? they just view it as a way to think, right?
this is something that everyone should be familiar with. when i was in high school a long time ago, i took four tran, for instance, but that was the exception. that wasn't the rule back then. so what can we do to teach kids today the things about coding? and, again, when i was undersecretary, you travel around quite a bit, it was a lot of fun. there are a lot of these camps where kids would learn java and then make an app, right? and it makes it really fun for kids because i think the way computer science used to be taught was done in a really nerdy way. it wasn't done in a very exclusive way. and you heard the same thing about math education as well. not only is it not where you get your education but it's also where, you know, how the stuff is taught, which i think is really fascinating as well. different people responded differently to different types of education. how things are taught. >> thank you. so that was a great question and will wrap up our keynote. thank you very much, dr. doms. [ applause ]
pope francis's visit to the u.n. continues with his speech to the u.n. general assembly. his speech is at 10:45. and later he gives a service at 9/11 memorial. >> on the next "washington journal," pope francis's visit to new york, his second city on his u.s. trip. and then an interview with tom roberts of the national catholic reporter. washington journal is live every morning at 7:00 a.m. eastern on c-span. we welcome your calls and
comments on facebook and twitter. the hope's visit to the united states continues saturday as he travels from new york to philadelphia. live coverage starts at 4:30 p.m. eastern as pope francis speaks at independence hall. and the pontiff presidential candidate lawrence lessig talks about his suggestion of running for president. and on c-span 2's book tv saturday night at 10 p.m., bill o'reilly talks about his book "killing reagan."
and on sunday, doug casey discusses his latest book on economics. and on c-span3, we're live from gettysburg college to mark the 125th birthday of president dwight d. eisenhower's birth, discussing his military and political career with his grandchildren, susan and mary eisenhower and a documentary film on the king and queen of afghanistan's visit to the united states. get our complete weekend schedule at c-span.org. >> up next, congressman darrell issa, answers questions on how congress will handle the issues
of data and transparency going forward. his remarks are about 50 minutes. hello, everyone. thank you all for being here and braving the traffic in support of open and structured data. my name is jonathan elliott. you really don't want to hear me talk as much as you want to hear this man to my left talk, so i will be quick. research data group provides compliance services and software tools to public companies to help them communicate with investors and comply with regulations with greater ease. and we've been in this industry for nearly 30 years. we're excited to see all the changes that have taken place recently. our country is pushing forward with real changes to help our system be more effective and efficient. those words do not normally correspond with government but we are making the move in the right direction. the single most important change in the past ten years has been the passing of the data act, and we are very, very enthusiastic
and executive member because we understand that opening up government data can help everyone in the country. not just idealistically but it is going to be in a very practical sense. the organization and linking of information that will help individual citizens, politicians, investors, institutions, and municipalities make better decisions in almost every aspect of what they do. the data act is the first open data law and many people have a hard time trying to understand just how large an undertaking the transformation from static documents to structured and searchable data is. our speaker for this panel is representative darrell issa. he is the champion of the data act, and understands what lies ahead. he knows that successful implementation of the data act cannot be achieved by one team, one person, or one government agency.
it requires a concerted effort from so many agencies and individuals. he also knows that there's more to be done and more leadership required to truly transform the way our government reports its information and he's going to touch on the next steps here today. without further ado, representative darrell issa. >> thank you. [ applause ] >> with that kind of an introduction i should just take the applause and leave. first of all, thank you very much. the one name that you didn't mention that without partners on the hill, things don't happen. and my partner in the data act on the senate side was senator warner. and i think it's extremely important to understand he was the one that went to harry reid and then the majority leader and demanded that the bill be moved. and although senator reid insisted that it be a senate bill, ultimately it's the senate, you have to expect that. but ultimately we made law together. the data act is just as it was
said, a major piece of legislation, but it's just the start. you can write legislation, but unless you oversee it and implement it and you're just diligent day after day, it will be meaningless. the fact is, today, there are many cios who still are, in fact, not competent, nor do they have the financial controls, the budget controls, of their projects. another major stumbling block. it doesn't mean there hasn't been law passed. it means that we, in fact, have to stay on top of that. and we have partners in that effort. i think most congressmen have
one thing that they can do very, very well and that is, they can talk about their next piece of legislation. so i want to get that out of the way so there not be any mystery. the financial transparency act, obviously, law, but the next steps are to insist that we make all data in government just as good and just as available. and some of it is hard. just before i was coming up, they started saying, well what happens if you're taking a picture of a pothole, how do you make a picture of a pothole machine searchable? well, if you're using a camera that's modern, you are going to have the gps location. you are going to have the time and date. you are going to have rich metadata that if not lost, does make that unique location for that unique picture at that unique time extremely valuable and searchable. it may not tell you why it was taken, it may not tell you whether it's been fixed, but at least it's a start. i want to mention one thing that
i have a passion for and that's modernizing foia. the data act is a standard that helps a tool and the freedom of information act, is today, in my opinion, a great success that is a fraction of what it was intended to be and it could be. every day, countless individuals, companies, news organizations, and law firms try to receive information. the first thing that happens it goes to a human being who begins a search process who then begins looking through the data in order to redact information that's not going to be given. literally a human nightmare to try to do. under the data act, we envision that metadata will be so easily searched, that when you're looking for it, you won't even have to ask, because the vast
majority of information that is being asked for, will already be available on-line with appropriate personal identifiable information and other fields that have been predetermined as at least not available openly, being removed. so foia will be limited to i looked at the data, the data indicates something more and i believe i have a right to some portion of what is redacted. knowing what you're asking for and cutting down the number of foia requests because the majority of what you want is available, on-line, searchable and to be developed is a good start to making government open and transparent. i think one of the most important things i can do at my age is tell the young people in the room how we got here and why we shouldn't be here, but why it was logical somehow to get here. nearly 40 years ago, actually 40 years ago, plus, i ran my first
computer program. well, actually i ran part of it until the card popped up showing me i had a flaw in my program. yeah, yeah, giggle in the back, you haven't actually held a stack of cards with three failures in it only one of which you get shown because you have to run it again with that corrected before you find the next mistake, line by line by line. in those days, we all understood that each card was simply more or less a 0 and a 1, that everything was purely data and we were turning it into something. over the next few years, we turned computer programs into devices that could be run for all the errors. you could bypass an error and find a next one. we also began printing out massive amount of ascii
characters on printers. absolutely useless information unless you read it. behind that if you had an index you could find out anything you wanted to find out about the data you were building. at that moment, whether it was a dek or a digital corporation for those who came after it, or an ibm or an hp, or a myriad of other companies, many of whom ncr and so on are not here today, had we said, oh, we have the beginnings of metadata, we have what we index, let's store in those characters the index, we would have been fine. but we didn't do it. what we did was we went along with proprietary indexing, proprietary calls, little
characters that were embedded with no standard. and many organizations over the next decades built standard after standard after standard that were well you could have as many standards as you want and everyone picked a different one. today we know that we can build standards that everyone can use or export to or make available and still maintain their proprietary calls. that's the future now with us. so one of the questions is, how do we get from a law to implementation. and there are really three components to it. one component very, very clearly is public demand. the public has to look at the benefits they get from open data. all of us who know and can know where our airplane flight is coming in, or even when you're on the airplane, find out where you are, are benefitting from data that's been made open for the application industry. all of us who have an app, we all have an app, come on, everyone in the room has a weather app somewhere, okay, and
you only use it when you worry, but it is there all the time. again, data made open. but imagine if all the spending of government to all the vendors was made open and available as appropriate for nonclassified work. imagine how quickly we could find out that the government, through no fault of its own, paid ten different prices for the same product. and, in fact, may buy once from the company that manufactures it, once from a distributor and several times from retailers, and not even be aware that way they went out for contracts they did that. imagine how much savings we could have. let's also imagine a world in which government stops and i said there were three parts. government stops making that progress and goes back. what do we do about it?
is it natural for vendors to say, yeah, the data act is great but it might hurt my particular revenue stream downstream so i'm not going to do it. unless the executive branch says no, we really mean it, we're not looking for open software, but we are looking for open data and we insist on it. and imagine, as government goes from one program, one person, one time, to another, that congress simply closes her eyes and says we passed that law, we're good. i think you can quickly imagine if congress takes its eyes off the oversight then the weeks, months, years and decades will go by and we can still have legacy programs and post-legacy programs and post-post-legacy programs such as those programs from the '60s that the irs claims they're still using with computers that are pretty well from the '60s. we can still have that. we can pay a huge price for it. under the data act, the office of management and budget has
huge responsibility. treasury has a huge opportunity. but when i said that it's the public, the executive branch and congress, that have the primary responsibility, i should have said, it's the public that must demand, it's the public that must continue to demand, it's the public that must ask why not. because the only way to get the executive branch to stay on it, is for it to be important in a political sense. the only way for congress to stay on it, is for it to be meaningful at -- with organizations like this that are dedicated to it. and so i charge all of you, we passed a law, i intend for the rest of my career to stay on top of it, to the best of my ability every day, and to work with others, but you have an opportunity and many members of the coalition are doing it right
now, every time you build an app or try to build an app, to take advantage of data that's being made available, you market to the public the benefit of rich data sets and your frustrations need to be communicated in three ways to the executive branch, to the major representatives here today, to your congress, one might say i'm here today, and that was a good line, i'm glad somebody liked that. i'm it, i'm the congress for today. and lastly, you have to communicate it to the public. do not go quietly into it's going to happen next week, next month, next year, it's not in the budget this year. if you have a success, market it to public and tell them it's because of open data. and if you're being thwarted or delayed, make sure you go just as public with it. because ultimately somewhere, there's some bureaucrat, bless their hearts, i always say bless
their hearts when i don't mean it, and they are just a matter of weeks or months or maybe years from retirement, but they just don't want to have that challenge. well, the people that work for them and then come afterwards, want it. and so for all the young, energetic, government workers, who want to be thought of as the leading edge of good technology, make sure you go public if there's somebody there who is looking and saying, that will happen on the next person's watch because i only have three years left until retirement and that's too much hassle. now, you notice i didn't mention government contractors. i didn't do so because my assumption is, that contractors do what is important and what is put into the bids, so one of the areas that i'm working with other members of congress is to ensure that congress begins pushing the executive branch to make sure that it's in the bid and that there's a benefit in the bid. no subcontractor or contractor for the government or government agency directly should ever be
working with data modernizing a program and not have part of their incentive to take us from where we've been to where we know we have to be. and that's going to be a monumental change, on time, on budget, of course is important, on time, on budget and saving the american people countless billions of dollars over the next decades, by opening up data, that's got to be in the bid. you won't see it the day the software is delivered, but you will see it for a generation to come. just want to make sure i didn't miss anything. in closing, we're just starting. in closing, this is the opening round for open data. there are companies that will take advantage of it and make fortunes.
there are non-profits who will take advantage of it and embarrass people in the administration, not just this one, but the next one and the one after. and if i have my way, the same level of open data will be both great for the public and questionable for members of congress, as data gets opened in all the branches. so i want to just close by saying, this is a start, i'm delighted that you're here, that there is, in fact, a coalition that dedicates itself to the same thing that senator warner and i were honored to be able to start. so the conference is a delight to attend. i look forward to next year having a list of accomplishments because i believe that this presidency, which was promised to be the most open and transparent, does have an opportunity to show that it can open up government at the areas that are least understood and least transparent, and do it before the lights go off for this administration.
and i think they will. i think they've set a course. i think they've appointed good people. now the question is, will we hold them to a timeline that's the same as the timeline of the president? because if any timeline is offered to you, this year, and it's one day after january 20th, 2017, then it's not a timeline, it's a dream. we don't need dreams. we don't need promises. we need what will you deliver before january 20th, 2017. thank you very much for all being here. [ applause ] >> we're going to take some questions from the audience. i have a couple questions here as well. i'll just start. we'll take yours last. who else has a question. >> no, go ahead. >> one of the requirements of
the data act is that they have to get all of their not only what they're reporting but recipients have to report this information, grant recipients. what do you see as the biggest obstacle in achieving that and getting the grant recipients to report? >> that's a great question. the reason it's such a great question is grant recipients were the obstacle for this bill passing when it sat for two years. there were two reasons that there's an obstacle. one, i can understand, you're a university professor, getting a couple million dollars every so many years, and you're used to loosely living up to the grant, but perhaps, you know, hiring an administrative assistant here or there that only loosely work on the program. the data act is intended to really follow the money and see
whether it is auditable as being spent appropriately to whatever the grant was for. and we think that's important and we think that those who shy away from it often do so because it's nice to get a pot of money and i don't want to say laws were broken or anything else, but the co-mingling and the moving around of grant money has gone on at universities since i was -- way -- until the ncr 500 and me in university. so that's one challenge. the other challenge is, that we in government, we that write grants, we that take applications, until the administration realizes that no one should have to enter data twice. that every entity, every unique entity, should have a number and once it has a number, it's
personally identifiable information. it's single data base should be there so that just like most of us when we log in, we expect to log in and it doesn't matter whether it's the cloud with google or our local device, we want to log in, we want it to say hi, daryl, and we want it to have all kinds of information already there so we don't have to enter it twice. that -- and if you assume you're the university of california and information is automatically populated, it's not asking you endlessly to give it essentially the same information, but at the most, asking you to fact check what comes up, then you see a reason for this information delivered this way to be valuable. that is government's responsibility. live up to the dream that you shouldn't have to enter again and again and again even if it's a different agency, the exact same information and allow that information to be valuable to the universities overseeing in the case of university of california, thousands of grants. because we think that's a value to the grant recipient that they don't have today which is to oversee and organize its grants in an easy way from a federal data base. so one, we can't do anything about except insist compliance.
two, we can be part of making it better for the grant recipient so they're more incentivized to support this. >> you also talked -- hudson is waving at me. he's got an important question. please, go ahead. >> hi. my name is mary anne. i'm the cto of a company called x version and full disclosure i'm a software developer, so -- >> is that a confession or what? >> yes, it is. >> if you told me you were a lawyer, that would be a confession. >> well, um, one of the problems that we're seeing kind of like developing on the edge of the open data movement is that often the people who are releasing the data don't really understand what a personal identification sort of piece of data actually looks like. so they pick on the obvious things like names, social security number, phone number, address, but there are a lot of like nonobvious things that especially as more and more data is released from more and more
sources, people like me can take multiple data sets can run them together and figure out like who's who in the data set. you have things like things being hashed incorrectly which was a problem the city of new york had earlier this year, so my question for you, my biggest fear as an open data advocate this will create a political backlash at some point, what policies are being put in place by the law to help these agencies sort of like come up and educate themselves technically so that they're not releasing data that will bite them later on? >> that's a technical term, right? no, you have hit on one of the great challenges of metadata that's not properly defined. the federal government has an endless amount of history, how far we define what name is, social security name is, so on. programs have been written without, if you will, compliant
metadata identifiers and that has to happen. and that's -- that is a matter of going into it and saying, here is the federal standard for all of this. can we -- can our data be searched based on it. the last thing you want to do is deal with it as though you had five spreadsheets written by five different people who named the top of every cell a different name with a different width with a different whatever. you don't want that and don't need that and shouldn't have it. the fact is that government agencies need to put their data in a format wheres there's a standard comparison. having said that, your challenge as a software developer is in the current world, yes, you need to be able to -- and the post office happens to have great program that almost works. now come on.
if you take your data sets because you've entered, you know, name, address, zip code and so on, you've named it as well as you can, the post office -- and you give them the data, they have a wonderful program that actually corrects almost everything. it will change your abbreviation for avenue or street to make it compliant. it, of course, will add the zip plus. a lot of the fuzzy logic that it takes is pretty amazing to take really bad typos in data entry for names, addresses and the like for postal and make it right. software for the interim is going to have to do a lot of that with government data. you're going to, for the short term, be getting a lot of data that you're exactly right, somebody embedded the name, a second or a third time, without
a field indicator, and it's going to take some cleanup in logic. that's one of the opportunities, if you will, for software companies, particularly if they're working with the federal government on data act compliance, is to scrub the existing data, apply appropriate identifiable metadata, so it doesn't have to be further scrubbed in the future. and i believe that, you know, although they'll have to be funding from congress, that those earmarks, if you will, those actions to get data so that you're not as -- you sounded a little like a lawyer when you said in fear of litigation, you shouldn't be in fear of litigation. the government does need to scrub and clean up their data so that doesn't happen. if the post office can be part of the solution in the case of the data that i had in outlook, the fact is, that you and companies like you shouldn't have to worry about that data
being reasonably scrubbed. hudson, who else have you picked? >> hudson, we need a microphone down front. >> i think we've got -- there we go. >> thank you. >> hi. i'm jeff myers with rei systems. i first want to say i think the data act is fantastic. it provides a huge amount of valuable data. but i think of a particular use case. i would like to be able to look across the federal government and say where is all spending on the same program, where is all the spending on the same activity even if it happens in different agencies. where is the spending on the same mission and not just because i care about those spending, but because i want to figure out where there's duplication or a need to coordinate. my question is, will you or is there an interest or commitment to taking the data act further and saying, for example, right now, agencies are required to identify the program, but one agency might say, you know, it's water quality audits and another might say water quality safety and another might say it's water quality research, will there be an opportunity to take the data act further to further use cases like the ones i've described? >> the answer is, yes, and if we get cooperation from the administration, we shouldn't
need a new law. the office of management and budget in setting, if you will, or requiring that the -- all the agencies set a common standard, can do a lot of this. years ago, i had a simple task. everything is simple until you get the bill for it. all i wanted to know was, how many jet aircraft and prop, but mostly jet, does the government own? what models are they, how long have they been around, and who controls them? and this happened to be shortly after 9/11 so i was a junior member. they kind of laughed at me. as the years went on i kept asking it. it's amazing, even the department of defense has a whole joint group with multiple officers trying to figure out where all their aircraft are and who's controlling them and what they do. that's a pretty simple thing. i mean, these -- you spend a few
million dollars each, there's only so many let's call it a thousand noncombat -- nonfighter aircraft, it shouldn't be that hard to figure it out. i can tell you, if you wanted that information today, it would cost you -- it would cost the government a fortune to get it to you because the coast guard has their aircraft, and this group has one and so on. you're exactly right. interoperable standards where if it's the same thing it's named the same way, a similar thing, it has a number that is identifiable, or similar, so you can not only find exact matches but when there's a characteristic difference there's a unique metadata identifier. that's what we're getting to, is a standard setting for what you call something if it's identical and what you call something when there's a difference. omb has a responsibility to build, if you will, interagency
cooperation, to get that. candidly, d.o.d. hasn't gotten there year and we're hoping that they will be among the first because if i've got a caterpillar d9 tractor in the army, navy, air force, marines, and i need a part, and that part is somewhere in the world under some agency, the last thing in the world i want is to have that asset down while somebody is waiting to buy something we already own and the other one is heading to property disposal. but that happens today. it costs us billions fro. hudson, who else? if you run out of questions, i have a second speech. >> my name is tony. i'm a consultant with deloitte. i've heard you talk a little bit about the data act supporting this concept of establishing a common language.
i think it's a very powerful concept and i wanted to get your perspective on how that might impact congress's ability to support its function in its capacity to represent the people? >> okay. i'm going to answer the question as i interpret it. i think i heard you say that, you know, what will happen if the data act is fully implemented to where all data can be made meaningful by a common program searching multiple data bases, if you will, a little like google going out to every newspaper and seeing what they all wrote about their congressman. the questions that the american people have on whether their money is well spent, whether they're properly represented, whether the waiting time at a particular veterans administration center is based on an actual shortfall of doctors or, in fact, an
inefficiency within the hospital, questions like that, if the -- if our democracy, our republic is a representative democracy, but the ability of every individual to know more before they ask a question of their congressman or for their congressman or woman to be able to get the information directly rather than months or years later, can have a dramatic effect. i would love nothing more than for my constituents who are waiting for va service, to know what the wait times are at every hospital, what the ratios are between the number of doctors and the amount of care, and be able to say, you know, my hospital in brunswick, ohio, is underperforming and as a result, i'm waiting longer and not getting care. what's wrong? that kind of a question is so
much more powerful than i'm waiting what can you do for me and we try to get them into the hospital faster. so i see it as empowering to members. i do see it as producing a lot more constituent requests, but those will be very targeted requests because a lot of the information will already be information will already be either gleaned by the constituent or easily gotten by a case worker at a computer
weeks or months from an answer so i see that as part of it. much of this will only happen if the software industry supporting tools. i have no illusions that, you know, leonard wright, one of my there's no way that she's going those data bases, but that developed and made available if searchable and open google i don't care, in a way right i just want it available to my workers. >> hi. my name is darla and i'm with >> with what? >> terra data. >> yes. >> and one of the things that when you're implementing law, a lot of the heavy lifting happens at the agency level, and as we is talk with agencies a lot of them have different sentiments about the implementation of the data act itself, particularly being able to get value via analysis and analytics. i've worked with data from the usa spending and it is currently as a testament to the need for standards in the data act. my question for you is, do agencies have the capacity to fully implement the act? and the reason that i ask that
is, because some of them in casual conversation i hear, not that -- not conversations i've had myself, that some of them don't believe they have access to some of the data to be compliant with the act and so what would you say about that capacity issue? >> okay. i apologize, the agency you work with, i missed hearing that. >> i've worked for agencies before, but i don't currently work for one. >> okay. >> i work for terra data. >> okay. >> no, but i mean you said some agencies didn't think they had the ability. i wondered if you wanted to name one. you know, it's old habit from my days. the answer to a question like that quite frankly is a leadership question from the white house through omb. i agree that people at a given level may think they don't have
the ability or may validly not have the ability and that's the reason that if the office management budget leads on behalf of the president, what they're going to very clearly do is say, we need a plan from every agency. how will you fully comply? what are your road blocks? what is your funding estimate? what are your short term -- and it should be short-term easy low-hanging fruit. what portion of your data is already in a format that can be easily made available? what portion of your data isn't? what guidance do you need for setting metadata standards? all those questions, some of them have been asked by omb and i don't want to short count the administration's willingness to do some of this, but you're right about one thing, it is typical for an agency to say, oh, another mandate from
congress, it's unfunded because in their mind that -- whatever money they got wasn't for that. and i don't expect movement unless this process goes forward where agencies are directed to produce plans, they're given guidance, and in my estimation, when you look at the security exchange commission, who in many ways is ahead of it, but in many ways -- i can't say reconstitute -- i can never pronounce the word that says they don't want to do it -- but, you know, quite frankly some of the agencies, the fdic and others, are actually very far along toward having their data in the right format and very far along toward not providing it in
some cases. so this is where the president's leadership is important. you need to both shed light on the agencies that are ready to go, have a lot of it, allow them to lead in the best practices determination in helping other agencies understand what they need to do, and at the same time, you need to say and the law as written has to be implemented and if you think we need to change something let me know because that's why we have thousands of people who are called legislative, you know, presidential legislative appointees. there's an army of people appointed by the president that are supposed to be working on legislative challenges, and we will meet with them at any time if they say they need a follow on to the data act. chairman chaffetzs will meet with them any time if they need a follow on. until they tell us what they don't have and what they need, my assumption is it's a lack of leadership and when you hear that from an agency, the only question i would ask you to say is, what have you done to find out what your capabilities are, what they're not and how you're going to get from here to there.
you've been told where you have to end up. do you already know where you are. have you done an assessment of where you are or do you just say we can't do it? i would propose to all of you that if an agency can type a few keys and most can, at some level, and get a piece of information in a proper format
nice rows and they export it to excel, if you can do that in an agency, then you already have everything except metadata that is assigned to those fields that makes it interoperable. you are mostly there. that's where i think people misunderstand within an agency, within a particular program, almost everybody has everything they need. the only thing they don't have is interoperable identifiers embedded. if every agency were to publish their, if you will, their keys, and then you compare their keys for what it means to another agency's keys for what it means, you can build a table for bringing them all together. now, we do that every time we ask for a report that comes from ask for a report that comes from multiple agencies.