Key Moments

All The Government's Information

Google TalksGoogle Talks
Education5 min read52 min video
Aug 22, 2012|804 views|9
Save to Pod
TL;DR

Open government data via Washington Bridge: archives, feeds, openness.

Key Insights

1

Open data as a civic foundation: Malamud’s work shows that individuals can push governments to publish data online (SEC EDGAR, patents, Smithsonian contracts).

2

Public memory as a mission: a nonprofit framework (Public Memory Trust) aims to archive and stream Washington hearings for permanent public access.

3

Hybrid distribution model: live feeds from hearing rooms via fiber, transcoded to public-friendly formats (MPEG-2/4) plus a retail-like access path.

4

Public-private collaboration: partnerships with GPO, Cisco, Sun, Google, and others to supply infrastructure, storage, and metadata tools.

5

Metadata and annotation are essential: initial efforts will rely on broadcast data, automated processing, and crowd-driven annotation to enhance searchability.

6

Policy and disruption risks: the project highlights tensions around net neutrality, media middlemen, and the funding/ownership of public information.

INTRODUCING A TROUBLEMAKER AND A VISION

Carl Malamud presents himself as a practical troublemaker transforming information access. He recounts decades of work turning government standards and documents into online resources, from attempting to publish ITU and ANSI materials to releasing blue-book standards as online data. His timeline includes translating obsolete formats, fighting bandwidth limits, and documenting the evolution of the Internet as a public research network. The core message is that dedicated individuals can seed large-scale change in how government information is produced, stored, and shared with the public.

UNLOCKING GOVERNMENT DATA: SEC, PATENTS, AND BEYOND

Malamud recounts launching free online services for SEC EDGAR data, and briefly mirroring patent office data when access was blocked by policy. He describes engaging with lawmakers and agencies—receiving resistance that later yielded to practical demonstrations and public pressure. The narrative emphasizes how a combination of grassroots activism, technical ingenuity (mirroring data), and strategic meetings can move reluctant institutions toward openness, transforming public records into accessible online resources for citizens, journalists, and researchers.

SMITHSONIAN SHOWTIME CONTRACT AND TRANSPARENCY FIGHT

A major thread in the talk is the Smithsonian Showtime contract, alleged to grant Showtime a right of first refusal without public disclosure. Malamud describes filing FOIA requests, the public’s pushback, and eventual scrutiny that frames transparency as a live political issue. He argues that long-term, secret agreements on public assets are antithetical to democratic accountability, illustrating how public access extends beyond data portals to the governance processes surrounding cultural and scientific archives.

FROM INSIDE WASHINGTON TO OUTSIDE: IPTV AND A PERMANENT RECORD

The talk shifts to a broader need: an enduring, searchable archive of congressional hearings beyond what C-SPAN or private vendors provide. Malamud notes the limitations of current webcasting, the gaps between transcripts and media, and the potential of IPTV-style delivery to sustain a permanent public record. He envisions a Whats-from-Washington pipeline—comprising live feeds, archives, and searchable metadata—that would empower researchers to locate, cite, and reuse content long after a hearing ends.

THE WASHINGTON BRIDGE: A PUBLIC MEMORY PROJECT

At the core is the Washington Bridge concept: a gateway that connects 16 hearing rooms to the Internet, enabling live streaming and permanent archiving. The plan contemplates converting 280 Mbps of raw feeds into scalable formats ( MPEG-2 at around 50 Mbps or MPEG-4 at 8 Mbps) for distribution to major platforms (Google Video, NBC) while also offering a retail-like access path. This architectural idea blends high-quality feeds with broad public access, turning government proceedings into a globally navigable resource.

BUSINESS MODEL AND PARTNERSHIPS

Malamud outlines a 501(c)(3) nonprofit—Public Memory Trust—with a dual funding model: hardware donated by industry and cash sponsorship. A government-printer partnership (GPO) via a fellow program is proposed to provide staff and public-sector access while maintaining independence. The strategy includes two access routes to agencies: as independent media and as a federally supported facilitator, enabling faster deployment and broader reach without becoming a pure government service.

TECHNICAL DESIGN AND ARCHITECTURE

The technical core centers on aggregating 16 high-quality feeds and distributing them via modern Internet infrastructure. Initial feeds would leverage existing cameras, pool feeds, and newly credentialed staff to fill gaps. The plan calls for co-location facilities (like Equinix) to host transcoded streams, robust firewalling, and scalable storage (eventually petabytes). The architecture emphasizes interoperability, with an emphasis on staying tech-agnostic enough to adapt to evolving codecs, storage, and delivery networks.

METADATA, TRANSCRIPTION, AND ANNOTATION CHALLENGES

A central challenge is producing usable metadata and searchable transcripts. The project plans multiple strategies: best-effort broadcast metadata, automated speech-to-text where feasible, and leveraging net-native annotation by users. The team contemplates xmpp/Jabber for live event coordination and a distributed annotation model, so data quality improves over time. The balance between automated techniques and human curation remains critical because high-quality metadata dramatically improves long-term value for researchers and the public.

INTERMEDIARIES, MEDIA, AND POLICY IMPACT

Malamud discusses how this openness could disrupt traditional intermediaries—lobbyists, specialized feeds, and major networks—while empowering broader media and independent voices. As more content becomes publicly accessible, traditional gatekeepers may lose some leverage, while the broader ecosystem (including blogs and citizen journalists) gains a richer, more immediate feed of information. The talk also touches on net neutrality as a live concern: who controls distribution, and how content is prioritized or throttled could influence democratic discourse.

REAL-TIME FEED, RESOURCE LIMITS, AND QUALITY CONTROL

A practical element is staffing and operations: a 12-person team (executive director, facilities manager, network and video engineers) plus field personnel who can operate cameras. The plan anticipates a mix of in-house monitoring and field deployment, leveraging existing schedules and infrastructure to minimize costs. Quality control becomes essential, especially in Washington’s environment. The approach emphasizes hands-on oversight of camera work, feed integrity, and responsive maintenance to keep the archive accurate and useful.

ACCESS TO THE FLOOR: PUBLIC DOMAIN FEEDS AND SEARCH

The floor of the House and Senate sits at a unique intersection of public domain material and access rights. Malamud notes prior success in publicizing floor audio and transcripts, and envisions robust search capabilities (speaker IDs, topic-based queries) to enable precise retrieval. The integration of transcripts with audio, improved speaker identification, and a flexible search interface are presented as essential features that would transform public engagement with legislative processes.

REPLICATION, NET OPENNESS, AND GLOBAL POTENTIAL

Ultimately, the project aspires to replication beyond Washington—state capitols, other capitals, and international venues. The model aims to be transferable: a funded, open-access infrastructure with public-private partnerships, capable of sustaining permanent archives and promoting transparency. Malamud sees a future where libraries, universities, and governments collaborate to replicate the architecture elsewhere, gradually phasing the initial nonprofit out in favor of a sustainable government-led or community-supported ecosystem.

Washington Bridge: Quick Do's and Don'ts

Practical takeaways from this episode

Do This

Pursue hardware donations from Cisco and Sun to seed the Washington Bridge project.
Engage government partners via the GPO fellows program to create public-private collaboration.
Stream 16 Washington hearing-room feeds in high quality ( MPEG-2/ MPEG-4 ) to a central co-location facility for distribution.
Archive all public proceedings for permanent access and searchability.

Avoid This

Don't rely on ad-hoc fee-for-service models or private monopolies to fund the project.
Don't block or degrade public access; avoid creating gatekeepers that restrict end-user access.

Common Questions

The Washington Bridge is envisioned as a gateway connecting congressional hearing rooms to the Internet, streaming up to 16 high-quality feeds live and archiving them permanently for public access and research. It would convert multiple feeds into MPEG-2/ MPEG-4 formats and distribute them via co-location facilities so end-users and media can access the content. Timestamp referenced: 881.

Topics

Mentioned in this video

personPaul Vixie

Internet pioneer mentioned as an example of people building the Internet worldwide.

personKen Burns

Documentarian referenced in relation to the Smithsonian Showtime contract issue.

personAndrew Partan

Engineer who helped with initial routing configs; part of the technical team.

bookExploring the Internet

Carl Malamud's book documenting the early development of the Internet and related standards work.

personJun Murai

Japanese Internet pioneer referenced as someone Malamud met while building Internet infrastructure.

personBruce James

Public Printer of the United States; discussed as a potential partner for online government data.

personSteve Wolfe

Cisco executive involved in the project; speaks to networking and data-center aspects.

toolEDGAR database

Securities and Exchange Commission's electronic database of company filings; highlighted as a data source to be online.

personNewt Gingrich

Former U.S. Speaker of the House; target of a public-protest during the SEC data project.

personDeb Roy

Programmer noted for work on speech-to-text and audio/video processing; identified in anecdote about MIT talk.

personEric Schmidt

Former Google executive; involved in discussions about potential support and advice.

toolAVOC (Audio Visual Operations Center)

Verizon's fiber-based broadcast distribution center used to feed hearings; central to the live streaming plan.

personCarl Malamud

Speaker of the talk; advocate for putting government data online and building public-access projects.

personAl Gore

Former U.S. Vice President; targeted in a campaign to bring government data online.

personRandy Bush

Networking engineer mentioned for his work on iij and routing; referenced as a technical contributor.

personHank Kilmer

Storage architecture expert referenced as part of initial engineering support.

personVint Cerf

Internet pioneer; co-designer of TCP/IP; referenced in connection with potential Google involvement.

More from GoogleTalksArchive

View all 13 summaries

Found this useful? Build your knowledge library

Get AI-powered summaries of any YouTube video, podcast, or article in seconds. Save them to your personal pods and access them anytime.

Try Summify free