Data Visualization

From dftwiki3
Revision as of 08:21, 27 November 2009 by Thiebaut (talk | contribs) (Morse-Smale Complex, and the visualization of "big data")
Jump to: navigation, search

The different visualization systems shown below are organized by application domains, and by type (using a scheme used by Viz4All.

The application domains include:

  • Business / Stock Market            
  • Demographics
  • Geographical
  • Government / Politics
  • Internet / Search
  • Lexical / Text
  • Multimedia
  • News
  • Product Search / Shopping
  • Social Network
  • Surveys
  • Sports

The types include

  • 1-D               
  • 2-D
  • 3-D
  • hierarchical
  • multi-dimensional
  • network
  • temporal


Contents


Graphite

Category:
Where: CMU and Lawrence Livermore National Lab
Implementation: network
Date: 2008

We present Graphite, a system that allows the user to visually construct a query pattern, finds both its exact and approximate matching subgraphs in large attributed graphs, and visualizes the matches. For example, in a social network where a person’s occupation is an attribute, the user can draw a ‘star’ query for “finding a CEO who has interacted with a Secretary, a Manager, and an Accountant, or a structure very similar to this”. Graphite uses the G-Ray algorithm to run the query against a user-chosen data graph, gaining all of its benefits, namely its high speed, scalability, and its ability to find both exact and near matches. Therefore, for the example above, Graphite tolerates indirect paths between, say, the CEO and the Accountant, when no direct path exists. Graphite uses fast algorithms to estimate node proximities when finding matches, enabling it to scale well with the graph database size.


JellyFish

Jellyfish.jpg

Category: Lexical / Text
Where: DMI Boston
Implementation: network
Date: 2005

Jellyfish visualizes an encyclopedia of the arts. The project should be seen as an experiment, which deals with a dynamic interface. The purpose was to remove a static, conventional design and to achieve a playful interface. The application was developed in Processing and uses an XML database to update content. 2005










Viz4All

Category: Survey
Where: University of Maryland, College Park.
Implementation: mixed
Date: 2005

A survey of Internet Visualization Tools

Viz4All is a survey of visualization tools examined by graduate students participating in the spring 2005 Information Visualization class at the University of Maryland, College Park. Our criteria for visualizations on this website: are designed for use by the general public (not specialists); present practical, useful, and possibly entertaining information; with an interactive interface that gives users control over the display.


Well-Formed.Eigenfactor.org

Wellformedeigenfactor.png
Wellformedeigenfactor2.png

Category: text / lexical
Where: University of Washington.
Implementation: network, temporal, hierarchical
Date: NA

Eigenfactor is a non-commercial academic research project by the Bergstrom lab in the Department of Biology at the University of Washington.

Interactive visualizations based on the Eigenfactor™ Metrics and hierarchical clustering to explore emerging patterns in citation networks. A cooperation between the Eigenfactor Project (data analysis) and Moritz Stefaner (visualization).

The visualization is dynamic, and generated with Flare (prefuse's successor).

The map visualization puts journals, which frequently cite each other, closer together. You can drag the white magnification lens around to enlarge a part of the map for closer inspection. Clicking one of the nodes will highlight all its connections. If a journal is selected, the node sizes represent the relative amount of citation flow (incoming and outgoing) with respect to the selection; otherwise, they are scaled by their Eigenfactor™ Score. Map calculated with Cytoscape, visualization built with flare

We use a subset of the citation data from Thomson Reuters' Journal Citation Reports 1997–2005. The complete data aggregate, at the journal level, approximately 60,000,000 citations from more than 7000 journals over the past decade. For an interesting subset, we select journals ordered by their Article Influence™ in 2005, but include no more than 25 journals from a single field. To make the subset coherent, we make sure that selected journals are included all years and that we cover the 10 journals with highest Eigenfactor™ score. To cluster the networks, we use the information-theoretic method presented in Maps of information flow reveal community structure in complex networks (PNAS 105, 1118 (2008)), which can reveal regularities of information flow across directed and weighted networks.


Visual Thesaurus

Visualthesaurus.png

Category: text / lexical
Where: NA.
Implementation: network
Date: Present

The Visual Thesaurus is an interactive dictionary and thesaurus that allows you to discover the connections between words in a visually captivating display. Written in Java.

A free version in Javascript is also proposed at http://www.kylescholz.com/blog/2006/06/javascript_visual_wordnet.html

The Visual Thesaurus is written using the ThinkMap SDK, available at http://www.thinkmap.com/thinkmapsdk.jsp







The Best Tools for Visualization at ReadWriteWeb

Readwriteweb2.png

Category: Survey
Where: NA
Implementation: NA
Date: Compiled 2008 by Sarah Perez

This is a collection of sites and packages used for displaying (mostly) social networks, but not only. ( pdf list)













GapMinder.org

Gapminder.png

Category: text / lexical, government / politics
Where: Professor of International Health at Karolinska Institutet, Sweden
Implementation: network
Date: Present

Watch a great presentation by Hans Rosling on world statistics on TED. He uses very clean, simple graphs (Flash, very likely) showing the variation of world data vary as a function of time. The message is extremely convincing.















CityMurmur.org

CityMurmure.org.png

Category: Geographic
Where: Politecnico di Milano
Implementation: geographic
Date: NA

CityMurmur is a Web application that periodically scans a pool of news sources, blogs, and online newspapers searching for references to local streets, points of interest and areas of the city. using this information the application is then able to plot topographical and semantic maps of the city according to the topic discussed by the news source( culture, society,...), the source's topology (blogs, online newspapers), and its scale (local media, regional media). </i>


Representing Graphs as Trees

GraphSpanningTreeBrowser.jpg

Category: Geographic
Where: University of Maryland
Implementation: network
Date: NA

Taken from "Visualizing Graphs as Trees: Plant a seed and watch it grow", Bongshin Lee, Cynthia Sims Parr, Catherine Plaisant, Benjamin B. Bederson. http://hcil.cs.umd.edu/trs/2005-23/2005-23.html

Abstract. TreePlus is a graph browsing technique based on a tree-style layout. It shows the missing graph structure using interaction techniques and enables users to start with a specific node and incrementally explore the local structure of graphs. We believe that it supports particularly well tasks that require rapid reading of labels.

Tree Browser for the Encyclopedia of Life

EncyclopediaOfLifeTreeBrowser.jpg

Category: lexical text
Where: Biodiversity Heritage Library, The Field Museum of Natural History, Harvard University, Marine Biological Laboratory, Missouri Botanical Garden, Smithsonian Institution
Implementation: network
Date: NA

An interesting way of representing trees taken from the Encyclopedia of Life (eol). See the video tour at http://www.eol.org/content/page/screencasts.













Chris Harrison's Wikipedia Visualization

HarrisonWikipedia.jpg

Category: Geographic
Where: Politecnico di Milano
Implementation: network
Date: NA

During [his] time at AT&T Labs, which coincidently has a great information visualization group, [he] started [to] think about how to visualizing something as massive as Wikipedia. With roughly 1.5 million articles (vertices) and tens of millions of article links (edges), a comprehensive visualization package would have to found or built. After playing around with GraphViz, but getting frustrated with layout limitations, I decided on the latter option: build!
















PubNet

PubNet authorship.jpg

Category: text / lexical
Where: Yale & Rutgers Universities
Implementation: network
Date: 2005

PubNet: a flexible system for visualizing literature derived networks, reviewed by Shawn M Douglas,1 Gaetano T Montelione,2 and Mark Gersteincorresponding, author1,3

Abstract: We have developed PubNet, a web-based tool that extracts several types of relationships returned by PubMed queries and maps them into networks, allowing for graphical visualization, textual navigation, and topological analysis. PubNet supports the creation of complex networks derived from the contents of individual citations, such as genes, proteins, Protein Data Bank (PDB) IDs, Medical Subject Headings (MeSH) terms, and authors. This feature allows one to, for example, examine a literature derived network of genes based on functional similarity.










VisPedia: Standford's Visualization of Wikipedia

Visipedia.png

Category: text / lexical
Where:Stanford
Implementation: network
Date: 2009

"We present Vispedia (live at vispedia.stanford.edu), a system that reduces the cost of data integration, enabling casual users to build ad hoc visualizations of Wikipedia data. Users can browse Wikipedia, select an interesting data table, then interactively discover, integrate, and visualize additional related data on-demand through a search interface and a query recommendation engine. This is accomplished through a fast path search algorithm over a semantic graph derived from Wikipedia. Vispedia also supports exporting the augmented data tables produced for use in more traditional visualization systems. We believe that these techniques begin to address the "long tail" of visualization by allowing a wider audience to visualize a broader class of data."


NetSci 07: Visualization Competition 07

NetSci07.png

Category: Survey
Where:New York Hall of Science
Implementation: mixed
Date: 2007

Visualizing Network Dynamics Competition, At NetSci07, New York Hall of Science, Queens, NY, May 20th-25th, 2007





























Map of Science at Los Alamos

MapOfScience.jpg

Category: text / lexical, Internet / Search
Where: Los Alamos National Laboratory
Implementation: network
Date: 2009


Los Alamos National Laboratory scientists have produced the world's first Map of Science—a high-resolution graphic depiction of the virtual trails scientists leave behind when they retrieve information from online services. The research, led by Johan Bollen, appears this week in PLoS ONE (the Public Library of Science).
























Vizster

Vizster.png

Category: Social Network
Where: Stanford
Implementation: network
Date: 2005

A project from Jeffrey Michael Heer at Stanford (http://jheer.org/vizster/), using the Prefuse package.

(Some notes about Prefuse can be found here)












NeoFormix.com

Neoformix2.png

Category: Text / Lexical, Art
Where: NA
Implementation: misc
Date: 2008

A sample below of many visualizations from NeoFormix maintained by Jeff Clark.

Obama visu1.png































Cinegraph.viz

Cinegraph.png

Category: Multimedia
Where: The GeoVISTA Center and the Department of Geography, The Pennsylvania State University
Implementation: multi-dimensional
Date: 2007

A visual tool to explore the relationships in the Internet Movie Database. Implemented with Improvise.

Cinegraph is an interactive visualization for exploring and analyzing the InfoVis 2007 contest data set derived from the Internet Movie Database (IMDB). By combining two complementary visual interaction techniques, cross-filtered views and attribute relationship graphs, Cinegraph supports a wide variety of general and highly focused analytic tasks. Users can express complex lines of questions in the form of rapid sequences of simple interactions. Designed and built in a little over two days by a single visualization designer using the Improvise visualization environment, Cinegraph provides high-dimensional interactive drill-down capability into the people, genres, awards, release dates, and box office characteristics of movies described in the database, using ancillary photographs of people, images of movie posters, and icons of movie genres to enhance the interaction process. (from http://www.cs.ou.edu/~weaver/improvise/examples/cinegraph)

Watch the movie on Cinegraph...

Morse-Smale Complex, and the visualization of "big data"

Morse-Smale-Complex.jpg

Category: 3D, Scientific, algorithms
Where: UC Davis
Implementation:3D
Date: 2009

Basking in Big Data [1], or how Visualization software makes viewing and interacting with enormous data sets practical without a supercomputer.

Recently [...] researchers at the University of California, Davis, and Lawrence Livermore National Laboratory announced that they have developed software that makes analysis and visualization of huge data sets possible without the aid of a supercomputer. The researchers' algorithm (the Morse-Smale Complex algorithm) slices up data into more manageable chunks, then stitches it back together on the fly, so that the data can be manipulated in three dimensions, all on a computer with the power and capacity of a high-end laptop. (from Technology Review, http://www.technologyreview.com/computing/21976/?a=f )

Slides of presentation at VisWeek 09 [www.idav.ucdavis.edu/~garth/vis09-tutorial/pdfs/childs.pdf here].

Cartograms

Maps of the 2008 US presidential election results, generated by M. E. J. Newman, U. Michigan.
http://www-personal.umich.edu/~mejn/election/2008/

2008Election1.png2008Election2.png

AtlasRealWorld.jpg

The map on the left is geographically correct. The map on the right shows the states deformed in such a way that their area is now proportional to the number of electoral votes they carry.

The author is a co-author of the Atlas of the Real World (Hardcover) published by by Daniel Dorling (Author), Mark Newman (Author), Anna Barford (Author), published Oct. 2008.

More cartograms can be found here...








Skyrails

SKYRAILS: 3D OpenGL visualization software.

Skyrails.png


Great video on YouTube at http://www.youtube.com/watch?v=I2d312_dXEs










SecViz.org

Some graphs taken from the SecViz.org site, dedicated to visualizing security information.

Geo Tagging an Attack The INAV software package for visualizing connection information in real time API Calls and Imported Symbols of Nepenthes Download Binary Files

GEOTaggingAttack.gif

INAV1.png

APICalls.gif


24 hours of firewall logs plotted by source port over time

Tenable Network Security's Security Center includes a 3D visualization tool

24hour firewall.png

TenableNetworkVisualizer.jpg

The Rest of the Genome

By CARL ZIMMER Published: November 10, 2008

From http://www.nytimes.com/2008/11/11/science/11gene.html?pagewanted=2&_r=1, NYT article of 11/11/08, on Thomas R. Gingeras of Cold Spring Harbor Laboratory. He is a leader of Encode, an effort to determine the function of every piece of DNA in the human genome.


RestOfGenome.gif

Digg HeatMap

From http://brian.shaler.name/digg/heatmap/, a dynamic interactive flash display of readers as they "digg" stories.

DigHeatMap.png

Bubble Chart from the NYT

RichardFuldCEOCompensation.png From the Oct. 7, 2008 NYT article "Multimillion-Dollar Men" Link to the interactive display.

Note the overlap of the bubble, indicating excess and data too large in magnitude for the display...

Prefuse

Prefuse1.pngPrefuse2.png Prefuse is best illustrated through a movie of its capabilities: movie.

Also, check out its gallery of visualizations.

Interactive 2-D Graph of word relationships in Dictionary (Prefuse)

NearWord is a free visual synonym thesaurus, based on the WordNet dictionary and the Prefuse visualization toolkit. Just type in a word below and press Search! Passing the mouse cursor over a sense number displays the definition for that sense. Double-clicking on a synonym re-launches the display using that synonym as the root.

Nearword.png

Visualizing Pairs of Words in two different documents

Think of it as a 2-D Tag-Chart. From ManyEyes, an IBM-based research group

(http://services.alphaworks.ibm.com/manyeyes/view/SQYskMsOtha64CXvdXd0N2~)

ManyEyes rivets.png

DocuBurst

Docuburst.png

Docuburst

DocuBurst is the first visualization of document content which takes advantage of the human-created structure in lexical databases. We use an accepted design paradigm to generate visualizations which improve the usability and utility of WordNet as the backbone for document content visualization. A radial, space-filling layout of hyponymy (IS-A relation) is presented with interactive techniques of zoom, filter, and details-on-demand for the task of document visualization. The techniques can be generalized to multiple documents.
A technical report on this project is available in PDF as well as a short poster abstract from the IEEE Information Visualization Symposium 2006.
















The Best Tools for Visualizations

Discovered by Christine Grascia: http://www.learnbydoing.org/?p=121 ( pdf), this article by Sarah Perez contains many links to various visualizers for various services/databases.

infosthetics

http://infosthetics.com/ form follows data - data visualization & visual communication

Another site dedicated to collecting stunning examples of visualization.

HeavyLosses.png

Newvisual

http://www.newsvisual.com/newsvisual/

YahooGoogle.png

Visualizers for Wikipedia

WikiCompany

WikiCompany WikiCompany.png Wikicompany profiles are now also part of the exciting Linking Open Data and DBPedia projects. These projects provide free, structured data sets and software tools for developing innovative Semantic Web applications.

Prefuse

http://prefuse.org/

Prefuse is a set of software tools for creating rich interactive data visualizations. The original prefuse toolkit provides a visualization framework for the Java programming language. The prefuse flare toolkit provides visualization and animation tools for ActionScript and the Adobe Flash Player.

Another collection of visualization examples

From SmashingMagazine and provided by Thomas Ciufo http://www.smashingmagazine.com/2007/08/02/data-visualization-modern-approaches/ ( Pdf )

3D-Visualizers

H3Viewer

This is a 3D-Graph visualization tool created/maintained by Tamara Munzer's research group at Stanford. It is open source and interactive, written in C++ and OpenGL.

H3Viewer.gif

Novelties - Lines and bubbles

--DT 13:37, 3 September 2008 (UTC)

Newtestamentnames.jpg

article link: http://www.nytimes.com/2008/08/31/technology/31novel.html?_r=1&ei=5070&emc=eta1&oref=slogin

NYT articles 8/8/31 acknowledging new ways for people to display information. It's really an article on IBM's Many-Eyes (http://many-eyes.com/)


Collaboration like this can be an effective way to spur insight, said Pat Hanrahan, a professor of computer science at Stanford whose research includes scientific visualization. “When analyzing information, no single person knows it all,” he said. “When you have a group look at data, you protect against bias. You get more perspectives, and this can lead to more reliable decisions.”
“The great fun of information visualization,” Ben Shneiderman says, “is that it gives you answers to questions you didn’t know you had.”

Visualizing the medals at the Olympics

--DT 14:21, 25 August 2008 (UTC)

NYTBejingMedals.png

http://www.nytimes.com/interactive/2008/08/04/sports/olympics/20080804_MEDALCOUNT_MAP.html

This is a dynamic display of the number of medals obtained at various olympics. This is nicely done, and uses some form of circle packing.

Visualization of relationship (similarity) between books

--DT 01:48, 22 July 2008 (UTC)

conflate.net shows a Processing visualization applet where the user can control the number of books shows (as circles) and the threshold defining whether they are similar or not.

BookSimilarity.png


Rhizome Navigation

--Thiebaut 11:59, 6 July 2008 (UTC)

A library for exploring graphically graphs. Written in Processing, created and maintained by the University of Vienna. http://www.rhizomenavigation.net/

Rhizome.jpg

Using the transcripts of Bill Gates' keynote from CES 2007 and Steve Jobs' keynote at Macworld 2007 (via Todd Bishop's Microsoft Blog) the author created this relational tagcloud using Rhizome Navigation.

Automatic centering and organizing of a graph in Processing

June 27th, 2008 by admin

Interesting Processing application showing the automatically scaling/organizing of a tree. Done in Processing

RandomArboretum.png

http://www.cs.princeton.edu/~traer/randomarboretum/

Also uses a Physics libraryhttp://www.cs.princeton.edu/~traer/physics/ and animation library from the same site.

New York Times example of multi-D graph (elections)

June 23rd, 2008 by admin

A string of debates


From the NYT, shows statistics on words/concepts appearing in candidates speeches


PDF File Version

Naming Names

June 21st, 2008 by admin

From article in NYT on candidates naming each other names…

Naming Names

Code Swarm

June 19th, 2008 by admin

Code-Swarm is a visualization technique to show the evolution of a software project under CVS as it is updated, modified, and as it evolves under the influence of many contributors/programmers.

Eclipse-640px.png

From Slashdot: http://developers.slashdot.org/developers/08/06/16/1855209.shtml

“A student at UC Davis has created some stunning visualizations of open source software contributions, including Eclipse, Python, Apache httpd and Postgres. From the website: ‘This visualization, called code_swarm, shows the history of commits in a software project. A commit happens when a developer makes changes to the code or documents and transfers them into the central project repository. Both developers and files are represented as moving elements. When a developer commits a file, it lights up and flies towards that developer. Files are colored according to their purpose, such as whether they are source code or a document. If files or developers have not been active for a while, they will fade away. A histogram at the bottom keeps a reminder of what has come before.’”


Watch the video: code_swarm - Eclipse (short ver.) from Michael Ogawa on Vimeo.

The code is available on Google-Code, and the 6/19/08 version is available here:

http://cs.smith.edu/~thiebaut/IS_blog/software/code-swarm/ [2]

In the Art of a DNA Graph

June 18th, 2008 by admin

Article in the 6/18/08 NYT, Science Times.

link

17dna-600.jpg

DESIGN AND SCIENCE: The Life and Work of Will Burtin

June 2nd, 2008 by admin

Excerpts from article published in 6/1/08 NYT (Sunday Book Review section):

Visuals

Burtin was one of many designer exiles who fled the Nazis and Fascists, including the Bauhaus teachers Herbert Bayer and Laszlo Moholy-Nagy […] Burtin was the art director of Fortune magazine in the late ’40s, responsible for introducing abstract and conceptual art covers. Burtin’s most impressive contribution was the marriage of science and design.

After a recent spate of graphic designer biographies, this detailed monograph is definitely overdue. Burtin’s virtually forgotten work, like the exhibition “Metabolism — the Cycle of Life,” prefigures the interaction design practiced today on the Web and reveals just how entertaining well-articulated graphic and exhibition design about science can be.

Heller-1.jpg

TextArc

May 29th, 2008 by admin

A somewhat confused view of a whole book in one graphic visualization. Visually pleasing, but what can we use it for?

http://textarc.org

InteractiveAndPrintSplit.gif

Measuring dynamic relationships between readers and stories

May 29th, 2008 by admin

http://labs.digg.com/arc

Too complicated (I think) for the average user. But shows a lot of information. Not intuitive, though…

Digg arc.png

Maps of market and news

May 29th, 2008 by admin

Two interesting uses of treemaps. Both are referenced in the StateOfTheUnion.net web site (in the essay)

Newsmap.png
Marketmap.png

Words used in the State of the Union addresses across the years

May 28th, 2008 by admin

http://stateoftheunion.onetwothree.net/

This is done with processing, and truly interactive. As the arrow key is moved left or right, we move by one year backward or foreward, respectively, and see in red the words used the year before, and in white the words of the current year. Cool…

Stateoftheunion.png

Stateoftheunion.png

The link contains an interesting essay, reproduced below:

The {Sorry} State We Are In

by Brad Borevitz

The triumph of iconicity over rhetoricity–call it the society of the spectacle, call it what you will. The change has certainly not gone unobserved. And yet, we are likely to blinker our awareness of the situation–and imagine that the mechanisms of our governance continue unaffected–that the institutions of democracy are somehow untouched by these changes. But how can this possibly be the case?

A democratic system of government depends on communicative practices that are founded on rhetoric: an art of persuasion. This implies a public sphere as the ground of a competitive exchange of argument and counter argument. Reason theoretically rules such a domain, where syllogistic conventions determine the outcome of a competition of ideas based on the strength of evidence and the logical coherence of their exposition.

What has displaced this rhetorical arena is a screen on which assertions are projected. It may be that these assertions compete for attention, but they don’t entertain argument or tolerate critique. Assertions are immune from denigration based on counterfactual evidence, or the revelation of faulty logic. Competition in this environment is a matter of precedence, authority, style, volume, frequency, and ultimately saturation.

Contemporary political ideas, which take the form of memes circulating in the soup of our media saturated world, are formally equivalent to the fragments of iconic identity circulating as agents of corporate entities, the brands. Politics is branding, the media practice of producing identity as awareness and desire, through the deployment of declarative language and image.

Not only have commercial interests produced a scarcity of actual public space by their domination of the landscape and their occupation of the commons, they have gained almost total control over the virtual spaces of communication, and colonized the language of political discourse itself.

In this atmosphere, the public debate over ideas is obsolete, if not impossible. The significance of such a change is immense. In Benjaminian terms, politics enters the realm of the aesthetic, a situation symptomatic of fascism.

How is it that we have arrived at this state? Why are we so surprised as we wake now to the nightmare? After all, here in the U.S., the president has been informing us of the state of the union from the year the constitution was ratified. Were we not listening to the message–not reading in this text the signs of transformation? When was it that the words addressed to us changed from having a rhetorical significance to an iconic one? When was it that the words last demanded our understanding, and when did they come to simply demand that we buy in?

A Form

The State of the Union address is a particularly apt data set to explore for clues about the change in political language: it is a remarkably consistent form available annually over the entire history of the United States. Article II Section 3 of the Constitution inaugurates the practice:

[The President] shall from time to time give to the Congress Information of the State of the Union, and recommend to their Consideration such Measures as he shall judge necessary and expedient …

Very quickly, the address acquired a conventional form as a yearly message delivered by the office of the president, at the beginning of the year, to the congress, the representatives of the people. This consistent structure, which endures over the course of U.S. history, is what allows for a useful comparison between actual instances of the address. Such comparisons would be difficult to make otherwise between more random fragments of political discourse not regulated by a uniform temporal frame and an archetypal structure of address.

In the ceremonious address in congress, when the president is announced, he is called, “the president of the United States,” and not mentioned by name. The State of the Union is specifically not delivered by the person of the president–the specific officeholder of that position–but rather, the office of the president–that space of authority that the person merely occupies temporarily. Even if he signs his name to the document (if it is written), the text has very rarely been composed exclusively by him–it is merely approved or intoned by him. Presidents have always had help with writing, starting with Washington who relied on Hamilton for this service; though, Coolidge (1923-29) was the first to have an official speechwriter on staff.

In addition to the hypothesized change in language between the rhetorical and the iconic, the State of the Union has, from the beginning, been subject to various tensions and trends both political and linguistic, which also affect its language. One of these is the opposition between written and oral delivery of the message. The address was first given in person, but starting with Jefferson in 1801, the message was written and delivered to congress where it was read by a clerk.

Jefferson’s objection to the spoken format was the inappropriately monarchical implication of the address, given its similarity to the imperial practice of the speech from the throne. From Jefferson, until Wilson in 1913, the message was written, and referred to as the “President’s Annual Message to Congress.” The contemporary usage of the term “State of the Union Address” starts with Roosevelt in 1935. With the exception of the years 1919-20 (Wilson), 1924-28 (Coolidge), 1929-32 (Hoover), 1944-45 (Roosevelt), 1946 and 1953 (Truman), 1956 (Eisenhower), 1961 (Kennedy), 1973 (Nixon), and 1981 (Carter), all messages since 1913 were delivered orally (in 1945 and 1956 they were broadcast as radio addresses, though they delivered to Congress as written messages).

The oral delivery of the address in the modern period coincides with a tendency towards broader dissemination of the message in the press and the media more generally as the century progresses. 1923 brings the first radio broadcast. In 1947, the speech was televised. In 1965 it was moved to the evening hours, presumably to be made accessible to the working person. And, in 2002, it was available live via the web. The audience for the message had always included office-holders in government, the direct address of the public via broadcasting, would certainly impact the language used. Broadcast demands a different, probably less formal, communication style, and an extra self-consciousness in targeting the more populous and diverse audience. Simultaneously, speech in general has likely become less formal.

Given that affirmation or rejection rather than argument are the possible responses to anti-rhetorical speech acts, the applause of the live audience, the congress, the supreme court, and dignitaries assembled in the capital building, takes on a special importance and contributes significantly to the length of the address. Applause are, more and more, a partisan affair, with the members of the president’s party responsible for most of the acclamation, and the members of the court abstaining from all but the most perfunctory and polite participation. In this respect, the speech comes to resemble a sporting event with a partisan audience both present and remote, and with referees maintaining their impartiality at the sides. Since Bush’s first speech in 2002, White House transcripts have contained parenthetical notations of the applause and laughter.

The content of the State of the Union messages certainly vary according to the issues of the day, but they also contain constants: the words that constitute the mythos of our governance and the machinery of its ideology. With characteristic eloquence, Jimmy Carter writes in the 1979 address:

As long as I’m President, at home and around the world America’s examples and America’s influence will be marshaled to advance the cause of human rights.

To establish those values, two centuries ago a bold generation of Americans risked their property, their position, and life itself. We are their heirs, and they are sending us a message across the centuries. The words they made so vivid are now growing faintly indistinct, because they are not heard often enough. They are words like “justice,” “equality,” “unity,” “truth,” “sacrifice,” “liberty,” “faith,” and “love.”

These words remind us that the duty of our generation of Americans is to renew our Nation’s faith, not focused just against foreign threats but against the threats of selfishness, cynicism, and apathy.

Despite Carter’s protest to the contrary, those key words he claims are not heard, are, in fact, heard constantly: the word “justice” appears in 180 addresses a total of 752 times. It is a staple of presidential expression. To be fair, other of these words have been less popular of late in State of the Union speeches, and may also be less popular in colloquial speech. Certainly justice, liberty and faith, at least, are constantly in our ears.

Word Speeches Occurrences
Justice 180 752
Equality 85 149
Unity 60 102
Truth 69 101
Sacrifice 57 90
Liberty 137 306
Faith 134 341
Love 54 78
Freedom 154 676
Peace 207 1821
War 206 2631*

*Not including plural forms

A word like “Freedom,” a Bush favorite, is in 154 of the speeches. The words “peace” and “war” occur together in 200 out of the 214 addresses–fully 93% of them. If there is a single issue, which dominates the corpus, it must be this. Practically the first thing out of Washington’s mouth in the very first speech was: “To be prepared for war is one of the most effectual means of preserving peace.” In that sentence, it is as if he sets the agenda for the next 200 years of history and inaugurates the confusion, which persists to this day, over the meaning of these two simple words.
Peacevwar.png

Each iconic word has a history of usage, a profile of its popularity, and a trace of the contest between words within the ecology of political language. Visualizing this history gives us insight into these struggles. The words “peace” and “war” seem bound tightly together in their rhythmic march into the future, but “war” peaks dramatically during major conflicts leaving peace far below in its shadow. Between “freedom” and “justice” the dynamics are livelier. Whereas “justice” seemed the favored value trending upwards from the beginning, World War II marks a dramatic reversal of fate. “Freedom,” it seems, is now more important than “justice.” And that just might explain some things.
Freedomvjustice.png

Another Reading

The counting up of words suggests a different sort of reading practice. There is reason to be skeptical of the positivist implications of a statistical analysis of language, but there is also motive to appreciate and explore the current vogue of quantitative methods. There is something compelling in the urge to empirically examine this particular corpus for clues as to how things have gone horribly wrong. Maybe we can no longer bear to listen to the address, or maybe it has become impossible for us to read it. There are certainly few who would be willing to scrutinize all 3000 pages of our legacy of 214 messages from the president. Perhaps counting is a defense against the spell of iconic language. It may be that counting is simply the automation of a practice that we participate in already, as we measure unconsciously our saturation in the messages of the media–as they work us over completely.

Is it a problem to conceive of reading practices that satisfy the will to process the material, to refine it, to empty it of its gold, but not necessarily to understand it? We fear succumbing to the desire to consume the messages, for consumption entails risks and investments that we will not bear. It is too much. The scanning of words abstracted from their context produces a reading that is dependent on iconic recognitions and specifically not on rhetoric. This is an analysis that is blind to motive but cunning to structure–or is this an analysis at all? If it is true that rhetoric is lost, it is a way of reading that those dead texts deserve.

Certainly, such practices already exist, separating attention into surface and depth: the scan and the mine. The task is divided between our parts: our silicon organs, and our fleshy ones. Applications of this kind surround us: search engines, maps of markets (Marketmap), of headlines (Newsmap), of books (Amazon’s concordance). We are increasingly aware of the degree to which our lives are the texts that are mined and scanned by industry and government alike. A quantitative reduction is not simply a violence, a prelude to injustice, a methodological inadequacy, a mistake, or a failure, it is a practice matched with a circumstance.

The master’s tools are turned against him. We can envision a Total Information Awareness program of our own and aim its tentacles at government documents. It is an ironic pleasure to scan in seconds the one-and-a-half-million-word corpus of the State of the Union for important keywords and significant characteristics, even as the government’s programs mine our personal data for incriminating evidence–how much more so with the revelations in late 2005 that domestic spying was not simply post-9/11 paranoia, that the theoretical capabilities of the now defunct Information Awareness Office had actually been deployed out of the dark side of the Defense Department.


† The version of the Bush addresses that are published in the Congressional Record contain only a few scattered notations of applause, mostly at the begining and end of the speech, while the White House transcripts (as well as those on CSPAN) include upwards of 50 parenthetical notations of applause in each.

Processing-driven visualization of a 3D network of word relationships

May 28th, 2008 by admin

http://jamesnsears.com/applets/spies/

This was done for the NYT, 3 Dec. 2006. The article is “Rewiring the Spy”. This is done in Processing and shows the connections existing between words in a government database dealing with terrorism.

396 big01.png

Funny Cartoons (for Wikipedians)

May 26th, 2008 by admin

http://xkcd.com/333/

Getting out of hand.png
(Foreplay)
Wikipedian protester.png
(wikipedian protester)

From the geeky site http://xkcd.com/. Note that the page on the laptop looks like a wikipedia page! There was a NYT article on 5/25/08, Link by link: This is funny only if you know Unix on this web-toon site.

NYT 050408: Inflation’s Little Parts

May 20th, 2008 by admin

Interesting article exhibiting a very “organic” chart showing influence of various factors in the inflation.

Note the scale given at the top, showing the relationship between color and change in price. The graph itself is hierarchic, with 8 different categories (apparel, health care, etc…), and each is divided up into sub categories shown as blobs of various sizes, the size being proportional to the part of spending.

The graph is interactive on the NYT web site, and gives more info about a blob on mouse-over.

The graph is done in flash.

Nyt 050408 inflations little parts.jpg

SeaDragon

May 19th, 2008 by admin

Technology for browsing large amount of pictures in jpeg2000 format. SeaDragon was purchased by Microsoft.

Watch the video: [3]

Facebook’s “friend wheel”

May 19th, 2008 by admin

Interesting display of friendship links on Facebook.

First example, mine:

Dtfriendwheel.png

Next, Lauren’s friend wheel (Lauren just graduated from Smith):

Laurenfriendwheel.png

Visualizing 3D

May 14th, 2008 by admin

New article from the NYT on 5/13/08.
Astro 600.2.jpg

“Exploring the virtual universe is incredibly smooth and seamless like a top-of-the-line computer game, but also the science is correct”

The WorldWide Telescope results from careful planning and lengthy development in a research division. It has the richer graphics and it created special software to present the images of spherical space objects with less polar distortion. WorldWide Telescope requires downloading a hefty piece of software, and it runs only on Microsoft Windows.

Google Sky started as a Google “20 percent” project, in which engineers can spend time on anything they choose. Google Earth, where Google Sky began, requires a software download, but its Web-based version, which came out in March, does not. The Google culture encourages engineers to put new things onto the Internet quickly and keep improving them, a philosophy geared to constant evolution instead of finished products.

Design and the Elastic Mind

May 12th, 2008 by admin

Paul Antonelli, the curator of the exhibit “Design and the Elastic Mind” speaks to Charlie Rose in a 1-hour interview. Great stuff!

http://www.charlierose.com/shows/2008/05/07/1/design-and-the-elastic-mind


Tree-maps: another interesting visual display of information

April 10th, 2008 by admin

From http://lifehacker.com/software/disk-space/geek-to-live–visualize-your-hard-drive-usage-219058.php

Harddisk treemap.png


Interesting word chart

April 1st, 2008 by admin

http://www.neoformix.com/2008/ObamaClintonSpeechContrast.html

Interesting comparison of two speeches…

Obamaclinton.png

Some Graphviz Examples

March 30th, 2008 by admin

Just found this while looking for ways to represent the CS curriculum as a graph. I think our direction using Processing is good, and I don’t want to go back to Graphviz, but looking at ways people are using graphing packages to show relationships is interesting, no matter what package they use.

http://www.flickr.com/search/?q=graphviz&w=all&s=int

Graphviz flickr.jpg

Showing the time variation of various quantities

February 24th, 2008 by admin

Today’s NYT (2/24/08) shows an interesting graph of the money made by different movies in 2007. It’s an interesting way to show time-variation of several tens of quantities.

The graph is interactive, as the mouse is moved over the different movies, some information is displayed, as well as the length of their duration. http://www.nytimes.com/
Ebbflow.jpg

Interesting links related to Processing

February 22nd, 2008 by admin
Yahoo Burst
Similarity
Valence

(Note: I will keep adding more links as the time comes, so please keep checking this post often Icon smile.gif




Click here to see an applet in action


Check http://www.processing.org for more info and examples.


Another network navigation site

February 17th, 2008 by admin

http://www.tinrocket.com/


Grabbed.jpg

Graphing the history of a wikipedia page

February 12th, 2008 by admin
Discover mag.jpg

Generated by Martin Wattenberg and
described in “Studying Cooperation and Conflict between Authors
with history flow Visualizations”, 2004 (link).

35 Great Visualizations

February 12th, 2008 by admin

Can be found here: abeautifulwww.com


Competition on Visual Network Dynamics

February 12th, 2008 by admin

Competition on visualizing network dynamics

2007, Queens, NY

Some interesting designs for representing large networks.


Must-watch video!

February 12th, 2008 by admin

Tamara Munzner of U. British Columbia presents a talk at Google titled 15 Views of a Node Link Graph: An Information Visualization Portfolio


http://video.google.com/videoplay?docid=-6229232330597040086 & q=type%3Agoogle+engEDU

It’s one-hour long, but worth it. It would be nice to see if some of the software she demonstrates for exploring graphs is available…

Tamara’s Web site and group’s
site have good information.


Neat graphical representation of activity in Wikipedia

February 11th, 2008 by admin
Windowslivewritervisualizingthepowerstruggleinwikipedia-f7c7wikivislowres74.jpg

Click here for full size image.
Very interesting and artistic way to depict activity in the wikipedia pages.
For more information, check Bruce Herr’s http://abeautifulwww.com/2007/05/20/visualizing-the-power-struggle-in-wikipedia/
or “Visualizing the ‘Power Struggle’ in Wikipedia”

A nicer web-2.0 type graph where the user can zoom in and out can be found here:

http://scimaps.org/maps/wikipedia/

Another nice image representing graphically the geography and activity by domain name


Cctld 1200.jpg

Interesting visualization packages

February 5th, 2008 by admin
  1. ManyEyes by IBM: Link
  2. Another interesting plot by ManyEyes


Visual Representation Options

January 29th, 2008 by Allie

So here is a link to Microsoft Silverlight’s “Showcase” page where some Silverlight applications are available for demo. I don’t want to initially create a large web application but Silverlight graphics can be inserted inline with HTML code easily.

http://silverlight.net/showcase/default.aspx


Animation of the history of a wikipedia page

January 29th, 2008 by admin

Here is a cool link to a page showing an animation of the life of a wikipedia page. This is done by Jon Udell.

They Rule & Wikipedia

January 28th, 2008 by admin

Here’s a way to get started with the idea.

First go to the site TheyRule.net and play with the system.

Theyrule2.png

Select “Load Map”/”Popular” and pick an entry. You will see a network of connections appearing. The network shows the people that belong to different boards of companies. As you move your mouse over some of the entries you are given a menu to search or delete the item. Also very neat, you can ckick on an item and move it around with the mouse while retaining the existing connections.


This mediawiki page was generated by I love wiki, an HTML to wiki syntax converter that took the html version of the blog and translate it into wiki syntax