• Weave for visualization development

    February 7, 2012 to Software  •  Share on Twitter  •  Comments (1)

    spacer

    Web-based Analysis and Visualization Environment, or Weave for short, is open source software intended for flexible visualization.

    Weave (BETA 1.0) is a new web-based visualization platform designed to enable visualization of any available data by anyone for any purpose. Weave is an application development platform supporting multiple levels of user proficiency — novice to advanced &mdsah; as well as the ability to integrate, disseminate and visualize data at "nested" levels of geography.

    It looks like everything is done through a click interface, and you can piece together modules and link them, etc. There is some setup involved, but there are a number of video tutorials and documents to get everything installed.

    Source code also available on GitHub.

    [Weave]

  • Members Only
    spacer

    How to Make a Contour Map

    Filled contour plots are useful for looking at density across two dimensions and are often used to visualize geographic data. It's straightforward to make them in R — once you get your data in the right format, that is.
  • Tracking the grizzly bear in emotional interactive documentary

    February 6, 2012 to Data Art  •  Share on Twitter  •  Comments (1)

    spacer

    In a blend of data and storytelling, Jeremy Mendes and Leanne Allison dig into surveillance logs generated by a monitored grizzly bear between 2001 and 2009. The final work is a moving interactive documentary, Bear 71.

    She lived her life under near-constant surveillance and was continually stressed by interactions with the human world. She was tracked and logged as data, reflecting the way we have come to see the world around us through Tron and Matrix-like filters, qualifying and quantifying everything, rather than experiencing and interacting.

    Leanne Allison sifted through thousands of photos from motion-triggered trail cameras for this project. The grainy images gathered over the past 10 years by various scientists reveal the hidden life of the forest, played out by the animals and humans — including Bear 71 — captured covertly on film.

    It begins with the capture of a grizzly, its tagging, and then release, as a first-person narrative tells a story through the eyes of the bear. You, the observer, are allowed to follow the bear and explore its environment on an abstract map, and somewhere along the way digital and the physical world melt together.

    [Bear 71 via @wiederkehr]

  • Most mentioned NFL players on SportsCenter

    February 5, 2012 to Infographics  •  Share on Twitter  •  Comments (4)

    spacer

    Like something from of a video game, this graphic from The New York Times shows the most mentioned NFL players and coaches this season. Players are scaled approximately by the number of mentions between August 1, 2011 to February 1, 2012 on ESPN's SportCenter and Sunday NFL Countdown. The giant on the left is Tim Tebow, with 1,450 mentions. Bar graphs on the bottom highlight mentions over time for players of interest.

    [New York Times]

  • An action plan for data science, a decade ago

    February 3, 2012 to Statistics  •  Share on Twitter  •  Comments (2)

    Data science has been covered at length during the past couple of years, and we tend to think of it as a field of study just a couple of years older than that. Jeff Hammerbacher and DJ Patil have played roles in further propagating the term as an actual profession in roughly the same timespan. So I was surprised to come across this rarely-cited 2001 paper by statistician William Cleveland, Data Science: An Action Plan for Expanding the Technical Areas of the Field of Statistics [pdf].

    This document describes a plan to enlarge the major areas of technical work of the field of statistics. Because the plan is ambitious and implies substantial change, the altered field will be called "data science."

    For those unfamiliar, Cleveland's work on graphical perception might ring a bell.
    Continue Reading

  • Compare presidential candidate fundraising

    February 2, 2012 to Statistical Visualization  •  Share on Twitter  •  Comments (3)

    spacer

    Presidential candidates have raised $186 million up to now, according to the Federal Election Commission. The New York Times lets you compare the amounts raised by each candidate, over time and space. Simply select a candidate on the left, and another on the right to see how they match up. Fundraising by candidates from previous elections, at the same time of year, are also included for context.

    While not the focus of the interactive, the distributions for donation size at the bottom seem to be especially telling.

    [New York Times via infosthetics]

  • Bird migration patterns mapped

    February 2, 2012 to Mapping  •  Share on Twitter  •  Comments (1)

    spacer

    Birds move. eBird shows us how.

    Understanding patterns of bird occurrence at continental scales has long been one of eBird's fundamental challenges. Only now, with 42 million records and ever more thorough coverage nationwide, is this becoming possible. Ongoing research at the Cornell Lab is currently producing cutting-edge graphics that we are pleased to share here. Day-by-day predictions of species occurrence allows these models to shine a spotlight on the most awe-inspiring of natural spectacles: the ebb and flow of bird migration.

    Cutting edge? No. They are thorough though, with maps (in the form of animated gifs) for a large number of species.

    [eBird | Thanks, Ed]

  • Mapping the drug wars in Mexico

    February 1, 2012 to Mapping  •  Share on Twitter  •  Comments (2)

    spacer

    Diego Valle-Jones maps homicides and trafficking routes in Mexico.

    To unclutter the map and following the lead of the paper Trafficking Networks and the Mexican Drug War by Melissa Dell, I decided to only show the optimal highways (according to my own data and Google Directions) to reach the US border ports from the municipalities with the highest drug plant eradication between 1994 and 2003 and the highest 2d density estimate of drug labs based on newspaper reports of seizures. The map is a work in progress and is still missing the cocaine routes, but hopefully I'll be able to add them shortly.

    There's lots to look at and interact with here. To start, there are bubbles that cluster homicides by region and major highway routes in black.

    Click on any bubble and you get a time series for the corresponding area, going back to 2004. Or if you like, draw your own polygon to see the time series for specific regions. Pointers on the time series highlight significant events. There's also a slider that lets you see numbers on the map for different years. A layer underneath the bubbles lets you see high density areas for marijuana, opium, and drug labs.

    Take a look at the full map for yourself. This is nice work by Valle-Jones.

    [Diego Valle-Jones | Thanks, Diego]

  • Challenges measuring crime worldwide

    February 1, 2012 to Statistics  •  Share on Twitter  •  Comments (3)

    You would think that something so concrete, carefully recorded by authorities, wouldn't be too tough to tabulate, even if at a large scale. Not so.

    Homicide is a "serious crime that many people are concerned with, it is well-measured, and it is to a large degree well-reported and -recorded," says Alfred Blumstein, a criminologist at Carnegie Mellon University. "That is not to say that there aren't a variety of ways for fudging the measurement."

    Among the factors that cloud homicide numbers: gaps between police-reported numbers and counts by public-health organizations. The discrepancy is wide in many African countries and some Caribbean ones. The United Nations attributes the disparity to several factors, including definitional differences—whether honor killings should count—a lack of public-health infrastructure in some countries, and undercounting—possibly deliberate—by police.

    I think this is something the common public often doesn't understand about data. The numbers are entered and analyzed on a computer, so it's easy to mistake data for mechanical output. It must be accurate, right? That's usually not the case though, especially when it comes to data collection outside a controlled lab setting.

    The game always changes when humans are involved. Not everyone responds to surveys, definitions of events vary across organizations, estimation methods change every year, and the list goes on.

    For those who do stuff with data, you have to deal with that uncertainty, and as data consumers, you have to remember that numbers don't automatically mean fact.

    [Wall Street Journal]

  • Texting on the toilet

    January 30, 2012 to Data Sources  •  Share on Twitter  •  Comments (11)

    spacer I thought this riveting post on the New York Times Bits blog about the rise of the toilet texter deserved a graphic. Since their graphics department is no doubt busy with elections, I took the liberty. I am — the 91 percent.

    I got the numbers straight from the Bits post, but you can download the full report from 11mark for all the demographics. You have to register though, and I didn't want to be the guy who creates an online account to just read a report on what people do while they make dooty. I have standards.

  • Visualizing popularity of Yahoo homepage stories

    January 30, 2012 to Visualization  •  Share on Twitter  •  Add Comment

    spacer

    Yahoo is not what it used to be, but many parts of it are still alive and well. In a follow-up to their email interactive, Yahoo, along with visualization firm Periscopic, explores the popularity of articles that appear on the Yahoo homepage. It's a visualization that shows activity within the Content Optimization and Relevance Engine (C.O.R.E. for short).

    The focus is on the center, which has the same layout as that of the stories on the Yahoo homepage. Story on top, and links to more stories on the bottom. Except in the interactive, you can see demographics of those who viewed the story. The slider on the bottom lets you go back up to 24 hours to see what was hot during each hour.

    It gets more fun when you use the buttons on the left and right to view popular stories among age and gender cohorts and button on the right that let you see stories by categories. The rotating particles, each representing a clickable story, in the background provide a final flourish.

    Oh, and extra nerd points for HTML5.

    [Yahoo]

  • More people want to learn statistics

    January 27, 2012 to Statistics  •  Share on Twitter  •  Comments (5)

    Data is hot right now, so as you would expect, more people are signing up and applying to learn about it. Quentin Hardy for The New York Times reports.

    At North Carolina State, an advanced analytics program lasting 10 months has, since its founding in 2006, placed over 90 percent of its students annually. The average graduate’s starting salary for an entry-level job is $73,000. Its current class of 40 students had 185 applicants, and next year’s applications are already twice that. In 2009, Harvard awarded four undergraduate degrees in statistics. Two graduates went into finance, one to political polling and one became a substitute teacher. There were nine graduates in 2010, 13 last year. They headed into Google, biosciences and Wall Street, as well as Stanford's literature department.

    And in 2011, just about everywhere.

    [New York Times via @jsteeleeditor]

  • The Fixie Bike Index and hipsters

    January 27, 2012 to Statistics  •  Share on Twitter  •  Comments (17)

    spacer

    Priceonomics takes the association of fixie bikes to hipsters, and creates the Fixie Bike Index. After starting with New York, they branch out to national numbers.

    In short, fixed gear bikes = hipsters, and New York boroughs that have more fixies per capita should have more hipsters per capita. We sampled our data to see the number of used bikes for sale per capita in each borough with the term "fixie" or "fixed gear" in the product title to create the Fixie Index.

    I don't know about these numbers. I lived in Modesto for a year and don't remember people riding bikes — or hipsters, and riding your bike in Los Angeles kind of sucks.

    [Priceonomics]

  • Own and securely store your location with OpenPaths

    January 26, 2012 to Self-surveillance  •  Share on Twitter  •  Comments (2)

    spacer

    There are a lot of ways to collect your location, whether it's for journaling and personal reflection or for sharing with others, but it can be tricky making use of your data once it's stored behind company servers. OpenPaths lets you collect your data via iPhone or their just released Android app.

    We inhabit a world where data are being collected about us on a massive scale. These data are being stored, analyzed and monetized primarily by corporations; there is limited agency for the people whom the data actually represent. We believe that people who generate data through their own day-to-day activities should have a right to keep a copy of that data. When people have access to their personal data in a useful format all kinds of new things become possible. We can become better consumers: for example, we can know whether a monthly rail pass makes sense for us, or which data-plan would be most economical for our smartphone usage. More importantly, when our personal data is readily accessible and under our control we can become active collaborators in the quest for solutions to important social problems in areas such as public health, genetics or urban planning.

    You can easily view your data in the OpenPaths map interface, or download your data as CSV, JSON, or KML, and do what you want. There's also an API. Finally, if you choose to, you can contribute your data for researchers, artists, and

gipoco.com is neither affiliated with the authors of this page nor responsible for its contents. This is a safe-cache copy of the original web site.