A Menagerie of CV Markers

This week, the net’s been exploding with responses to James Bridle’s work on the New Aesthetic. Bruce Sterling set the fuse for this particular conflagration with his Essay on the New Aesthetic in Wired.

My own response, published in The Creator’s Project on Friday, was called What It’s Like to be a 21st Century Thing. I tried to put NA in the context of Object-Oriented Ontology arguing that NA “consists of visual artifacts we make to help us imagine the inner lives of our digital objects and also of the visual representations produced by our digital objects as a kind of pigeon language between their inaccessible inner lives and ours.” This is an approach I’m excited about and plan to flesh out more here soon.

Today, though, I want to engage in a bit of OOO ontography and close-looking as a way of responding to what I thought was one of the more interesting takes on Sterling’s essay.

In his post Why the New Aesthetic isn’t about 8bit retro, the Robot Readable World, computer vision and pirates, Rev Dan Catt tries to address the 8-bit quality of much New Aesthetic visual work. Specifically, he’s trying to answer a criticism of NA as retro, a throwback to “the colors and 8 bit graphics of the 80s” as Tom Coates put it.

For Catt that resemblance comes from the primitive state of computer vision today. “Computer vision isn’t very advanced, to exist with machines in the real world we need to mark up the world to help them see”, he says. In other words, the current limitations of computer vision algorithms require intentionally designed bold blocky 8-bit graphics for them to function. And therefore the markers we design to meet this requirement end up looking like primitive computer graphics, which resulted from similar technical limitations in the systems that produced them. As Catt says, “put another way, current computer vision can probably ‘see’ computer graphics from around 20–30 years ago.”

In a conversation about this idea, Kyle McDonald argued that Catt’s taking the comparison too far. While there is a functional comparison between the current state of computer vision and the state of computer graphics in the 80s, the actual markers we’re using in CV work today don’t much resemble 8-bit graphics aesthetically.

To explore this idea, Kyle and I decided to put together a collection of as many different kinds of markers as we could think of along with links to the algorithms and processes that create and track them (though I’m sure there are many we’ve missed – more contributions are welcome in the comments). It was our hope that such a collection might widen the New Aesthetic visual vocabulary by adding additional ingredients as well as focusing some attention on the actual computational techniques used to create and track these images. Since so many of us were raised looking at 8-bit video games and graphics I think it quite helps to look at the actual markers themselves in their surprising variety rather than just filing them away with Pitfall Harry’s rope, Mario’s mushroom, and Donkey Kong’s barrel, which we already know so well.

So, what do real CV markers actually look like? Browse the images and links below to see for yourself, but I’ll make a few quick general characterizations. There is a lot of high contrast black and white as well as stark geometry that emphasizes edges. However the grid that characterizes 8-bit images and games is nearly never kept fully in tact. Most of the marker designs are specifically trying to defeat repetition in favor of identifying a few specific features. Curves and circles are nearly as common as squares and grids.

I’d love to collect more technical links about the tracking techniques associated with each of these kinds of markers. So jump in with the comments if you’ve got suggestions.

figcaption {
display: none;
}

Corner detection for calibration

Dot tracking markers

Traditional bar codes

Stacked bar code

Data Matrix 2D

Text EZCode

Data Glyphs

QR codes

Custom QR codes

Different flavors of Fiducial Markers

Retro-reflective motion capture markers

Hybrid marker approaches

This entry was posted in Opinion. Bookmark the permalink.

0 Responses to A Menagerie of CV Markers

method says:

April 9, 2012 at 7:29 am

One that a lot of people have interacted with is the Kinect calibration card: http://kinect.dashhacks.com/kinect-calibration-card

Blaine Cook says:

April 9, 2012 at 8:35 am

To Tom’s original point, work like DTAM that uses everything in the image as machine-readable markers, and Astrometry that uses the stars themselves as machine-readable markers both prove your point, I think. Once you start going down that path, there are very many examples: DNA sequencing, just about anything done with ‘R’, Google, and so on.

My major point of departure with NA is that the fact that machines have been instructed to recognise features which we imbue with meaning does not mean that the machines themselves read meaning into those features. Just because some primitive algorithms use assistive technologies that we can see and ponder does not mean that those algorithms also ponder, any more than their more advanced cousins.

- greg says:
  
  April 11, 2012 at 11:23 pm
  
  Blaine — But “pondering” is only the human way of digesting bits of images and the world. And it’s not even the chief one of those. Most of our looking faculties are actually doing all kinds of other low-levels things that we only have the vaguest notion of. Not pondering. The point of this isn’t to project a human subjectivity onto the machines, to imagine them “pondering” in the same way we do. The point is that something is going on inside there. Computation. These computer vision systems exist just as much as people do, even if in a very different way. I’m not positing any kind of equivalence between the two, much the opposite. I’m advocating for a process of speculatively imagining the quite different way that these systems see and compute. By necessity this process of speculation will involve quite serious and focused attention on understanding the details of the algorithms and techniques involved. That’s why I started Makematics which is specifically aimed at doing just that. However just because we’re not going to fall into the pathetic fallacy of personifying these systems that also doesn’t mean that we shouldn’t try to imagine them at all. And analyzing the algorithms from a purely functional point of view is inadequate, by itself, to this task. We have to engage some of our other imaginative faculties as well.
  
  This is something I tried to get at with my Machine Pareidolia project. The breakdown in that post of the three different types of faces FaceTracker detected (agreement, near agreement, and totally other) is an attempt to imagine the algorithm’s point of view which somewhat overlaps with humans but also diverges significantly.

A Menagerie of CV Markers

OpenCV calibration checker pattern for homography

Reactivision

Graphtracker

Rune Tags

Corner detection for calibration

Dot tracking markers

Traditional bar codes

Stacked bar code

Data Matrix 2D

Text EZCode

Data Glyphs

QR codes

Custom QR codes

Microsoft tags aka High Capacity Color Barcodes

Maxi Codes

Short Codes

Different flavors of Fiducial Markers

9-Point Landmark

Cantags

AR tracking marker for After Effects

ARTag markers

Retro-reflective motion capture markers

Hybrid marker approaches

0 Responses to A Menagerie of CV Markers

Leave a Reply Cancel reply

Urban Honking

Recent Posts

Recent Comments

Archives

Categories

Meta