
Addendum on search and support vector machines

by Michael Nielsen on February 6, 2012

In the last post I described how to use support vector machines (SVMs) to combine multiple notions of search relevance. After posting I realized I could greatly simplify my discussion of an important subject in the final section of that post. What I can simplify is this: once you’ve found the SVM parameters, how should you use them to rank a long list of documents for a particular query, in practice?

Here’s the setup. We suppose we’ve already taken a bunch of training data, as in the last post, and used that training data to find an SVM with soft margin. That SVM can be used to rank any query and document. In particular, the SVM is specified by a vector $w$, and we rank a document $d_1$ above a document $d_2$ for query $q$ if:

$$ w \cdot \Phi(q, d_1, d_2) > 0, \qquad [*] $$

where $\Phi(q, d_1, d_2)$ is the feature difference vector. (In the last post a parameter $b$ also appeared in this condition, but as I noted in one of the exercises in that post, for the particular problem of search $b = 0$, which is why it doesn’t appear here.)
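
For concreteness, here is a minimal Python sketch of that pairwise decision rule. The names are mine, not from the post: rank_above is hypothetical, and the feature difference vector is assumed to arrive as a plain list of numbers.

    def rank_above(w, phi_diff):
        """Return True if the SVM ranks d1 above d2 for this query.

        w        -- the trained SVM weight vector (list of floats)
        phi_diff -- the feature difference vector Phi(q, d1, d2)
        """
        # Condition [*]: w . Phi(q, d1, d2) > 0 (with the bias b = 0 for search).
        return sum(wj * fj for wj, fj in zip(w, phi_diff)) > 0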

In the final section of the last post I did lots of perambulations with the condition [*] to figure out how to construct a linear order ranking documents, given a particular query $q$. We can simplify life quite a bit by recalling that $\Phi(q, d_1, d_2) = \Phi(q, d_1) - \Phi(q, d_2)$, where $\Phi(q, d)$ is the feature vector for query $q$ and document $d$. The condition above can then be rewritten as:

$$ w \cdot \Phi(q, d_1) > w \cdot \Phi(q, d_2). \qquad [**] $$

This condition suggests a much easier way of ranking documents: simply treat $w \cdot \Phi(q, d)$ as a score of how good document $d$ is when the query is $q$, so that we rank $d_1$ above $d_2$ whenever the score for $d_1$ is better than the score for $d_2$. We then just construct a linear ordering of the documents, based on the score. Or, if we prefer, we can find just the top 10 (or 20, or whatever) documents by iterating linearly over the documents, and keeping a running tally of the best 10 (or 20) found to date.
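
Here’s a minimal Python sketch of that linear pass, keeping only the top k documents. Again the names are assumptions rather than anything from the post: feature_vector stands in for whatever code computes the feature vector for a query and document.

    import heapq

    def top_k_documents(w, query, documents, feature_vector, k=10):
        """Return the k highest-scoring documents for the query.

        w              -- the trained SVM weight vector (list of floats)
        documents      -- an iterable of candidate documents
        feature_vector -- function mapping (query, document) to the list of
                          feature values Phi(q, d), one per entry of w
        """
        def score(doc):
            # The score is just the dot product w . Phi(q, d).
            return sum(wj * fj for wj, fj in zip(w, feature_vector(query, doc)))

        # heapq.nlargest makes a single pass over the documents, keeping a
        # running tally of the best k seen so far, rather than sorting them all.
        return heapq.nlargest(k, documents, key=score)

Sorting the full list by score works too, of course; the heap just avoids holding more than k candidates at a time.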

This is quite a bit simpler than the discussion at the end of my last post. It only works, though, because our classifier (the SVM) is a linear function of the feature difference vectors. It’s not difficult to construct other types of classifier for which it really wouldn’t be obvious how to construct this kind of score. For those classifiers you could still fall back on the analysis I did in the last post, so it wasn’t all wasted effort.

Incidentally, one thing I like about the equation [**] is that it makes it a lot clearer why $w$ is called the weight vector. It means that the score for page $d$ on query $q$ is just a linear combination of the different feature values, with weights given by the weight vector. That makes the notation $w$ and nomenclature “weight vector” much clearer, as well as making it a lot clearer what the vector $w$ means!
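
Spelling that out in the notation above: the score for document $d$ on query $q$ unpacks as

$$ w \cdot \Phi(q, d) = \sum_j w_j \, \Phi_j(q, d), $$

so the $j$-th weight $w_j$ simply says how heavily the $j$-th relevance feature counts toward the overall score.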


2 Comments

  1. Kaveh

     Hi Mike

     Thanks for these two posts. I think you need another w dot on the rhs of [**].

     • Michael Nielsen

       Thanks, corrected!

