All posts by David Pennock

Algorithmic economics postdoc position at Microsoft Research, NYC

We are beginning to search for a postdoc in Algorithmic Economics at Microsoft Research, health New York City with a start date in Summer/Fall 2016. Strong graduating students should apply by December 8, 2015. Faculty: please encourage your best students to apply. Please feel free to forward and distribute the announcement as you see fit.

Research in the Algorithmic Economics group at MSR-NYC spans a wide variety of topics at the interface of economics and computation. Application areas include auctions, crowdsourcing, gaming, information aggregation, machine learning in markets, market interfaces, market makers, monetization, online advertising, optimization, polling, prediction engines, preference elicitation, scoring rules, and social media.

More information:

Keep Kirk in Lurk

Keep Kirk in lurk.
Control the jerk.
That piece of work.
The bull Berserk.

He’s loud and crass.
A frequent ass.
Can’t not make pass,
At pretty lass.

The silly dope.
He preaches Pope.
And clings to hope.
The human trope.

Not Mr. Spock,
He’s run a-mok,
He’s awe and shock,
He doesn’t grok.

Single minded.
Needs reminded,
Not to find id
Ego blinded.

Few effective.
Spews invective.
Treats prime directive
As elective.

Law evaded.
Ethics faded.
Simply stated:
Overrated!

Ignore his slick,
Adoring shtick.
He’s Captain Prick,
Not James but Dick.

Just keep him penned,
He’s rare the friend.
But hand extend,
We’re not at end.

For inci-dents,
Of rare intense,
Need lack of sense,
And slack of fence.

For these events,
The best compense,
Are stubborn gents,
Of Kirk immense.

When last in place,
In hole your ace:
Old K can chase,
Blow up the race.

Escaping Dodge?
You need the Stodge.
Engaging perp?
No better twerp.

His winning smile,
Full-throttle style,
There’s no denial,
Puts thought on trial.

So tame and train,
Your James T. brain.
To first, refrain.
Yet not abstain.

Yes, Kirk has perk.
Don’t fully shirk,
His style of work:
The bully quirk.


See also: Unlock Your Spock

Unlock your Spock

Unlock your Spock,
The thinking jock.
In reason, stock.
Your head full chock.

Emotion, block.
Religion, knock.
Illogic, mock.
To scissors, rock.

Don’t jump rejoice,
Without his voice,
To guide your choice,
Of rolls or Royce.

Don’t improvise.
When chance arise,
Try on for size,
This Enterprise.

This guy insider.
Calm decider.
Anger hider.
Truth abider.

The one that bends.
Who letter sends.
Ends pretends,
And makes amends.

The man who plans.
Buys minivans.
Stores up his cans.
Controls his glands.

No chitter chat,
Or swagger frat.
A diplomat.
A clever cat.

His omissions:
Intuitions,
Superstitions,
Hopeless missions.

He’ll exercise
Humility.
He’ll maximize
Utility.

To measure right,
Looks not hindsight.
Makes best of plight
In current light.

Odds in favor?
Risk won’t waiver.
More than saver.
Future paver.

He never lies.
Yet more, this guy’s.
He flies. He dies.
And then he’ll rise.

Unlike portrayed,
He makes the grade,
He’s wealthy paid,
He so gets laid.

The lesson cinch:
Just inch by inch,
Turn your winch
On Vulcan pinch.

Take lesson stock:
No dove or hawk.
For life to rock,
Embrace your Spock.


RIP Leonard Nimoy
See also: Keep Kirk in Lurk

Microsoft Researchers co-authored 21% of papers at the ACM Conference on Economics and Computation

Twenty-six researchers from Microsoft Research labs in Boston, China, India, Israel, New York City, Redmond, Silicon Valley, and the United Kingdom co-authored a remarkable seventeen of the eighty papers published in the 2014 ACM Conference on Economics and Computation (EC’14).

Moshe Babaioff served as General Chair for the conference and many other Microsoft Researchers served roles including (senior) PC members, workshop organizers, and tutorial speakers.

For research at the intersection of economics and computation, IMHO there’s no stronger “department” in the world than MSR.

Sébastien Lahaie and Jennifer Wortman Vaughan co-authored three papers each. Remarkably, Jenn accomplished that feat and gave birth!

The full list of authors are: Shipra Agrawal, Moshe Babaioff, Yoram Bachrach, Wei Chen, Sofia Ceppi, Nikhil R. Devanur, Fernando Diaz, Hu Fu, Rafael Frongillo, Daniel Goldstein, Nicole Immorlica, Ian Kash, Peter Key, Sébastien Lahaie, Tie-Yan Liu, Brendan Lucier, Yishay Mansour, Preston McAfee, Noam Nisan, David M. Pennock, Tao Qin, Justin Rao, Aleksandrs Slivkins, Siddharth Suri, Jennifer Wortman Vaughan, and Duncan Watts.

The full list of papers are:

Optimal Auctions for Correlated Bidders with Sampling
Hu Fu, Nima Haghpanah, Jason Hartline and Robert Kleinberg

Generalized Second Price Auction with Probabilistic Broad Match
Wei Chen, Di He, Tie-Yan Liu, Tao Qin, Yixin Tao and Liwei Wang

Optimising Trade‐offs Among Stakeholders in Ad Auctions
Yoram Bachrach, Sofia Ceppi, Ian Kash, Peter Key and David Kurokaw

Neutrality and Geometry of Mean Voting
Sébastien Lahaie and Nisarg Shah

Adaptive Contract Design for Crowdsourcing Markets: Bandit Algorithms for Repeated Principal‐Agent Problems
Chien-Ju Ho, Aleksandrs Slivkins and Jennifer Wortman Vaughan

Removing Arbitrage from Wagering Mechanisms
Yiling Chen, Nikhil R. Devanur, David M. Pennock and Jennifer Wortman Vaughan

Information Aggregation in Exponential Family Markets
Jacob Abernethy, Sindhu Kutty, Sébastien Lahaie and Rahul Sami

A General Volume‐ Parameterized Market Making Framework
Jacob Abernethy, Rafael Frongillo, Xiaolong Li and Jennifer Wortman Vaughan

Reasoning about Optimal Stable Matchings under Partial Information
Baharak Rastegari, Anne Condon, Nicole Immorlica, Robert Irving and Kevin Leyton-Brown

The Wisdom of Smaller, Smarter Crowds
Daniel Goldstein, Preston McAfee and Siddharth Suri

Incentivized Optimal Advert Assignment via Utility Decomposition
Frank Kelly, Peter Key and Neil Walton

Whole Page Optimization: How Page Elements Interact with the Position Auction
Pavel Metrikov, Fernando Diaz, Sébastien Lahaie and Justin Rao

Local Computation Mechanism Design
Shai Vardi, Avinatan Hassidim and Yishay Mansour

On the Efficiency of the Walrasian Mechanism
Moshe Babaioff, Brendan Lucier, Noam Nisan and Renato Paes Leme

Long‐run Learning in Games of Cooperation
Winter Mason, Siddharth Suri and Duncan Watts

Contract Complexity
Moshe Babaioff and Eyal Winter

Bandits with concave rewards and convex knapsacks
Shipra Agrawal and Nikhil R. Devanur

Bernie’s Credo

My dad died this morning, smiling up until the end. He was an amazing man and an incredible dad — a modern, sensitive, involved dad who was way ahead of his time — a model for me. I wrote about my Dad in 2010. At the end of his beautiful memoir (composed using Blurb) — a document I cherish — my dad wrote his “personal credo”. It not only is eloquent and profound (and references calculus), it really does reflect the way he lived his life every single day. Here it is:

  • Be honest (always be truthful)
  • Be kind (care about others)
  • Be fair (judge others with care)
  • Focus and act on what is important only
  • Pay attention to your surroundings (people and places)
  • Speak only to convey information
  • Make decisions based not only on the present but also on the anticipated future (This concept is the essence of the contribution of calculus (differentiation) to mathematics)
  • I am proud of who I am.
  • Be Happy !
Mom and Dad
Mom and Dad
Mom and Dad cooking
Mom and Dad cooking — one of their favorite activities
Dad teaching physics
Dad teaching physics to the grandkids. It was fun to see “teacher Bernie” come alive.
Bernie's personal credo
Bernie Pennock’s personal credo

Last call: Postdoc positions at Microsoft Research NYC

Microsoft Research New York City seeks outstanding applicants for 2-year postdoctoral researcher positions. We welcome applicants with a strong academic record in one of the following areas:

We will also consider applicants in other focus areas of the lab, including information retrieval, and behavioral & empirical economics. Additional information about these areas is included below. Please submit all application materials by January 11, 2013 for full consideration. Instructions are here.


COMPUTATIONAL SOCIAL SCIENCE

With an increasing amount of data on every aspect of our daily activities — from what we buy, to where we travel, to who we know — we are able to measure human behavior with precision largely thought impossible just a decade ago. Lying at the intersection of computer science, statistics and the social sciences, the emerging field of computational social science uses large-scale demographic, behavioral and network data to address longstanding questions in sociology, economics, politics, and beyond. We seek postdoc applicants with a diverse set of skills, including experience with large-scale data, scalable statistical and machine learning methods, and knowledge of a substantive social science field, such as sociology, economics, psychology, political science, or marketing.

ONLINE EXPERIMENTAL SOCIAL SCIENCE

Online experimental social science involves using the web, including crowdsourcing platforms such as Amazon’s Mechanical Turk, to study human behavior in “virtual lab” environments. Among other topics, virtual labs have been used to study the relationship between financial incentives and performance, the honesty of online workers, advertising impact as a function of exposure time, the implicit cost of “bad ads,” the testing of graphical user interfaces eliciting probabilistic information and also the relationship between network structure and social dynamics, related to social phenomena such as cooperation, learning, and collective problem solving. We seek postdoc applicants with a diverse mix of skills, including awareness of the theoretical and experimental social science literature, and experience with experimental design, as well as demonstrated statistical modeling and programming expertise. Specific experience running experiments on Amazon’s Mechanical Turk or related crowdsourcing websites, as well as managing virtual participant pools is also desirable, as is evidence of UI design ability.

ALGORITHMIC ECONOMICS AND MARKET DESIGN

Market design, the engineering arm of economics, benefits from an understanding of computation: complexity, algorithms, engineering practice, and data. Conversely, computer science in a networked world benefits from a solid foundation in economics: incentives and game theory. Scientists with hybrid expertise are crucial as social systems of all types move to electronic platforms, as people increasingly rely on programmatic trading aids, as market designers rely more on equilibrium simulations, and as optimization and machine learning algorithms become part of the inner loop of social and economic mechanisms. We seek applicants who embody a diverse mix of skills, including a background in computer science (e.g., artificial intelligence or theory) or related field, and knowledge of the theoretical and experimental economics literature. Experience building prototype systems, and a comfort level with modern programming paradigms (e.g., web programming and map-reduce) are also desirable.

MACHINE LEARNING

Machine learning is the discipline of designing efficient algorithms for making accurate predictions and optimal decisions in the face of uncertainty. It combines tools and techniques from computer science, signal processing, statistics and optimization. Microsoft offers a unique opportunity to work with extremely diverse data sources, both big and small, while also offering a very stimulating environment for cutting-edge theoretical research. We seek postdoc applicants who have demonstrated ability to do independent research, have a strong publication record at top research venues and thrive in a multidisciplinary environment.

A toast to the number 303: A redemptive election night for science, and The Signal

The night of February 15, 2012, was an uncomfortable one for me. Not a natural talker, I was out of my element at a press dinner organized by Yahoo! with journalists from the New York Times, Fast Company, MIT Tech Review, Forbes, SF Chronicle, WIRED, Reuters, and several more [1]. Even worse, the reporters kept leading with, “wow, this must a big night for you, huh? You just called the election.”

We were there to promote The Signal, a partnership between Yahoo! Research and Yahoo! News to put a quantitative lens on the election and beyond. The Signal was our data-driven antidote to two media extremes: the pundits who commit to statements without evidence; and some journalists who, in the name of balance, commit to nothing. As MIT Tech Review billed it, The Signal would be the “mother of all political prediction engines”. We like to joke that that quote undersold us: our aim was to be the mother of all prediction engines, period. The Signal was a broad project with many moving parts, featuring predictions, social media analysis, infographics, interactives, polls, and games. Led by David “Force-of-Nature” Rothschild, myself, and Chris Wilson, the full cast included over 30 researchers, engineers, and news editors [2]. We confirmed quickly that there’s a clear thirst for numeracy in news reporting: The Signal grew in 4 months to 2 million unique users per month [3].

On that night, though, the journalists kept coming back to the Yahoo! PR hook that brought them in the door: our insanely early election “call”. At that time in February, Romney hadn’t even been nominated.

No, we didn’t call the election, we predicted the election. That may sound like the same thing but, in scientific terms, there is a world of difference. We estimated the most likely outcome – Obama would win 303 Electoral College votes, more than enough to return him to the White House — and assigned a probability to it. Of less than one. Implying a probability of more than zero of being wrong. But that nuance is hard to explain to journalists and the public, and not nearly as exciting.

Although most of our predictions were based on markets and polls, the “303” prediction was not: it was a statistical model trained on historical data of past elections, authored by economists Patrick Hummel and David Rothschild. It doesn’t even care about the identities of the candidates.

I have to give Yahoo! enormous credit. It took a lot of guts to put faith in some number-crunching eggheads in their Research division and go to press with their conclusions. On February 16, Yahoo! went further. They put the 303 prediction front and center, literally, as an “Exclusive” banner item on Yahoo.com, a place that 300 million people call home every month.

The Signal 303 prediction "Exclusive" top banner item on Yahoo.com 2012-02-16

The firestorm was immediate and monstrous. Nearly a million people read the article and almost 40,000 left comments. Writing for Yahoo! News, I had grown used to the barrage of comments and emails, some comic, irrelevant, or snarky; others hateful or alert-the-FBI scary. But nothing could prepare us for that day. Responses ranged from skeptical to utterly outraged, mostly from people who read the headline or reactions but not the article itself. How dare Yahoo! call the election this far out?! (We didn’t.) Yahoo! is a mouthpiece for Obama! (The model is transparent and published: take it for what it’s worth.) Even Yahoo! News editor Chris Suellentrop grew uncomfortable, especially with the spin from Homepage (“Has Obama won?”) and PR (see “call” versus “predict”), keeping a tighter rein on us from then on. Plenty of other outlets “got it” and reported on it for what it was – a prediction with a solid scientific basis, and a margin for error.

This morning, with Florida still undecided, Obama had secured exactly 303 Electoral College votes.

New York Times 2012 election results Big Board 2011-11-07

Just today Obama wrapped up Florida too, giving him 29 more EVs than we predicted. Still, Florida was the closest vote in the nation, and for all 50 other entities — 49 states plus Washington D.C. — we predicted the correct outcome back in February. The model was not 100% confident about every state of course, formally expecting to get 6.8 wrong, and rating Florida the most likely state to flip from red to blue. The Hummel-Rothschild model, based only on a handful of variables like approval rating and second-quarter economic trends, completely ignored everything else of note, including money, debates, bail outs, binders, third-quarter numbers, and more than 47% of all surreptitious recordings. Yet it came within 74,000 votes of sweeping the board. Think about that the next time you hear an “obvious” explanation for why Obama won (his data was biggi-er!) or why Romney failed (too much fundraising!).

Kudos to Nate Silver, Simon Jackman, Drew Linzer, and Sam Wang for predicting all 51 states correctly on election eve.

As Felix Salmon said, “The dominant narrative, the day after the presidential election, is the triumph of the quants.” Mashable’s Chris Taylor remarked, “here is the absolute, undoubted winner of this election: Nate Silver and his running mate, big data.” ReadWrite declared, “This is about the triumph of machines and software over gut instinct. The age of voodoo is over.” The new news quants “bring their own data” and represent a refreshing trend in media toward accountability at least, if not total objectivity, away from rhetoric and anecdote. We need more people like them. Whether you agree or not, their kind — our kind — will proliferate.

Congrats to David, Patrick, Chris, Yahoo! News, and the entire Signal team for going out on a limb, taking significant heat for it, and correctly predicting 50 out of 51 states and an Obama victory nearly nine months prior to the election.

Footnotes

[1] Here was the day-before guest list for the February 15 Yahoo! press dinner, though one or two didn’t make it:
-  New York Times, John Markoff
-  New York Times, David Corcoran
-  Fast Company, EB Boyd
-  Forbes, Tomio Geron
-  MIT Tech Review, Tom Simonite
-  New Scientist, Jim Giles
-  Scobleizer, Robert Scoble
-  WIRED, Cade Metz
-  Bloomberg/BusinessWeek, Doug MacMillan
-  Reuters, Alexei Oreskovic
-  San Francisco Chronicle, James Temple

[2] The extended Signal cast included Kim Farrell, Kim Capps-Tanaka, Sebastien Lahaie, Miro Dudik, Patrick Hummel, Alex Jaimes, Ingemar Weber, Ana-Maria Popescu, Peter Mika, Rob Barrett, Thomas Kelly, Chris Suellentrop, Hillary Frey, EJ Lao, Steve Enders, Grant Wong, Paula McMahon, Shirish Anand, Laura Davis, Mridul Muralidharan, Navneet Nair, Arun Kumar, Shrikant Naidu, and Sudar Muthu.

[3] Although I continue to be amazed at how greener the grass is at Microsoft compared to Yahoo!, my one significant regret is not being able to see The Signal project through to its natural conclusion. Although The Signal blog was by no means the sole product of the project, it was certainly the hub. In the end, I wrote 22 articles and David Rothschild at least three times that many.

Raise your WiseQ to the 57th power

One of the few aspects of my job I enjoy more than designing a new market is actually building it. Turning some wild concept that sprung from the minds of a bunch of scientists into a working artifact is a huge rush, and I can only smile as people from around the world commence tinkering with the thing, often in ways I never expected. The “build it” phase of a research project, besides being a ton of fun, inevitably sheds important light back on the original design in a virtuous cycle.

In that vein, I am thrilled to announce the beta launch of PredictWiseQ, a fully operational example of our latest combinatorial prediction market design: “A tractable combinatorial market maker using constraint generation”, published in the 2012 ACM Conference on Electronic Commerce.

You read the paper.1  Now play the game.2 Help us close the loop.

PredictWiseQ Make-a-Prediction screenshot October 2012

PredictWiseQ is our greedy attempt to scarf up as much information as is humanly possible and use it, wisely, to forecast nearly every possible detail about the upcoming US presidential election. For example, we can project how likely it is that Romney will win Colorado but lose the election (6.2%), or that the same party will win both Ohio and Pennsylvania (77.6%), or that Obama will paint a path of blue from Canada to Mexico (99.5%). But don’t just window shop, go ahead and customize and buy a prediction or ten for yourself. Your actions help inform the odds of your own predictions and, crucially, thousands of other related predictions at the same time.

For example, a bet on Obama to win both Ohio and Florida can automatically raise his odds of winning Ohio alone. That’s because our market maker knows and enforces the fact that Obama winning OH and FL can never be more likely than him winning OH. After every trade, we find and fix thousands of these logical inconsistencies. In other words, our market maker identifies and cleans up arbitrage wherever it finds it. But there’s a limit to how fastidious our market maker can be. It’s effectively impossible to rid the system of all arbitrage: doing so is NP-hard, or computationally intractable. So we clean up a good bit of arbitrage, but there should be plenty left.

So here’s a reader’s challenge: try to identify arbitrage on PredictWiseQ that we did not. Go ahead and profit from it and, when you’re ready, please let me and others know about it in the comments. I’ll award kudos to the reader who finds the simplest arbitrage.

Why not leave all of the arbitrage for our traders to profit from themselves? That’s what nearly every other market does, from Ireland-based Intrade, to Las Vegas bookmakers, to the Chicago Board Options Exchange. The reason is, we’re operating a prediction market. Our goal is to elicit information. Even a completely uninformed trader can profit from arbitrage via a mechanical plug-and-chug process. We should reserve the spoils for people who provide good information, not those armed (solely) with fast or clever algorithms. Moreover, we want every little crumb of information that we get, in whatever form we get it, to immediately impact as many of the thousands or millions of predictions that it relates to as possible. We don’t want to wait around for traders to perform this propagation on their own and, besides, it’s a waste of their brain cells: it’s a job much better suited for a computer anyway.

Intrade offers an impressive array of predictions about the election, including who will win in all fifty states. In a sense, PredictWiseQ is Intrade to the 57th power. In a combinatorial market, a prediction can be any (Boolean) function of the state outcomes, an ungodly degree of flexibility. Let’s do some counting. In the election, there are actually 57 “states”: 48 winner-takes-all states, Washington DC, and two proportional states — Nebraska and Maine — that can split their electoral votes in 5 and 3 unique ways, respectively. Ignoring independent candidates, all 57 base “states” can end up colored Democratic blue or Republican Red. So that’s 2 to the power 57, or 144 quadrillion possible maps that newscasters might show us after the votes are tallied on November 6th. A prediction, like “Romney wins Ohio”, is the set of all outcomes where the prediction is true, in this case all 72 quadrillion maps where Ohio is red. The number of possible predictions is the number of sets of outcomes, or 2 to the power 144 quadrillion. That’s more than a googol, though less than a googolplex (maybe next year). To get a sense of how big that is, if today’s fastest supercomputer starting counting at the instant of the big bang, it still wouldn’t be anywhere close reaching a googol yet.

Create your own league to compare your political WiseQ among friends. If you tell us how much each player is in for, we’ll tell you how to divvy things up at the end. Or join the “Friends Of Dave” (FOD) league. If you finish ahead of me in my league, I’ll buy you a beer (or beverage of your choice) the next time I see you, or I’ll paypal you $5 if we don’t cross paths.

PredictWiseQ is part of PredictWise, a fascinating startup of its own. Founded by my colleague David Rothschild, PredictWise is the place to go for thousands of accurate, real-time predictions on politics, sports, finance, and entertainment, aggregated and curated from around the web. The PredictWiseQ Game is a joint effort among David, Miro, Sebastien, Clinton, and myself.

The academic paper that PredictWiseQ is based on is one of my favorites — owed in large part to my coauthors Miro and Sebastien, two incredible sciengineers. As is often the case, the theory looks bulletproof on paper. But I’ve learned the hard way many times that you don’t really know if a design is good until you try it. Or more accurately, until you build it and let a crowd of other people try it.

So, dear crowd, please try it! Bang on it. Break it. (Though please tell me how you did, so we might fix it.) Tell me what you like and what is horribly wrong. Mostly, have fun playing a market that I believe represents the future of markets in the post-CDA era, a.k.a the digital age.

__________
1 Or not.
2 Or not.

Oddhead Blog hacked… for the third time

My blog has been hacked yet again. For those keeping track, that’s infection number three. This latest exploit is very similar to the previous one. To humans arriving via browser (e.g., me), the site appears perfectly normal and healthy. Even upon clicking ‘view source’, nothing untoward is revealed. The <title> of my blog is, as always, Oddhead Blog.

However, when Google’s or Bing’s crawlers arrive to index my corner of the web, they see a different <title> altogether — Buy Cheap Cialis Online  — and immediately roll their eyes. (Actually even if you run 'curl http://blog.oddhead.com', you’ll see the spam keywords.) The effect of the attack is a kind of reverse cloaking. Cloaking is the black-hat SEO practice of serving legitimate content to crawlers and spam content to people. Here, the spam content is shown to the crawlers and the legitimate content to the people.

Once the crawlers report this appalling information back to their respective mother ships, the search engines have no choice but to delist and demote my blog in their pagerankings. Right now, if you search for or within Oddhead Blog on Google, you’ll see how poorly the bots in Mountain View think of me:

Oddhead Blog hacked again: Spam titles in Google's cache 2012-04-27

You can hardly find any deep links into my blog by searching Google. For example, try searching for Bem+Wom, my invented term for “BEtter Mousetrap, Word of Mouth”. Even try “Bem+Wom oddhead blog”. You”ll find aggregators republishing my content, but no links to the original source, my blog, anywhere in sight. (Note to self: the Bing results for Bem+Wom are awful.)

Once again I am at a loss to understand my attacker’s motivation. Clearly it’s not to sell Cialis to my users, as they remain blissfully ignorant of any changes. The only benefit to anyone is to remove one relatively obscure blog from the search engine rankings and thus to move the attacker one slot up. Having a blog tangentially about gambling probably puts me into a shady neighborhood of the web, yet reverse-cloaking your competition (even if it can be somewhat automated and strike more than one competitor) seems like an awfully indirect way to improve one’s standing in Google. It’s also possible this is an act of pure vandalism.

So what should I do? Although I partly blame WordPress for writing insecure software, I may end up paying WordPress protection money to make this problem go away. I am seriously considering giving up on self hosting and moving my whole operation to worpress.com’s hosted service, where presumably security is tighter, or at least it’s not my responsibility any more. My web hosting service, DreamHost, may also be partly to blame, yet I like the company and have been quite happy with them in many respects. Any advice, dear reader? WordPress.com? Blogger? Try again and hope the fourth time is the charm? Should I be looking to ditch DreamHost as well?

Microsoft Research New York City, First Days

Microsoft Research NYC logoNow that I’ve said my goodbyes, I’m thrilled to announce that I’ve joined Microsoft Research, an organization with going-on twenty-one years of commitment to basic and applied research, employing 850 Ph.D. scientists around the globe including Turing Award winners, Fields Medalists, and many long-time colleagues that I hugely respect. If that were all, I would be over-the-top happy right now.

But that’s not all. Together with fourteen other founding members (seven of whom I can name: Duncan Watts, John Langford, David Rothschild, Sharad Goel, Dan Goldstein, Jake Hofman, and Sid Suri), we are cutting the ribbon on a new outpost for Microsoft Research in New York City. We will report to Jennifer Chayes, the founder and director of Microsoft Research New England in Cambridge, MA. It’s been amazing to watch her up close pursue a goal relentlessly with boundless positive energy. I get the feeling it’s how she approaches everything she does, a realization that played no small part in my decision. The New England Lab, like us, is an interdisciplinary research group that blends computer science, social science, and machine learning, yet from different enough perspectives to make this an almost perfect marriage. It’s no exaggeration to say that helping to found and lead a new research group amid the bursting tech scene in New York City, with the resources of Microsoft behind us, is — as Duncan says — a once-in-a-career opportunity.

The press coverage Thursday was gratifying, including nice pieces in PCMag (source of the sweet logo above), NYTimes.com, AllThingsD, and dozens more. Here is the official press release. For science perspectives, see John Langford’s, Lance Fortnow’s, Dan Goldstein’s, and Jennifer Chayes’s blog posts. One of the coolest moments came when New York Mayor Michael Bloomberg tweeted about us.

Note that, despite the attrition, Yahoo! Labs lives on, probably more applied but not solely so. Ron Brachman, the new head of Yahoo! Labs, is terrific and may be able to do something special there. The Barcelona group remains largely intact and just got 7 (!) papers into SIGIR. Other groups remain intact as well.

The reception within Microsoft research and product orgs has been swift and very warm. The breadth and scope of the place can be daunting at first but invigorating. The ability to impact products that touch hundreds of millions of people’s lives is, as always, a rewarding draw of corporate research. Yet one of the deciding factors for many of us in joining Microsoft is the freedom to interact with universities in research, service, teaching, hosting visitors, hiring interns and postdocs, etc. In addition, we’d like to play our part in the New York City tech scene, including the startup, venture-capitalist, and hack/make communities, plus the new Cornell-Technion campus, contributing to Mayor Bloomberg’s vision of New York City as a tech hub.

An interesting side note that bodes well for my two daughters ages 7 and 4 is that my primary decision boiled down to working for one of two brilliant and accomplished women: Jennifer Chayes at Microsoft, or Corinna Cortes at Google, who is absolutely terrific. Google is a incredible place, a model of efficiency, innovation, and ambition, with an impressive roster of people, and the company is in a very strong position. But this opportunity at Microsoft simply proved to be too good to pass up. I can’t believe how perfectly everything fell into place. I’m beyond thrilled at the outcome and excited to begin this next chapter of my career.