Presentation about Luigi

I like the editing!

From: Erik Bernhardsson

Being data driven

I picked up an issue of Foreign Affairs while flying back to NYC from SFO. It features a long interview with U.S. General Stanley McChrystal, and I was struck by how many similarities there are between fighting a war and developing software.

From: Erik Bernhardsson

Annoy

Annoy is a simple package to find approximate nearest neighbors (ANN) that I just put on GitHub. I'm not trying to compete with existing packages, but Annoy has a couple of features that make it pretty useful.

From: Erik Bernhardsson

More Luigi!

Elias Freider just talked about Luigi at PyData 2013. His presentation is much better than the one I put together a few weeks ago. In case anyone is interested, I'll include mine too:

From: Erik Bernhardsson

ML at Twitter

I recently came across this paper describing how they do ML at Twitter. TL;DR: everything is a Pig workflow, and they do everything as UDFs. The approach seems pretty interesting.

From: Erik Bernhardsson

I'm featured in Mashable

This article from today in Mashable describes some of the fun stuff I get to work with: Erik Bernhardsson is technical lead at Spotify, where he helped to build a music recommendation system based on large-scale machine learning algorithms, mainly matrix factorization of big matrices using Hadoop.

From: Erik Bernhardsson

Slides from NYC Machine Learning talk

Slides from the talk. Slightly edited because (a) some of the slides make little sense taken out of context and (b) SlideShare seems to have problems converting some of the stuff. Collaborative filtering at Spotify from Erik Bernhardsson

From: Erik Bernhardsson

NYC Machine Learning meetup

From the NYC Machine Learning talk I gave last week: I haven't looked at it yet except briefly. Unfortunately the quality isn't the best.

From: Erik Bernhardsson

Part 3: Adobe hotfix released for "Serious security threat for ColdFusion servers"

Adobe has come out with a new security hotfix for a very serious attack on ColdFusion servers that has hit many (perhaps most) CF shops over the past couple of weeks, and it's vital that all shops apply that fix.

From: Charlie Arehart - Server Troubleshooting

Momentum and mean reversion might just be volatility bias

The Economist just published an article called The best, the worst and the ugly.

From: Erik Bernhardsson

Part 2: Serious security threat for ColdFusion servers [now covered by a hotfix]

Since I posted my entry earlier today about a Serious security threat for #ColdFusion servers [now covered by a hotfix], I have had many questions and discussions that have led me to share more info.

From: Charlie Arehart - Server Troubleshooting

Serious security threat for ColdFusion servers [now covered by a hotfix]

Hey folks, there's a fairly serious security threat out in the wild, and you may want to check if your server's been hit.

From: Charlie Arehart - Server Troubleshooting

Calculating cosine similarities using dimensionality reduction

This was posted on the Twitter Engineering blog a few days ago: Dimension Independent Similarity Computation (DISCO). I just glanced at the paper, and there's some cool stuff going on from a theoretical perspective.

From: Erik Bernhardsson

Tumblr's awesome project names

Not sure how I managed to miss this, but I'm watching this Tumblr presentation and they talk about their projects named after Arrested Development topics: Gob, Parmesan, Buster, Jetpants, Oscar, George and Motherboy. Still, the best software project name is probably Apple's BHA.

From: Erik Bernhardsson

A neat little trick with time decay

Something that pops up pretty frequently is the need to implement time decay, especially where you have recursive chains of jobs.

From: Erik Bernhardsson

Luigi: complex pipelines of tasks in Python

I'm shamelessly promoting my first major open source project.

From: Erik Bernhardsson

Updates for Dreamweaver, HomeSite, CF Studio for ColdFusion 8, 9, or 10

If you are still using Dreamweaver or HomeSite+ (or its older brother, HomeSite, or its older uncle, ColdFusion Studio), you may find that the tag insight, tag help, tag completion, and other features are not recognizing newer ColdFusion tags and functions, if perhaps you have updated to later ve...

From: Charlie Arehart - Server Troubleshooting

How to tell what, if any, hotfixes have been applied to ColdFusion (9 and earlier)

I often see people struggling with confusion over what hotfixes have been applied to CF.

From: Charlie Arehart - Server Troubleshooting

CF911: Want to monitor ColdFusion "out of process" (from outside the instance itself)? Many ways.

I just blogged about how the hidden gem "enable monitoring server" option in CF 9.

From: Charlie Arehart - Server Troubleshooting

CF911: "Enable Monitoring Server" option (new in ColdFusion 9.01) DOES NOT monitor "out of process"

Have you been led to believe that the "Enable Monitoring Server" option (new in the CF 9.

From: Charlie Arehart - Server Troubleshooting