Creative Commons

Posted on July 15, 2012

Not thinking, I initially placed a copyright message in this blog’s footer claiming “all rights reserved”. A friend pointed out that my real goal in running this blog—to share ideas in the hopes of giving back to the open source community (and maybe getting my name out there a bit)—goes against a strict copyright.

I’ve known about Creative Commons licensing for some time, but, for whatever reason, didn’t think to use it on this site. I’m pleased to announce that this blog is now licensed under a Creative Commons Attribution-ShareAlike 3.0 Unported License. I strongly encourage you to consider using this license for your own works.

5 Things to Know About Python

Posted on July 13, 2012

My brother-in-law is a recent materials science grad. He knows MATLAB but asked my advice on becoming a better programmer. I suggested he teach himself Python and then sent him this email:

Here’s what you need to know to get started with Python:

It’s already on your Mac: open a terminal (Applications -> Utilities -> Terminal.app) and type “python”, then keep Terminal pinned to your dock.
You’ll want a real text editor. Sublime Text is good but asks you for money (you don’t have to pay). TextWrangler is less good but actually free.
The official Python Tutorial is probably the best way to get started.
Python is not a strongly typed language (like C++ or Java), which means you get ease-of-use at the cost of performance. If you learn Python, you’ll be able to quickly pick up any scripting language (Ruby, PHP, JavaScript, Perl), and you’ll find Java horribly verbose.
The SciPy library effectively gives Python the same scientific capabilities as MATLAB.

Also, FYI, here’s “the” list of programming language dominance.

3D Maps Could Lead to Great New Applications

Posted on July 11, 2012

As the world gets ready for both Google and Apple’s competing 3D map services, a lot of people are wondering just why we need 3D maps in the first place. 3D maps are cool and all, but are they game changing? Are they actually useful?

While I haven’t seen any killer 3D map apps yet, the truth is detailed 3D maps of the world’s cities could enable some really cool new applications. Of course, the utility of these mapping services will depend on how much access is given to developers.

The computer vision literature is filled with high-tech algorithms that could exploit a 3D model and mobile camera. For example, “SLAM” (Simultaneous Localization and Mapping) is a break-through algorithm now used frequently in robotics research that allows a robot (or mobile device) to build a 3D map of its surroundings from a video feed and localize itself against this map. It’s not a far stretch of imagination to think that using a 3D map from the web could help solve this problem faster and more accurately. And there other algorithms that match a 2D image to a 3D model.

If mobile developers are able to access the geometries of these 3D models (either as point clouds or meshes), get ready for some truly amazing augmented reality apps. However, if the 3D models are confined in proprietary viewing applications (like Google Earth or Apple’s new map application), I think these 3D maps will quickly lose their novelty.

3D Models, if implemented correctly, could enable new augmented reality applications.

Thoughts on the Intel Xeon Phi (Knights Corner) Announcement

Posted on July 5, 2012

Intel’s Rajeeb Hazra announced an interesting new product last month at ISC 2012. Knights Corner is the first of the new Xeon Phi brand, and it’s a co-processor that looks roughly like a GPU without the display ports. Intel has referred to this architecture as “Many Integrated Core” (MIC), but it’s not clear to me whether that name is being dropped. So far we have very little technical information. Anandtech has a good summary of information that’s been made available. There’s a “solution brief” on the Intel website indicating Knights corner uses a 22 nm process, Intel’s new 3D Tri-Gate transistors, and that the chip contains “more than 50 cores” (shall we assume 64?).

Is it a GPU? No, but it’s not far off.

Simple Cores

Intel follows the trend we’re seeing with most Chip Multi-Processors (CMPs), in which complicated cores are being thrown out in favor of dozens of simpler cores. We clearly see this in GPUs (where the processing cores are essentially single-precision floating point units), and it’s also the cornerstone of Tilera’s TILE architecture. Knights Corner cores are Pentiums with 64-bit floating point units, which should provide strong compute capability per core for a variety of applications, while still boasting power efficiency thanks to the state-of-the-art 22 nm process.

It’s the Network

I can only speculate about the on-chip network. Intel released a MIC prototype (codenamed Knights Ferry) to developers in 2010. Knights Ferry featured a 1,024-bit ring bus network connecting its 32 cores to memory, and the entire board had 2 GB shared DDR memory with a coherent 8 MB L2 cache (thanks Wikipedia).

What will we see in Knights Corner? The marketing material is really pushing programmability and compatibility with x86. I would guess we’ll see more of the same (global DDR memory, coherent L2 cache, and a ring bus network). But I have to ask, have we now reached the scalability limit of a shared memory architecture? One would think at some point CMPs will have to embrace Non-Uniform Memory Access (NUMA) and start treating processing cores (or groups of them) as network nodes. Once these on-chip networks get large enough, a shared global memory is going to be the bottleneck for almost every application. Developers will need some control over the communication patterns in order to get optimal performance.

How Do We Program It?

This architecture presents a very simple programming model to the developer. OpenCL will make it easy to write code for both data-parallel (like CUDA) and task-parallel (like pThreads) applications. Moving data will be handled under the hood. I expect a host/kernel model, where the main CPU will launch parallel kernels. I’m sure Intel Parallel Studio will provide C++ and Fortran programmers with some cool tools for improving performance. I’m also pleased to see standalone Windows and Linux versions (the product was originally a Microsoft Visual Studio plug-in). It would also be nice if Intel releases a free Xeon Phi Linux SDK, which will encourage academia to try out this new product (assuming the cost for the actual card isn’t over the top). If they provide pThreads support, I have some code I’d love to benchmark.

What This Chip Is Not

This is potentially great for scientific research and other High Performance Computing (HPC) applications, but it’s not going to do much for the armies of developers doing “real work” using distributed databases and writing MapReduce jobs. Someday there will be a market for a cluster-on-chip: distributed memory with NUMA, addressable processing cores, and real message passing.

My Take

I’d really like to try this out for math-intensive HPC applications. I’m sure we’ll see Knights Corner in many of the biggest supercomputers. I predict CUDA developers will embrace the Xeon Phi if the price is right (they’ll need to get it down around $1,000 after a high initial launch price) and the software ecosystem is strong.

Independence Day

Posted on July 4, 2012

For those of us who have worked in defense, let’s remember our goal is to make sure American service members never enter a fair fight.

Made in the USA. USS Lake Erie (CG-70) with its AN/SPY-1A/B radar and Aegis Ballistic Missile Defense System.