Registers of the Itanium CPU Architecture

In this second installment of the CPU register series, I take a look at the Itanium CPUs. Intel and HP designed Itanium throughout the 1990s. Intel hoped that it would be the successor to the old x86 architecture, with the bonus of not being legally obliged to share its secrets with anyone else (AMD specifically). When it went on the market in 2001, it was super expensive and its performance was not competitive with x86. Itanium did have x86 emulation, but it was not fast enough to be useful. Meanwhile, AMD was busy extending x86 to 64 bits, which proved to be the winning strategy.

Hopefully this will be less complicated than x86, but the way Itanium uses its registers is vastly different. Itanium is a very long instruction word (VLIW) design, though Intel likes to call it explicitly parallel instruction computing (EPIC). Instructions are bigger than in most other CPU designs, and they are grouped so that a single fetch can engage several execution units and registers at once. Compare that to x86's complex instruction set computing (CISC) design, which uses smaller instructions and fewer registers, but can be tricked out with out-of-order execution, register renaming, and branch prediction. In other words, Itanium relies on compile-time optimizations rather than runtime optimizations.
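To make the "bigger instructions" point concrete: as far as I know, Itanium packs three 41-bit instruction slots plus a 5-bit template into each 128-bit bundle, and the template tells the hardware which kinds of execution units the slots need. The Python sketch below only illustrates that bit layout. The function names are my own invention, and it makes no attempt to decode real opcodes or templates.

```python
# Bit layout of one bundle: 5 template bits, then three 41-bit slots.
TEMPLATE_BITS = 5
SLOT_BITS = 41
SLOTS_PER_BUNDLE = 3
BUNDLE_BITS = TEMPLATE_BITS + SLOTS_PER_BUNDLE * SLOT_BITS  # 5 + 3 * 41

TEMPLATE_MASK = (1 << TEMPLATE_BITS) - 1
SLOT_MASK = (1 << SLOT_BITS) - 1


def pack_bundle(template, slots):
    """Combine a template and three slot values into one bundle integer."""
    if not 0 <= template <= TEMPLATE_MASK:
        raise ValueError("template must fit in 5 bits")
    if len(slots) != SLOTS_PER_BUNDLE:
        raise ValueError("a bundle holds exactly three slots")
    bundle = template  # template lives in the lowest bits
    for index, slot in enumerate(slots):
        if not 0 <= slot <= SLOT_MASK:
            raise ValueError("each slot must fit in 41 bits")
        bundle |= slot << (TEMPLATE_BITS + index * SLOT_BITS)
    return bundle


def unpack_bundle(bundle):
    """Split a bundle integer back into (template, (slot0, slot1, slot2))."""
    if not 0 <= bundle < (1 << BUNDLE_BITS):
        raise ValueError("bundle must fit in 128 bits")
    template = bundle & TEMPLATE_MASK
    slots = tuple(
        (bundle >> (TEMPLATE_BITS + index * SLOT_BITS)) & SLOT_MASK
        for index in range(SLOTS_PER_BUNDLE)
    )
    return template, slots
```

Packing and then unpacking should give back the same template and slots, which is a quick way to convince yourself the three slots and the template really do fill all 128 bits.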

There are 128 general purpose (integer) registers GR0-GR127, each 64 bits wide plus one extra bit, for 65 bits in total; 128 82-bit floating point registers FR0-FR127; 64 one-bit predicate registers PR0-PR63; eight 64-bit branch registers BR0-BR7; a 64-bit instruction pointer IP; and a 38-bit current frame marker CFM. The extra bit on each general purpose register is the not-a-thing (NaT) bit, which marks a value as invalid, for instance when a speculative load fails. I can see it being useful for nulls. The floating point registers have no separate bit for this and use a special NaTVal encoding instead. The first 32 general purpose registers (GR0-GR31) are static and always visible, while the last 96 (GR32-GR127) are stacked: each function only sees the slice of them that it has allocated as its frame. By software convention, GR1 is the global pointer and GR12 is the stack pointer. GR0 is hardwired to 0, FR0 to 0.0, FR1 to 1.0, and PR0 to 1.
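Here is a toy model of that register file in Python, mostly to show the hardwired registers and how a not-a-thing bit tags along with a value. The class and method names are hypothetical, the floating point registers are plain Python floats rather than the real hardware format, and the only instruction modeled is an integer add. Treat it as a sketch of the idea, not of the real hardware.

```python
MASK64 = (1 << 64) - 1


class RegisterFile:
    """Toy Itanium-style register file (illustrative only)."""

    def __init__(self):
        self._gr = [0] * 128        # 64-bit integer values
        self._nat = [False] * 128   # one not-a-thing bit per general register
        self._fr = [0.0] * 128      # FR0 stays 0.0
        self._fr[1] = 1.0           # FR1 stays 1.0
        self._pr = [False] * 64
        self._pr[0] = True          # PR0 always reads as true
        self.br = [0] * 8           # branch registers
        self.ip = 0                 # instruction pointer

    def read_gr(self, n):
        """Return (value, nat_bit) for general register n."""
        if not 0 <= n < 128:
            raise IndexError("no such general register")
        return self._gr[n], self._nat[n]

    def write_gr(self, n, value, nat=False):
        if not 0 < n < 128:
            # In this sketch, writing GR0 is simply rejected.
            raise ValueError("GR0 is read-only; valid targets are GR1-GR127")
        self._gr[n] = value & MASK64
        self._nat[n] = nat

    def read_fr(self, n):
        return self._fr[n]

    def read_pr(self, n):
        return self._pr[n]

    def add(self, dest, src_a, src_b):
        """dest = src_a + src_b; the result is NaT if either source is NaT."""
        value_a, nat_a = self.read_gr(src_a)
        value_b, nat_b = self.read_gr(src_b)
        self.write_gr(dest, value_a + value_b, nat=nat_a or nat_b)
```

The interesting part is add: once one input is marked not-a-thing, the result is too, so a bad value stays flagged until some later code checks for it.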

Itanium sections off regions of its massive register array and uses them as stack frames. The idea was to keep as much data in registers as possible, thus enabling independent processing of large data sets. When things get crowded, frames are spilled into memory and filled back when necessary. Depending on the app, I think this comes off as rather wasteful: lots of registers could be sitting idle, holding data from functions 1 to N levels up the call stack that the current subroutine can't access and doesn't care about.
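The spill and fill behavior can be sketched with a little bookkeeping in Python. This is a heavily simplified model under my own assumptions: every call asks for a fixed number of registers, frames never overlap (on the real chip a caller's output registers become the callee's inputs), and only counts are tracked, not values. The class name and methods are made up for illustration.

```python
PHYSICAL_STACKED = 96  # GR32-GR127


class RegisterStack:
    """Counts how many stacked registers live in the chip versus in memory."""

    def __init__(self):
        self.frames = []   # registers held by each active frame, oldest first
        self.spilled = 0   # registers from the oldest frames now in memory

    def resident(self):
        """Registers currently held in the physical register file."""
        return sum(self.frames) - self.spilled

    def call(self, size):
        """Enter a function needing `size` registers; return how many spill."""
        if not 0 <= size <= PHYSICAL_STACKED:
            raise ValueError("a frame cannot exceed 96 registers")
        self.frames.append(size)
        overflow = max(self.resident() - PHYSICAL_STACKED, 0)
        self.spilled += overflow   # push the oldest registers out to memory
        return overflow

    def ret(self):
        """Leave the current function; return how many registers are filled."""
        if not self.frames:
            raise RuntimeError("return without a matching call")
        self.frames.pop()
        # Everything below the new current frame may stay in memory,
        # but the current frame itself must be fully resident again.
        below = sum(self.frames[:-1])
        filled = max(self.spilled - below, 0)
        self.spilled -= filled
        return filled
```

With three nested calls of 40 registers each, the third call has to push 24 registers out to memory, and they only come back once the call chain unwinds to the frame that owns them. In the meantime those other registers just sit there, which is the waste I was grumbling about.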

The good news is that Itaniums were never really popular, affordable, or that fast for most things. What little market share they have is being eroded away, but since they have found their way into things like mainframes and other super-reliable systems, these machines aren't going to die out overnight, or even over a decade. At best, it's a novelty; at worst, it's a waste of money. It didn't get called the Itanic for nothing.
