Stuff I read last week

About: A collection of links and short commentary, published weekly. In theory the inclusion criteria are for it to be something I read last week (duh) and was either particularly interesting or something I might want to refer to later. Subscribe with RSS or see my main blog for long-form writing.

2017-11-05

Splitting Strings How and why splitting and empty string is different across programming languages.
How I hacked Google’s bug tracking system
Creating a bogus @google.com address using one bug, and then getting rights to access the details of all (most?) bugs in the system.
There's a funny thing where for most software vulnerabilities a writeup like this can explain exactly what happened why. But then for web-based services all you see is a bug that looks totally braindead. It's user accounts! how hard can that possibly be?
I work on stuff related to security of account systems at the moment; turns out that just like in every other area of modern computing, it's incredibly complicated.
‘I forgot my PIN’: An epic tale of losing $30,000 in bitcoin
You've forgot the PIN and passphrase of your hardware bitcoin wallet. Must be out of luck, those things can't possibly have trivial vulnerabilities. I mean, surely those sophisticated cryptocurrency enthusiasts would be able to spot the snakeoil a mile away. Oh... No?
Hotswapping Haskell Live code reloading in Haskell for a counter-abuse system at Facebook. The HN comments are predictable in a sad way: why would anyone want to do that? Surely there must be better ways, and all the engineers working on this were total morons.
My VM is Lighter (and Safer) than your Container
Rewriting the Xen control plane to optimize the boot time of thousands of tiny VMs. Benchmarked against Docker, which seems just absurdly bad at this one.
(Not impressed by the numbers outside of booting with regard to "lightness". E.g. the throughput and scaling numbers for the personal firewall application are just miserable.)
More Taste: Less Greed? > I decided to discover why the system was so much larger and (apparently) complicated than previous UNIX systems. I thought that a good way to do this was gradually to replace parts of it, and compare the size and complexity of the old and new versions.

2017-10-30

Filesystem error handling Reproducing a 2005 paper on how Linux filesystems expose disk corruption to userspace.
On the passive measurability of QUIC An intro to my pet subject, the measurability of QUIC, and a new proposal needing just a single spin-bit. Good for efficiency, and surely there can't be any privacy objections. (Haha, only kidding. Of course somebody immediately objected.)

2017-10-16

Branch Prediction and the Performance of Interpreters – Don’t Trust Folklore > We show that the accuracy of branch prediction on interpreters has been dramatically improved over the three last Intel processor generations. This accuracy on Haswell, the most recent Intel processor generation, has reached a level where it can not be considered as an obstacle for performance anymore.
Serving 100 Gbps from an Open Connect Appliance Network stack improvements for doing 100Gbps of TLS traffic with in-kernel networking (on FreeBSD).
Breaking DKIM - on Purpose and by Chance > This article questions the quality of this proof by showing how fragile DKIM is as used in practice. It gets shown how in relevant cases the content of a mail can be changed without invalidating the DKIM signature, thus severely undermining the trust one should have in the signature. It gets also shown how easily DKIM breaks by chance and makes the recipient believe that the mail was spoofed even though it was not. And finally it is shown how DKIM can be used properly to actually meet most of the trust expected from it.
Modern anti-spam and E2E crypto The tradeoff between privacy and fighting abuse, and what that means for encryption in decentralized systems like email.
Dangers of CSV Injection Really didn't expect spreadsheets to work like this.
The web at maximum FPS: How WebRender gets rid of jank A very understandable walk through the new Firefox rendering infrastructure.

2017-09-26

Proximity of "house address twins"
It's possible for multiple houses (e.g. in different cities) to have the same street name and number. What's the closest pair of such "twins" in the UK?

I truly appreciate the dedication that went into researching this bit of trivia.
What’s So Bad About POSIX I/O?
Why POSIX filesystem semantics aren't a good fit for large scale systems. (The funny thing is that it never occurred to me that anyone would use POSIX APIs in a modern distributed context. But apparently in supercomputing they do).
Mison: A Fast JSON Parser for Data Analytics Execute queries on a JSON file without parsing the file proper. Do a quick SIMD-based pass that gets the field/value locations. Do a full parse of a small sample of the file. Use the sample to predict the structural locations of the fields the user is interested in. Combine these bits of information to speculatively find the physical locations of the relevant fields.
COST in the land of databases > In the past two years the database community has gotten substantially worse at doing graph processing with SQL. I wouldn't recommend it, really. They've gone from a 16-core machine being only 10x slower than a single-threaded for-loop, to being up to 1000x slower but only on graphs up to 500MB in size.
Rendezvous Hashing: My Baseline “Consistent” Distribution Method > Regardless of the reason for consistent hashing’s popularity, I feel the go-to technique should instead be rendezvous hashing. Its basic form is simple enough to remember without really trying (one of those desert island algorithms), it is more memory efficient than consistent hashing in practice, and its downside–a simple implementation assigns a location in time linear in the number of hosts–is not a problem for small deployments, or even medium (a couple racks) scale ones if you actually think about failure domains.
Building multiplayer games with Socket.io and HTML5 Canvas The key insight here is speculation that the reason all the games like agar.io control like crap (small acceleration, tons of momentum) is that it allows for hiding all the networking flakiness.

2017-09-11

Jonesforth A Forth interpreter written in assembly, with extensive comments explaining the data structures and the concepts behind threaded code.
Creating a language using only assembly language
How do you enjoy the process of creating a new language, when you've been writing compilers for a long time? By adding artificial restrictions. Only assembly; no libraries, programming languages, or code generators.

Implement a minimal BCPL-like language in assembly. Then use that to implement a Lisp interpreter, that interpreter to create a reasonably featured VM (e.g. garbage collector, delim. continuations). Write a compiler, assembler, disassembler, linker targeting the VM. Then use these tools to write the language you originally wanted with objects, pattern matching, and non-sexp macros.
JavaScript: The Curious Case of null >= 0 How the implicit type coercions in == and > are inconsistent, and create some very odd looking results.
Why Command And Vector Processors Rock > You see, the Amiga documented its command processor. The designers wanted you to write programs that ran on it. They wanted you to use it for doing all sorts of clever things. They recognized that the power to operate the underlying horsepower directly was something that could amplify the capabilities of a system way past the limits of its original design.

2017-09-04

What makes a good REPL?
Thoughts about what features a REPL (and the language being evaluated in the REPL!) should have to be useful.
Structure of a LISP system using two-level storage (1966)
Read this as part of some archaeology into numeric representation in early Lisp systems. But it actually turned out to be pretty neat systems paper in general. One thing that's striking is how readable this 50 year old paper still is. The vocabulary of systems programming has changed surprisingly little (just switched from words to bytes), and even the problems being solved are at the core the same. It's all about memory hierarchies, even at the dawn of computing.

Paper describes an early version of BBN Lisp for a machine with 16K words of core memory, and 88K words of absurdly slow drum memory. Hardware has no paging support. How do you make efficient use of the drum memory, to fit in meaningful programs? So you need to somehow do paging in software, and reorganize the data layouts to minimize pointer chasing and page faults. (The latter bit is what I was really interested in, while looking at the history of tagged pointers).
A map of the universe of 65536-bit sets
Take a bitset implementation that splits the set into blocks, and adaptively uses the best data representation for each block. How do you determine which internal representations actually make sense?
More Performance, More Gameplay
Slides with anecdotes on game optimization in general, but on the Jaguar CPU in particular. E.g. didn't realize you really have to use SIMD on those CPUs, or you can't even use the full cache bandwidth. Neat example of a custom spatial database near the end.
On using Internet RTT measurements for accurate geolocation, a white paper
"tl;dr don't bother".
High-Level SIL Optimization in the Swift Compiler
The painful step-by-step journey of implementing a seemingly trivial optimization in a production compiler. Especially the "Lessons Learned" part is great; I'm fighting the temptation not to just quote all of it here.

> I switched to a 12” MacBook before I started working on my swiftc PR. It was so slow that I was only able to iterate on the code once a day, because a single compile and test run would take all night. I ended up buying a top-of-the-line 15” MacBook Pro because it was the only way to iterate on the codebase more than once a day.

> It’s really easy to break swiftc because of how complex it is. My original pull request was approved and merged in a month. Despite only having about 200 lines of changes, I received 125 comments from six reviewers. Even after that much scruitiny, it was reverted almost immediately because it introduced a memory leak that a seventh person found after running a four hour long standard library integration test.
Bitcoin's Academic Pedigree
Yes, it's a Bitcoin article. But it's also really good!

> Bitcoin neatly avoids the double-spending problem plaguing proof-of-work-as-cash schemes because it eschews puzzle solutions themselves having value.

2017-08-28

Ranges, Coroutines, and React: Early Musings on the Future of Async in C++
Example of how some of the new features in the C++ standard will work together.
Examining a vintage RAM chip, I find a counterfeit with a Touch-Tone dialer inside
A chip reverse engineering story with the best digressions. It's not just about figuring out that the supposed RAM chip is actually a touch tone dialtone generator; it's also figuring out the maths on every dialtone generator on the market to exactly identify this one. And then going into some semiconductor physics for good measure.
The Three Projections of Doctor Futamura
An introduction to Futamura projections, phrased in terms of physical objects rather than partial evaluation of source code.
How do you cut a monolith in half?
> In practice, a message broker is a service that transforms network errors and machine failures into filled disks. Then you add more disks.

On why you probably want either a load balancer or a database, not a pubsub system.
Yesterday's NeWS
Slava Pestov reads through The NeWS Book: An Introduction to the Network/Extensible Window System from 1989. I never knew anything about NeWS, except from the Unix Haters Handbook X11 rant, so it was nice to fill it in with some more facts.
Designing a Tree Diff Algorithm Using Dynamic Programming and A*
> Specifically, what I needed was mostly like a tree diff but I wasn’t optimizing for the same thing as other algorithms, what I wanted to optimize for was resulting file size, including indentation.

Many people don't appreciate how complicated handling configuration data is in the real world. (Pretty much every one of my jobs has at some point turned into a configuration handling nightmare). This is a good story on exactly that. There's a need for a seemingly very simple config manipulation operation, but a couple of weeks later you find yourself doing dynamic programming.

(Also, this is not just a good story, but a great example on just how to present an algorithm).
A history of branch prediction up to 1995
A walk through early CPU branch prediction strategies.
BitFunnel: Revisiting Signatures for Search
How to make a practical web search system using bloom filters rather than an inverted index. I especially like the notes on how classical problems of signature-based don't really matter in this domain. E.g. a modest amount of false positives is not a problem, since the full result set needs to be scored no matter what. Or how sharding the index by number-of-unique-terms was impractical in the past due to excessive disk seeks, but no problem when the index needs to be sharded to hundreds of machines anyway.

2017-08-21

How Postgres Makes Transactions Atomic
A deep dive of how MVCC works in Postgres, from concepts all the way down to the exact source code.
Reverse Engineering x86 Processor Microcode
Reverse engineering the microcode in Athlons and Phenoms. Half of this work was done by mutating existing microcode update files, and probing the behavior of various instructions in a minimal operating system. The other half was done by delayering a CPU and using a electron microscope to find and read the microcode ROM.

Then write a proof-of-concept remote triggerable trojan in microcode.
Vista Multimedia Playback and Network Throughput
Another trip to crazytown. How Windows Vista would artificially limit network throughput if any sound was playing. (With an effect that would be magnified linearly as more NICs were added to the machine). Brought up in the HN discussion of my PS4 download speed post.
A Review of Perl 6
I like the idea of treating programming languages as a creative work to be reviewed critically.
Why I do not like Hugo
A case study in how not to change defaults when evolving a program from one use case to another. (Any blog platform will inevitably try to transform into a general purpose CMS and call a dystopian hellscape of ecommerce plugins an "ecosystem"). But I can't understand how anyone would think that changing the default RSS feed item count from 10 (which sounds pretty standard) to infinite could be the right thing.
Issues with transparent huge pages
A good discussion on the problems with transparent huge pages. (I turn them off at work for our data analysis machines, due to some absolutely crippling throughput issues they cause. Really need to check whether that server is already running on a 4.6+ kernel, with the supposedly improved THP behavior mentioned in this thread.)

2017-08-14

Linux Load Averages: Solving the Mystery
Why does Linux load average include processes that are blocked on swapping. (Never realized they did; thought it used the classical definition). You know it's good software archaeology when it's treating with something that's still relevant today, and the search bottoms out in MACRO-10 code.
Font-size: An Unexpectedly Complex CSS Property
> font-size is the worst.

Just how hard coan it be to determine which font size should be used for an element based on the CSS? Pretty damn hard, it turns out.

> To recap, we are now at four different notions of font size being inherited: ...
UnityScript’s long ride off into the sunset
Why and how to deprecate a programming language.
Why Github can't host the Linux Kernel Community
The thesis here is that the Linux kernel isn't a monorepo. Instead it's a monotree with multiple repositories. There are multiple repositories, e.g. the main one by Linus, subsystem specific ones, etc. Hence not a monorepo. But all of those repositories are rooted in the same tree, with changes flowing between the repos arbitrarily (so they're not polyrepos, which would generally need to be totally independent of each other). Hence the need for the new term.

Unsurprisingly, Github doesn't support this fairly unique workflow.
Papers I like (part 1)
Computer science paper recommendations from Fabien Giesen, with long summaries of exactly why these papers are particularly useful/interesting.
Structures-of-arrays rather than arrays-of-structures on the 6502
A HN comment from 2015 explaining why the 6502 instruction set encouraged a SOA layout over AOS.

◀ Earlier | Index | Later ▶