In a strange twist, I ran the test using an nVidia driver with the Intel video chip. Actually the libGL library from nVidia got symlinked by mistake. CPU was virtually untouched during the test but it took about twice as long as with the correct driver. Amazing that it worked at all really! I guess it bypassed the software rendering and somehow libGL made enough of the right calls to make things happen on the card.
Bookmarks