Intel tweaks SSE 4 to speed text processing
String theory
18th March 2008 10:52 GMT
Intel's 45nm 'Nehalem' processor design will incorporate the second generation of the chip maker's SSE 4 technology. For now, the company is calling the post-'Penryn' Streaming SIMD Extensions instruction set SSE 4.2.
Nehalem's implementation of SSE 4 essentially matches that of Penryn. The key additions centre on the Application-Targeted Accelerators (ATAs) Intel introduced as part of SSE 4: Penryn got two of these, and Nehalem will get seven more.
Nehalem's ATAs centre on text and string processing, Intel said yesterday. The need to accelerate text handling may sound rather unnecessary in this era of pervasive multimedia and intensive 3D graphics apps, but Intel claims the ATAs will benefit a range of important tasks, from virus signature scanning to parsing XML files.
Its pitch is that these are everyday routines: the faster Nehalem can run them, the sooner the tasks will be completed and the sooner the chip can shut down on-die components to conserve energy.
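To give a flavour of the kind of routine in question, here is a minimal sketch in C of a scan for the first "interesting" byte in a buffer, such as the '<' or '&' an XML parser looks for. Intel did not name individual instructions in this announcement, so the use of the SSE 4.2 packed string compare (PCMPESTRI, exposed to C as the `_mm_cmpestri` intrinsic) is an assumption on our part, and `find_first_of` is a hypothetical helper, not Intel code. The SSE path is only built when the compiler is told to target SSE 4.2 (for example with `-msse4.2` on GCC or Clang); otherwise the plain byte-by-byte loop does all the work.

```c
#include <stddef.h>
#include <string.h>
#ifdef __SSE4_2__
#include <nmmintrin.h>  /* SSE 4.2 intrinsics */
#endif

/* Hypothetical helper: return the index of the first byte in text[0..len)
 * that equals any byte in set[0..set_len), or len if there is none. */
size_t find_first_of(const char *text, size_t len,
                     const char *set, size_t set_len)
{
    size_t i = 0;

#ifdef __SSE4_2__
    /* The string compare works on 16-byte registers, so the fast path
     * applies only when the byte set fits into one register. */
    if (set_len > 0 && set_len <= 16) {
        char set_buf[16] = {0};
        memcpy(set_buf, set, set_len);
        const __m128i needles = _mm_loadu_si128((const __m128i *)set_buf);

        /* Test 16 bytes of text against the whole set per iteration. */
        for (; i + 16 <= len; i += 16) {
            const __m128i chunk =
                _mm_loadu_si128((const __m128i *)(text + i));
            /* Returns the index of the first matching byte in chunk,
             * or 16 when no byte of chunk is in the set. */
            int idx = _mm_cmpestri(needles, (int)set_len, chunk, 16,
                                   _SIDD_UBYTE_OPS | _SIDD_CMP_EQUAL_ANY |
                                   _SIDD_LEAST_SIGNIFICANT);
            if (idx < 16)
                return i + (size_t)idx;
        }
    }
#endif

    /* Scalar loop: handles the tail, and everything on non-SSE 4.2 builds. */
    for (; i < len; i++) {
        if (set_len > 0 && memchr(set, text[i], set_len) != NULL)
            return i;
    }
    return len;
}
```

Calling `find_first_of("hello <b>", 9, "<&", 2)` yields 6, the position of the '<'. The point of the vector path is the one Intel is making: a single instruction checks 16 bytes against the whole set, where the scalar loop needs a test per byte.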

Intel's 'Nehalem': better at text processing than Penryn
Not that Nehalem's design ignores more advanced data types. Chips based on the design will also speed access to data that isn't aligned with Nehalem's cache structure, such as multimedia code and data. That should allow the CPU to process such information more quickly: sending out a frame to be rendered, for example, then powering down for a longer period - or working on other tasks - before it needs to pick up and process the next frame.
Again, the emphasis is not on raw processing - we know modern CPUs can do video smoothly - but on getting the job finished more quickly, the better to improve power efficiency.
That's the logic behind the re-introduction of HyperThreading with Nehalem. The new design will also be able to handle 33 per cent more micro-ops at any given time than the Penryn architecture can. Micro-ops are the Core-specific instructions into which x86 instructions are decoded when they're loaded into the CPU.

