Skip to content
  • Andre Przywara's avatar
    target-i386: add SSE4a instruction support · d9f4bb27
    Andre Przywara authored
    
    
    This adds support for the AMD Phenom/Barcelona's SSE4a instructions.
    Those include insertq and extrq, which are doing shift and mask on
    XMM registers, in two versions (immediate shift/length values and
    stored in another XMM register).
    Additionally it implements movntss, movntsd, which are scalar
    non-temporal stores (avoiding cache trashing). These are implemented
    as normal stores, though.
    SSE4a is guarded by the SSE4A CPUID bit (Fn8000_0001:ECX[6]).
    
    Signed-off-by: default avatarAndre Przywara <andre.przywara@amd.com>
    Signed-off-by: default avatarAurelien Jarno <aurelien@aurel32.net>
    d9f4bb27