Floating-point Conversion Operations for Streaming SIMD Extensions 2

Each conversion intrinsic takes one data type and performs a conversion to a different type. Some conversions such as _mm_cvtpd_ps result in a loss of precision. The rounding mode used in such cases is determined by the value in the MXCSR register. The default rounding mode is round-to-nearest. Note that the rounding mode used by the C and C++ languages when performing a type conversion is to truncate. The _mm_cvttpd_epi32 and _mm_cvttsd_si32 intrinsics use the truncate rounding mode regardless of the mode specified by the MXCSR register.

The conversion-operation intrinsics for Streaming SIMD Extensions 2 (SSE2) are listed in the following table followed by detailed descriptions.

For detailed information about an intrinsic, click on that intrinsic name in the following table.

The results of each intrinsic operation are placed in registers. The information about what is placed in each register appears in the tables below, in the detailed explanation of each intrinsic. R, R0, R1, R2 and R3 represent the registers in which results are placed.

The prototypes for SSE2 intrinsics are in the emmintrin.h header file.

Intrinsic Name	Operation	Corresponding SSE2 Instruction
_mm_cvtpd_ps	Convert DP FP to SP FP	CVTPD2PS
_mm_cvtps_pd	Convert from SP FP to DP FP	CVTPS2PD
_mm_cvtepi32_pd	Convert lower integer values to DP FP	CVTDQ2PD
_mm_cvtpd_epi32	Convert DP FP values to integer values	CVTPD2DQ
_mm_cvtsd_si32	Convert lower DP FP value to integer value	CVTSD2SI
_mm_cvtsd_ss	Convert lower DP FP value to SP FP	CVTSD2SS
_mm_cvtsi32_sd	Convert signed integer value to DP FP	CVTSI2SD
_mm_cvtss_sd	Convert lower SP FP value to DP FP	CVTSS2SD
_mm_cvttpd_epi32	Convert DP FP values to signed integers	CVTTPD2DQ
_mm_cvttsd_si32	Convert lower DP FP to signed integer	CVTTSD2SI
_mm_cvtpd_pi32	Convert two DP FP values to signed integer values	CVTPD2PI
_mm_cvttpd_pi32	Convert two DP FP values to signed integer values using truncate	CVTTPD2PI
_mm_cvtpi32_pd	Convert two signed integer values to DP FP	CVTPI2PD
_mm_cvtsd_f64	Extract DP FP value from first vector element	None

__m128 _mm_cvtpd_ps(__m128d a)

Converts the two DP FP values of a to SP FP values.

R0	R1	R2	R3
(float) a0	(float) a1	0.0	0.0

__m128d _mm_cvtps_pd(__m128 a)

Converts the lower two SP FP values of a to DP FP values.

R0	R1
(double) a0	(double) a1

__m128d _mm_cvtepi32_pd(__m128i a)

Converts the lower two signed 32-bit integer values of a to DP FP values.

R0	R1
(double) a0	(double) a1

__m128i _mm_cvtpd_epi32(__m128d a)

Converts the two DP FP values of a to 32-bit signed integer values.

R0	R1	R2	R3
(int) a0	(int) a1	0x0	0x0

int _mm_cvtsd_si32(__m128d a)

Converts the lower DP FP value of a to a 32-bit signed integer value.

R
(int) a0

__m128 _mm_cvtsd_ss(__m128 a, __m128d b)

Converts the lower DP FP value of b to an SP FP value. The upper SP FP values in a are passed through.

R0	R1	R2	R3
(float) b0	a1	a2	a3

__m128d _mm_cvtsi32_sd(__m128d a, int b)

Converts the signed integer value in b to a DP FP value. The upper DP FP value in a is passed through.

R0	R1
(double) b	a1

__m128d _mm_cvtss_sd(__m128d a, __m128 b)

Converts the lower SP FP value of b to a DP FP value. The upper value DP FP value in a is passed through.

R0	R1
(double) b0	a1

__m128i _mm_cvttpd_epi32(__m128d a)

Converts the two DP FP values of a to 32-bit signed integers using truncate.

R0	R1	R2	R3
(int) a0	(int) a1	0x0	0x0

int _mm_cvttsd_si32(__m128d a)

Converts the lower DP FP value of a to a 32-bit signed integer using truncate.

R
(int) a0

__m64 _mm_cvtpd_pi32(__m128d a)

Converts the two DP FP values of a to 32-bit signed integer values.

R0	R1
(int)a0	(int) a1

__m64 _mm_cvttpd_pi32(__m128d a)

Converts the two DP FP values of a to 32-bit signed integer values using truncate.

R0	R1
(int)a0	(int) a1

__m128d _mm_cvtpi32_pd(__m64 a)

Converts the two 32-bit signed integer values of a to DP FP values.

R0	R1
(double)a0	(double)a1

_mm_cvtsd_f64(__m128d a)

This intrinsic extracts a double precision floating point value from the first vector element of an __m128d. It does so in the most efficient manner possible in the context used. This intrinsic does not map to any specific SSE2 instruction.