18c2ecf20Sopenharmony_ci| 28c2ecf20Sopenharmony_ci| ssin.sa 3.3 7/29/91 38c2ecf20Sopenharmony_ci| 48c2ecf20Sopenharmony_ci| The entry point sSIN computes the sine of an input argument 58c2ecf20Sopenharmony_ci| sCOS computes the cosine, and sSINCOS computes both. The 68c2ecf20Sopenharmony_ci| corresponding entry points with a "d" computes the same 78c2ecf20Sopenharmony_ci| corresponding function values for denormalized inputs. 88c2ecf20Sopenharmony_ci| 98c2ecf20Sopenharmony_ci| Input: Double-extended number X in location pointed to 108c2ecf20Sopenharmony_ci| by address register a0. 118c2ecf20Sopenharmony_ci| 128c2ecf20Sopenharmony_ci| Output: The function value sin(X) or cos(X) returned in Fp0 if SIN or 138c2ecf20Sopenharmony_ci| COS is requested. Otherwise, for SINCOS, sin(X) is returned 148c2ecf20Sopenharmony_ci| in Fp0, and cos(X) is returned in Fp1. 158c2ecf20Sopenharmony_ci| 168c2ecf20Sopenharmony_ci| Modifies: Fp0 for SIN or COS; both Fp0 and Fp1 for SINCOS. 178c2ecf20Sopenharmony_ci| 188c2ecf20Sopenharmony_ci| Accuracy and Monotonicity: The returned result is within 1 ulp in 198c2ecf20Sopenharmony_ci| 64 significant bit, i.e. within 0.5001 ulp to 53 bits if the 208c2ecf20Sopenharmony_ci| result is subsequently rounded to double precision. The 218c2ecf20Sopenharmony_ci| result is provably monotonic in double precision. 228c2ecf20Sopenharmony_ci| 238c2ecf20Sopenharmony_ci| Speed: The programs sSIN and sCOS take approximately 150 cycles for 248c2ecf20Sopenharmony_ci| input argument X such that |X| < 15Pi, which is the usual 258c2ecf20Sopenharmony_ci| situation. The speed for sSINCOS is approximately 190 cycles. 268c2ecf20Sopenharmony_ci| 278c2ecf20Sopenharmony_ci| Algorithm: 288c2ecf20Sopenharmony_ci| 298c2ecf20Sopenharmony_ci| SIN and COS: 308c2ecf20Sopenharmony_ci| 1. If SIN is invoked, set AdjN := 0; otherwise, set AdjN := 1. 318c2ecf20Sopenharmony_ci| 328c2ecf20Sopenharmony_ci| 2. If |X| >= 15Pi or |X| < 2**(-40), go to 7. 338c2ecf20Sopenharmony_ci| 348c2ecf20Sopenharmony_ci| 3. Decompose X as X = N(Pi/2) + r where |r| <= Pi/4. Let 358c2ecf20Sopenharmony_ci| k = N mod 4, so in particular, k = 0,1,2,or 3. Overwrite 368c2ecf20Sopenharmony_ci| k by k := k + AdjN. 378c2ecf20Sopenharmony_ci| 388c2ecf20Sopenharmony_ci| 4. If k is even, go to 6. 398c2ecf20Sopenharmony_ci| 408c2ecf20Sopenharmony_ci| 5. (k is odd) Set j := (k-1)/2, sgn := (-1)**j. Return sgn*cos(r) 418c2ecf20Sopenharmony_ci| where cos(r) is approximated by an even polynomial in r, 428c2ecf20Sopenharmony_ci| 1 + r*r*(B1+s*(B2+ ... + s*B8)), s = r*r. 438c2ecf20Sopenharmony_ci| Exit. 448c2ecf20Sopenharmony_ci| 458c2ecf20Sopenharmony_ci| 6. (k is even) Set j := k/2, sgn := (-1)**j. Return sgn*sin(r) 468c2ecf20Sopenharmony_ci| where sin(r) is approximated by an odd polynomial in r 478c2ecf20Sopenharmony_ci| r + r*s*(A1+s*(A2+ ... + s*A7)), s = r*r. 488c2ecf20Sopenharmony_ci| Exit. 498c2ecf20Sopenharmony_ci| 508c2ecf20Sopenharmony_ci| 7. If |X| > 1, go to 9. 518c2ecf20Sopenharmony_ci| 528c2ecf20Sopenharmony_ci| 8. (|X|<2**(-40)) If SIN is invoked, return X; otherwise return 1. 538c2ecf20Sopenharmony_ci| 548c2ecf20Sopenharmony_ci| 9. Overwrite X by X := X rem 2Pi. Now that |X| <= Pi, go back to 3. 558c2ecf20Sopenharmony_ci| 568c2ecf20Sopenharmony_ci| SINCOS: 578c2ecf20Sopenharmony_ci| 1. If |X| >= 15Pi or |X| < 2**(-40), go to 6. 588c2ecf20Sopenharmony_ci| 598c2ecf20Sopenharmony_ci| 2. Decompose X as X = N(Pi/2) + r where |r| <= Pi/4. Let 608c2ecf20Sopenharmony_ci| k = N mod 4, so in particular, k = 0,1,2,or 3. 618c2ecf20Sopenharmony_ci| 628c2ecf20Sopenharmony_ci| 3. If k is even, go to 5. 638c2ecf20Sopenharmony_ci| 648c2ecf20Sopenharmony_ci| 4. (k is odd) Set j1 := (k-1)/2, j2 := j1 (EOR) (k mod 2), i.e. 658c2ecf20Sopenharmony_ci| j1 exclusive or with the l.s.b. of k. 668c2ecf20Sopenharmony_ci| sgn1 := (-1)**j1, sgn2 := (-1)**j2. 678c2ecf20Sopenharmony_ci| SIN(X) = sgn1 * cos(r) and COS(X) = sgn2*sin(r) where 688c2ecf20Sopenharmony_ci| sin(r) and cos(r) are computed as odd and even polynomials 698c2ecf20Sopenharmony_ci| in r, respectively. Exit 708c2ecf20Sopenharmony_ci| 718c2ecf20Sopenharmony_ci| 5. (k is even) Set j1 := k/2, sgn1 := (-1)**j1. 728c2ecf20Sopenharmony_ci| SIN(X) = sgn1 * sin(r) and COS(X) = sgn1*cos(r) where 738c2ecf20Sopenharmony_ci| sin(r) and cos(r) are computed as odd and even polynomials 748c2ecf20Sopenharmony_ci| in r, respectively. Exit 758c2ecf20Sopenharmony_ci| 768c2ecf20Sopenharmony_ci| 6. If |X| > 1, go to 8. 778c2ecf20Sopenharmony_ci| 788c2ecf20Sopenharmony_ci| 7. (|X|<2**(-40)) SIN(X) = X and COS(X) = 1. Exit. 798c2ecf20Sopenharmony_ci| 808c2ecf20Sopenharmony_ci| 8. Overwrite X by X := X rem 2Pi. Now that |X| <= Pi, go back to 2. 818c2ecf20Sopenharmony_ci| 828c2ecf20Sopenharmony_ci 838c2ecf20Sopenharmony_ci| Copyright (C) Motorola, Inc. 1990 848c2ecf20Sopenharmony_ci| All Rights Reserved 858c2ecf20Sopenharmony_ci| 868c2ecf20Sopenharmony_ci| For details on the license for this file, please see the 878c2ecf20Sopenharmony_ci| file, README, in this same directory. 888c2ecf20Sopenharmony_ci 898c2ecf20Sopenharmony_ci|SSIN idnt 2,1 | Motorola 040 Floating Point Software Package 908c2ecf20Sopenharmony_ci 918c2ecf20Sopenharmony_ci |section 8 928c2ecf20Sopenharmony_ci 938c2ecf20Sopenharmony_ci#include "fpsp.h" 948c2ecf20Sopenharmony_ci 958c2ecf20Sopenharmony_ciBOUNDS1: .long 0x3FD78000,0x4004BC7E 968c2ecf20Sopenharmony_ciTWOBYPI: .long 0x3FE45F30,0x6DC9C883 978c2ecf20Sopenharmony_ci 988c2ecf20Sopenharmony_ciSINA7: .long 0xBD6AAA77,0xCCC994F5 998c2ecf20Sopenharmony_ciSINA6: .long 0x3DE61209,0x7AAE8DA1 1008c2ecf20Sopenharmony_ci 1018c2ecf20Sopenharmony_ciSINA5: .long 0xBE5AE645,0x2A118AE4 1028c2ecf20Sopenharmony_ciSINA4: .long 0x3EC71DE3,0xA5341531 1038c2ecf20Sopenharmony_ci 1048c2ecf20Sopenharmony_ciSINA3: .long 0xBF2A01A0,0x1A018B59,0x00000000,0x00000000 1058c2ecf20Sopenharmony_ci 1068c2ecf20Sopenharmony_ciSINA2: .long 0x3FF80000,0x88888888,0x888859AF,0x00000000 1078c2ecf20Sopenharmony_ci 1088c2ecf20Sopenharmony_ciSINA1: .long 0xBFFC0000,0xAAAAAAAA,0xAAAAAA99,0x00000000 1098c2ecf20Sopenharmony_ci 1108c2ecf20Sopenharmony_ciCOSB8: .long 0x3D2AC4D0,0xD6011EE3 1118c2ecf20Sopenharmony_ciCOSB7: .long 0xBDA9396F,0x9F45AC19 1128c2ecf20Sopenharmony_ci 1138c2ecf20Sopenharmony_ciCOSB6: .long 0x3E21EED9,0x0612C972 1148c2ecf20Sopenharmony_ciCOSB5: .long 0xBE927E4F,0xB79D9FCF 1158c2ecf20Sopenharmony_ci 1168c2ecf20Sopenharmony_ciCOSB4: .long 0x3EFA01A0,0x1A01D423,0x00000000,0x00000000 1178c2ecf20Sopenharmony_ci 1188c2ecf20Sopenharmony_ciCOSB3: .long 0xBFF50000,0xB60B60B6,0x0B61D438,0x00000000 1198c2ecf20Sopenharmony_ci 1208c2ecf20Sopenharmony_ciCOSB2: .long 0x3FFA0000,0xAAAAAAAA,0xAAAAAB5E 1218c2ecf20Sopenharmony_ciCOSB1: .long 0xBF000000 1228c2ecf20Sopenharmony_ci 1238c2ecf20Sopenharmony_ciINVTWOPI: .long 0x3FFC0000,0xA2F9836E,0x4E44152A 1248c2ecf20Sopenharmony_ci 1258c2ecf20Sopenharmony_ciTWOPI1: .long 0x40010000,0xC90FDAA2,0x00000000,0x00000000 1268c2ecf20Sopenharmony_ciTWOPI2: .long 0x3FDF0000,0x85A308D4,0x00000000,0x00000000 1278c2ecf20Sopenharmony_ci 1288c2ecf20Sopenharmony_ci |xref PITBL 1298c2ecf20Sopenharmony_ci 1308c2ecf20Sopenharmony_ci .set INARG,FP_SCR4 1318c2ecf20Sopenharmony_ci 1328c2ecf20Sopenharmony_ci .set X,FP_SCR5 1338c2ecf20Sopenharmony_ci .set XDCARE,X+2 1348c2ecf20Sopenharmony_ci .set XFRAC,X+4 1358c2ecf20Sopenharmony_ci 1368c2ecf20Sopenharmony_ci .set RPRIME,FP_SCR1 1378c2ecf20Sopenharmony_ci .set SPRIME,FP_SCR2 1388c2ecf20Sopenharmony_ci 1398c2ecf20Sopenharmony_ci .set POSNEG1,L_SCR1 1408c2ecf20Sopenharmony_ci .set TWOTO63,L_SCR1 1418c2ecf20Sopenharmony_ci 1428c2ecf20Sopenharmony_ci .set ENDFLAG,L_SCR2 1438c2ecf20Sopenharmony_ci .set N,L_SCR2 1448c2ecf20Sopenharmony_ci 1458c2ecf20Sopenharmony_ci .set ADJN,L_SCR3 1468c2ecf20Sopenharmony_ci 1478c2ecf20Sopenharmony_ci | xref t_frcinx 1488c2ecf20Sopenharmony_ci |xref t_extdnrm 1498c2ecf20Sopenharmony_ci |xref sto_cos 1508c2ecf20Sopenharmony_ci 1518c2ecf20Sopenharmony_ci .global ssind 1528c2ecf20Sopenharmony_cissind: 1538c2ecf20Sopenharmony_ci|--SIN(X) = X FOR DENORMALIZED X 1548c2ecf20Sopenharmony_ci bra t_extdnrm 1558c2ecf20Sopenharmony_ci 1568c2ecf20Sopenharmony_ci .global scosd 1578c2ecf20Sopenharmony_ciscosd: 1588c2ecf20Sopenharmony_ci|--COS(X) = 1 FOR DENORMALIZED X 1598c2ecf20Sopenharmony_ci 1608c2ecf20Sopenharmony_ci fmoves #0x3F800000,%fp0 1618c2ecf20Sopenharmony_ci| 1628c2ecf20Sopenharmony_ci| 9D25B Fix: Sometimes the previous fmove.s sets fpsr bits 1638c2ecf20Sopenharmony_ci| 1648c2ecf20Sopenharmony_ci fmovel #0,%fpsr 1658c2ecf20Sopenharmony_ci| 1668c2ecf20Sopenharmony_ci bra t_frcinx 1678c2ecf20Sopenharmony_ci 1688c2ecf20Sopenharmony_ci .global ssin 1698c2ecf20Sopenharmony_cissin: 1708c2ecf20Sopenharmony_ci|--SET ADJN TO 0 1718c2ecf20Sopenharmony_ci movel #0,ADJN(%a6) 1728c2ecf20Sopenharmony_ci bras SINBGN 1738c2ecf20Sopenharmony_ci 1748c2ecf20Sopenharmony_ci .global scos 1758c2ecf20Sopenharmony_ciscos: 1768c2ecf20Sopenharmony_ci|--SET ADJN TO 1 1778c2ecf20Sopenharmony_ci movel #1,ADJN(%a6) 1788c2ecf20Sopenharmony_ci 1798c2ecf20Sopenharmony_ciSINBGN: 1808c2ecf20Sopenharmony_ci|--SAVE FPCR, FP1. CHECK IF |X| IS TOO SMALL OR LARGE 1818c2ecf20Sopenharmony_ci 1828c2ecf20Sopenharmony_ci fmovex (%a0),%fp0 | ...LOAD INPUT 1838c2ecf20Sopenharmony_ci 1848c2ecf20Sopenharmony_ci movel (%a0),%d0 1858c2ecf20Sopenharmony_ci movew 4(%a0),%d0 1868c2ecf20Sopenharmony_ci fmovex %fp0,X(%a6) 1878c2ecf20Sopenharmony_ci andil #0x7FFFFFFF,%d0 | ...COMPACTIFY X 1888c2ecf20Sopenharmony_ci 1898c2ecf20Sopenharmony_ci cmpil #0x3FD78000,%d0 | ...|X| >= 2**(-40)? 1908c2ecf20Sopenharmony_ci bges SOK1 1918c2ecf20Sopenharmony_ci bra SINSM 1928c2ecf20Sopenharmony_ci 1938c2ecf20Sopenharmony_ciSOK1: 1948c2ecf20Sopenharmony_ci cmpil #0x4004BC7E,%d0 | ...|X| < 15 PI? 1958c2ecf20Sopenharmony_ci blts SINMAIN 1968c2ecf20Sopenharmony_ci bra REDUCEX 1978c2ecf20Sopenharmony_ci 1988c2ecf20Sopenharmony_ciSINMAIN: 1998c2ecf20Sopenharmony_ci|--THIS IS THE USUAL CASE, |X| <= 15 PI. 2008c2ecf20Sopenharmony_ci|--THE ARGUMENT REDUCTION IS DONE BY TABLE LOOK UP. 2018c2ecf20Sopenharmony_ci fmovex %fp0,%fp1 2028c2ecf20Sopenharmony_ci fmuld TWOBYPI,%fp1 | ...X*2/PI 2038c2ecf20Sopenharmony_ci 2048c2ecf20Sopenharmony_ci|--HIDE THE NEXT THREE INSTRUCTIONS 2058c2ecf20Sopenharmony_ci lea PITBL+0x200,%a1 | ...TABLE OF N*PI/2, N = -32,...,32 2068c2ecf20Sopenharmony_ci 2078c2ecf20Sopenharmony_ci 2088c2ecf20Sopenharmony_ci|--FP1 IS NOW READY 2098c2ecf20Sopenharmony_ci fmovel %fp1,N(%a6) | ...CONVERT TO INTEGER 2108c2ecf20Sopenharmony_ci 2118c2ecf20Sopenharmony_ci movel N(%a6),%d0 2128c2ecf20Sopenharmony_ci asll #4,%d0 2138c2ecf20Sopenharmony_ci addal %d0,%a1 | ...A1 IS THE ADDRESS OF N*PIBY2 2148c2ecf20Sopenharmony_ci| ...WHICH IS IN TWO PIECES Y1 & Y2 2158c2ecf20Sopenharmony_ci 2168c2ecf20Sopenharmony_ci fsubx (%a1)+,%fp0 | ...X-Y1 2178c2ecf20Sopenharmony_ci|--HIDE THE NEXT ONE 2188c2ecf20Sopenharmony_ci fsubs (%a1),%fp0 | ...FP0 IS R = (X-Y1)-Y2 2198c2ecf20Sopenharmony_ci 2208c2ecf20Sopenharmony_ciSINCONT: 2218c2ecf20Sopenharmony_ci|--continuation from REDUCEX 2228c2ecf20Sopenharmony_ci 2238c2ecf20Sopenharmony_ci|--GET N+ADJN AND SEE IF SIN(R) OR COS(R) IS NEEDED 2248c2ecf20Sopenharmony_ci movel N(%a6),%d0 2258c2ecf20Sopenharmony_ci addl ADJN(%a6),%d0 | ...SEE IF D0 IS ODD OR EVEN 2268c2ecf20Sopenharmony_ci rorl #1,%d0 | ...D0 WAS ODD IFF D0 IS NEGATIVE 2278c2ecf20Sopenharmony_ci cmpil #0,%d0 2288c2ecf20Sopenharmony_ci blt COSPOLY 2298c2ecf20Sopenharmony_ci 2308c2ecf20Sopenharmony_ciSINPOLY: 2318c2ecf20Sopenharmony_ci|--LET J BE THE LEAST SIG. BIT OF D0, LET SGN := (-1)**J. 2328c2ecf20Sopenharmony_ci|--THEN WE RETURN SGN*SIN(R). SGN*SIN(R) IS COMPUTED BY 2338c2ecf20Sopenharmony_ci|--R' + R'*S*(A1 + S(A2 + S(A3 + S(A4 + ... + SA7)))), WHERE 2348c2ecf20Sopenharmony_ci|--R' = SGN*R, S=R*R. THIS CAN BE REWRITTEN AS 2358c2ecf20Sopenharmony_ci|--R' + R'*S*( [A1+T(A3+T(A5+TA7))] + [S(A2+T(A4+TA6))]) 2368c2ecf20Sopenharmony_ci|--WHERE T=S*S. 2378c2ecf20Sopenharmony_ci|--NOTE THAT A3 THROUGH A7 ARE STORED IN DOUBLE PRECISION 2388c2ecf20Sopenharmony_ci|--WHILE A1 AND A2 ARE IN DOUBLE-EXTENDED FORMAT. 2398c2ecf20Sopenharmony_ci fmovex %fp0,X(%a6) | ...X IS R 2408c2ecf20Sopenharmony_ci fmulx %fp0,%fp0 | ...FP0 IS S 2418c2ecf20Sopenharmony_ci|---HIDE THE NEXT TWO WHILE WAITING FOR FP0 2428c2ecf20Sopenharmony_ci fmoved SINA7,%fp3 2438c2ecf20Sopenharmony_ci fmoved SINA6,%fp2 2448c2ecf20Sopenharmony_ci|--FP0 IS NOW READY 2458c2ecf20Sopenharmony_ci fmovex %fp0,%fp1 2468c2ecf20Sopenharmony_ci fmulx %fp1,%fp1 | ...FP1 IS T 2478c2ecf20Sopenharmony_ci|--HIDE THE NEXT TWO WHILE WAITING FOR FP1 2488c2ecf20Sopenharmony_ci 2498c2ecf20Sopenharmony_ci rorl #1,%d0 2508c2ecf20Sopenharmony_ci andil #0x80000000,%d0 2518c2ecf20Sopenharmony_ci| ...LEAST SIG. BIT OF D0 IN SIGN POSITION 2528c2ecf20Sopenharmony_ci eorl %d0,X(%a6) | ...X IS NOW R'= SGN*R 2538c2ecf20Sopenharmony_ci 2548c2ecf20Sopenharmony_ci fmulx %fp1,%fp3 | ...TA7 2558c2ecf20Sopenharmony_ci fmulx %fp1,%fp2 | ...TA6 2568c2ecf20Sopenharmony_ci 2578c2ecf20Sopenharmony_ci faddd SINA5,%fp3 | ...A5+TA7 2588c2ecf20Sopenharmony_ci faddd SINA4,%fp2 | ...A4+TA6 2598c2ecf20Sopenharmony_ci 2608c2ecf20Sopenharmony_ci fmulx %fp1,%fp3 | ...T(A5+TA7) 2618c2ecf20Sopenharmony_ci fmulx %fp1,%fp2 | ...T(A4+TA6) 2628c2ecf20Sopenharmony_ci 2638c2ecf20Sopenharmony_ci faddd SINA3,%fp3 | ...A3+T(A5+TA7) 2648c2ecf20Sopenharmony_ci faddx SINA2,%fp2 | ...A2+T(A4+TA6) 2658c2ecf20Sopenharmony_ci 2668c2ecf20Sopenharmony_ci fmulx %fp3,%fp1 | ...T(A3+T(A5+TA7)) 2678c2ecf20Sopenharmony_ci 2688c2ecf20Sopenharmony_ci fmulx %fp0,%fp2 | ...S(A2+T(A4+TA6)) 2698c2ecf20Sopenharmony_ci faddx SINA1,%fp1 | ...A1+T(A3+T(A5+TA7)) 2708c2ecf20Sopenharmony_ci fmulx X(%a6),%fp0 | ...R'*S 2718c2ecf20Sopenharmony_ci 2728c2ecf20Sopenharmony_ci faddx %fp2,%fp1 | ...[A1+T(A3+T(A5+TA7))]+[S(A2+T(A4+TA6))] 2738c2ecf20Sopenharmony_ci|--FP3 RELEASED, RESTORE NOW AND TAKE SOME ADVANTAGE OF HIDING 2748c2ecf20Sopenharmony_ci|--FP2 RELEASED, RESTORE NOW AND TAKE FULL ADVANTAGE OF HIDING 2758c2ecf20Sopenharmony_ci 2768c2ecf20Sopenharmony_ci 2778c2ecf20Sopenharmony_ci fmulx %fp1,%fp0 | ...SIN(R')-R' 2788c2ecf20Sopenharmony_ci|--FP1 RELEASED. 2798c2ecf20Sopenharmony_ci 2808c2ecf20Sopenharmony_ci fmovel %d1,%FPCR |restore users exceptions 2818c2ecf20Sopenharmony_ci faddx X(%a6),%fp0 |last inst - possible exception set 2828c2ecf20Sopenharmony_ci bra t_frcinx 2838c2ecf20Sopenharmony_ci 2848c2ecf20Sopenharmony_ci 2858c2ecf20Sopenharmony_ciCOSPOLY: 2868c2ecf20Sopenharmony_ci|--LET J BE THE LEAST SIG. BIT OF D0, LET SGN := (-1)**J. 2878c2ecf20Sopenharmony_ci|--THEN WE RETURN SGN*COS(R). SGN*COS(R) IS COMPUTED BY 2888c2ecf20Sopenharmony_ci|--SGN + S'*(B1 + S(B2 + S(B3 + S(B4 + ... + SB8)))), WHERE 2898c2ecf20Sopenharmony_ci|--S=R*R AND S'=SGN*S. THIS CAN BE REWRITTEN AS 2908c2ecf20Sopenharmony_ci|--SGN + S'*([B1+T(B3+T(B5+TB7))] + [S(B2+T(B4+T(B6+TB8)))]) 2918c2ecf20Sopenharmony_ci|--WHERE T=S*S. 2928c2ecf20Sopenharmony_ci|--NOTE THAT B4 THROUGH B8 ARE STORED IN DOUBLE PRECISION 2938c2ecf20Sopenharmony_ci|--WHILE B2 AND B3 ARE IN DOUBLE-EXTENDED FORMAT, B1 IS -1/2 2948c2ecf20Sopenharmony_ci|--AND IS THEREFORE STORED AS SINGLE PRECISION. 2958c2ecf20Sopenharmony_ci 2968c2ecf20Sopenharmony_ci fmulx %fp0,%fp0 | ...FP0 IS S 2978c2ecf20Sopenharmony_ci|---HIDE THE NEXT TWO WHILE WAITING FOR FP0 2988c2ecf20Sopenharmony_ci fmoved COSB8,%fp2 2998c2ecf20Sopenharmony_ci fmoved COSB7,%fp3 3008c2ecf20Sopenharmony_ci|--FP0 IS NOW READY 3018c2ecf20Sopenharmony_ci fmovex %fp0,%fp1 3028c2ecf20Sopenharmony_ci fmulx %fp1,%fp1 | ...FP1 IS T 3038c2ecf20Sopenharmony_ci|--HIDE THE NEXT TWO WHILE WAITING FOR FP1 3048c2ecf20Sopenharmony_ci fmovex %fp0,X(%a6) | ...X IS S 3058c2ecf20Sopenharmony_ci rorl #1,%d0 3068c2ecf20Sopenharmony_ci andil #0x80000000,%d0 3078c2ecf20Sopenharmony_ci| ...LEAST SIG. BIT OF D0 IN SIGN POSITION 3088c2ecf20Sopenharmony_ci 3098c2ecf20Sopenharmony_ci fmulx %fp1,%fp2 | ...TB8 3108c2ecf20Sopenharmony_ci|--HIDE THE NEXT TWO WHILE WAITING FOR THE XU 3118c2ecf20Sopenharmony_ci eorl %d0,X(%a6) | ...X IS NOW S'= SGN*S 3128c2ecf20Sopenharmony_ci andil #0x80000000,%d0 3138c2ecf20Sopenharmony_ci 3148c2ecf20Sopenharmony_ci fmulx %fp1,%fp3 | ...TB7 3158c2ecf20Sopenharmony_ci|--HIDE THE NEXT TWO WHILE WAITING FOR THE XU 3168c2ecf20Sopenharmony_ci oril #0x3F800000,%d0 | ...D0 IS SGN IN SINGLE 3178c2ecf20Sopenharmony_ci movel %d0,POSNEG1(%a6) 3188c2ecf20Sopenharmony_ci 3198c2ecf20Sopenharmony_ci faddd COSB6,%fp2 | ...B6+TB8 3208c2ecf20Sopenharmony_ci faddd COSB5,%fp3 | ...B5+TB7 3218c2ecf20Sopenharmony_ci 3228c2ecf20Sopenharmony_ci fmulx %fp1,%fp2 | ...T(B6+TB8) 3238c2ecf20Sopenharmony_ci fmulx %fp1,%fp3 | ...T(B5+TB7) 3248c2ecf20Sopenharmony_ci 3258c2ecf20Sopenharmony_ci faddd COSB4,%fp2 | ...B4+T(B6+TB8) 3268c2ecf20Sopenharmony_ci faddx COSB3,%fp3 | ...B3+T(B5+TB7) 3278c2ecf20Sopenharmony_ci 3288c2ecf20Sopenharmony_ci fmulx %fp1,%fp2 | ...T(B4+T(B6+TB8)) 3298c2ecf20Sopenharmony_ci fmulx %fp3,%fp1 | ...T(B3+T(B5+TB7)) 3308c2ecf20Sopenharmony_ci 3318c2ecf20Sopenharmony_ci faddx COSB2,%fp2 | ...B2+T(B4+T(B6+TB8)) 3328c2ecf20Sopenharmony_ci fadds COSB1,%fp1 | ...B1+T(B3+T(B5+TB7)) 3338c2ecf20Sopenharmony_ci 3348c2ecf20Sopenharmony_ci fmulx %fp2,%fp0 | ...S(B2+T(B4+T(B6+TB8))) 3358c2ecf20Sopenharmony_ci|--FP3 RELEASED, RESTORE NOW AND TAKE SOME ADVANTAGE OF HIDING 3368c2ecf20Sopenharmony_ci|--FP2 RELEASED. 3378c2ecf20Sopenharmony_ci 3388c2ecf20Sopenharmony_ci 3398c2ecf20Sopenharmony_ci faddx %fp1,%fp0 3408c2ecf20Sopenharmony_ci|--FP1 RELEASED 3418c2ecf20Sopenharmony_ci 3428c2ecf20Sopenharmony_ci fmulx X(%a6),%fp0 3438c2ecf20Sopenharmony_ci 3448c2ecf20Sopenharmony_ci fmovel %d1,%FPCR |restore users exceptions 3458c2ecf20Sopenharmony_ci fadds POSNEG1(%a6),%fp0 |last inst - possible exception set 3468c2ecf20Sopenharmony_ci bra t_frcinx 3478c2ecf20Sopenharmony_ci 3488c2ecf20Sopenharmony_ci 3498c2ecf20Sopenharmony_ciSINBORS: 3508c2ecf20Sopenharmony_ci|--IF |X| > 15PI, WE USE THE GENERAL ARGUMENT REDUCTION. 3518c2ecf20Sopenharmony_ci|--IF |X| < 2**(-40), RETURN X OR 1. 3528c2ecf20Sopenharmony_ci cmpil #0x3FFF8000,%d0 3538c2ecf20Sopenharmony_ci bgts REDUCEX 3548c2ecf20Sopenharmony_ci 3558c2ecf20Sopenharmony_ci 3568c2ecf20Sopenharmony_ciSINSM: 3578c2ecf20Sopenharmony_ci movel ADJN(%a6),%d0 3588c2ecf20Sopenharmony_ci cmpil #0,%d0 3598c2ecf20Sopenharmony_ci bgts COSTINY 3608c2ecf20Sopenharmony_ci 3618c2ecf20Sopenharmony_ciSINTINY: 3628c2ecf20Sopenharmony_ci movew #0x0000,XDCARE(%a6) | ...JUST IN CASE 3638c2ecf20Sopenharmony_ci fmovel %d1,%FPCR |restore users exceptions 3648c2ecf20Sopenharmony_ci fmovex X(%a6),%fp0 |last inst - possible exception set 3658c2ecf20Sopenharmony_ci bra t_frcinx 3668c2ecf20Sopenharmony_ci 3678c2ecf20Sopenharmony_ci 3688c2ecf20Sopenharmony_ciCOSTINY: 3698c2ecf20Sopenharmony_ci fmoves #0x3F800000,%fp0 3708c2ecf20Sopenharmony_ci 3718c2ecf20Sopenharmony_ci fmovel %d1,%FPCR |restore users exceptions 3728c2ecf20Sopenharmony_ci fsubs #0x00800000,%fp0 |last inst - possible exception set 3738c2ecf20Sopenharmony_ci bra t_frcinx 3748c2ecf20Sopenharmony_ci 3758c2ecf20Sopenharmony_ci 3768c2ecf20Sopenharmony_ciREDUCEX: 3778c2ecf20Sopenharmony_ci|--WHEN REDUCEX IS USED, THE CODE WILL INEVITABLY BE SLOW. 3788c2ecf20Sopenharmony_ci|--THIS REDUCTION METHOD, HOWEVER, IS MUCH FASTER THAN USING 3798c2ecf20Sopenharmony_ci|--THE REMAINDER INSTRUCTION WHICH IS NOW IN SOFTWARE. 3808c2ecf20Sopenharmony_ci 3818c2ecf20Sopenharmony_ci fmovemx %fp2-%fp5,-(%a7) | ...save FP2 through FP5 3828c2ecf20Sopenharmony_ci movel %d2,-(%a7) 3838c2ecf20Sopenharmony_ci fmoves #0x00000000,%fp1 3848c2ecf20Sopenharmony_ci|--If compact form of abs(arg) in d0=$7ffeffff, argument is so large that 3858c2ecf20Sopenharmony_ci|--there is a danger of unwanted overflow in first LOOP iteration. In this 3868c2ecf20Sopenharmony_ci|--case, reduce argument by one remainder step to make subsequent reduction 3878c2ecf20Sopenharmony_ci|--safe. 3888c2ecf20Sopenharmony_ci cmpil #0x7ffeffff,%d0 |is argument dangerously large? 3898c2ecf20Sopenharmony_ci bnes LOOP 3908c2ecf20Sopenharmony_ci movel #0x7ffe0000,FP_SCR2(%a6) |yes 3918c2ecf20Sopenharmony_ci| ;create 2**16383*PI/2 3928c2ecf20Sopenharmony_ci movel #0xc90fdaa2,FP_SCR2+4(%a6) 3938c2ecf20Sopenharmony_ci clrl FP_SCR2+8(%a6) 3948c2ecf20Sopenharmony_ci ftstx %fp0 |test sign of argument 3958c2ecf20Sopenharmony_ci movel #0x7fdc0000,FP_SCR3(%a6) |create low half of 2**16383* 3968c2ecf20Sopenharmony_ci| ;PI/2 at FP_SCR3 3978c2ecf20Sopenharmony_ci movel #0x85a308d3,FP_SCR3+4(%a6) 3988c2ecf20Sopenharmony_ci clrl FP_SCR3+8(%a6) 3998c2ecf20Sopenharmony_ci fblt red_neg 4008c2ecf20Sopenharmony_ci orw #0x8000,FP_SCR2(%a6) |positive arg 4018c2ecf20Sopenharmony_ci orw #0x8000,FP_SCR3(%a6) 4028c2ecf20Sopenharmony_cired_neg: 4038c2ecf20Sopenharmony_ci faddx FP_SCR2(%a6),%fp0 |high part of reduction is exact 4048c2ecf20Sopenharmony_ci fmovex %fp0,%fp1 |save high result in fp1 4058c2ecf20Sopenharmony_ci faddx FP_SCR3(%a6),%fp0 |low part of reduction 4068c2ecf20Sopenharmony_ci fsubx %fp0,%fp1 |determine low component of result 4078c2ecf20Sopenharmony_ci faddx FP_SCR3(%a6),%fp1 |fp0/fp1 are reduced argument. 4088c2ecf20Sopenharmony_ci 4098c2ecf20Sopenharmony_ci|--ON ENTRY, FP0 IS X, ON RETURN, FP0 IS X REM PI/2, |X| <= PI/4. 4108c2ecf20Sopenharmony_ci|--integer quotient will be stored in N 4118c2ecf20Sopenharmony_ci|--Intermediate remainder is 66-bit long; (R,r) in (FP0,FP1) 4128c2ecf20Sopenharmony_ci 4138c2ecf20Sopenharmony_ciLOOP: 4148c2ecf20Sopenharmony_ci fmovex %fp0,INARG(%a6) | ...+-2**K * F, 1 <= F < 2 4158c2ecf20Sopenharmony_ci movew INARG(%a6),%d0 4168c2ecf20Sopenharmony_ci movel %d0,%a1 | ...save a copy of D0 4178c2ecf20Sopenharmony_ci andil #0x00007FFF,%d0 4188c2ecf20Sopenharmony_ci subil #0x00003FFF,%d0 | ...D0 IS K 4198c2ecf20Sopenharmony_ci cmpil #28,%d0 4208c2ecf20Sopenharmony_ci bles LASTLOOP 4218c2ecf20Sopenharmony_ciCONTLOOP: 4228c2ecf20Sopenharmony_ci subil #27,%d0 | ...D0 IS L := K-27 4238c2ecf20Sopenharmony_ci movel #0,ENDFLAG(%a6) 4248c2ecf20Sopenharmony_ci bras WORK 4258c2ecf20Sopenharmony_ciLASTLOOP: 4268c2ecf20Sopenharmony_ci clrl %d0 | ...D0 IS L := 0 4278c2ecf20Sopenharmony_ci movel #1,ENDFLAG(%a6) 4288c2ecf20Sopenharmony_ci 4298c2ecf20Sopenharmony_ciWORK: 4308c2ecf20Sopenharmony_ci|--FIND THE REMAINDER OF (R,r) W.R.T. 2**L * (PI/2). L IS SO CHOSEN 4318c2ecf20Sopenharmony_ci|--THAT INT( X * (2/PI) / 2**(L) ) < 2**29. 4328c2ecf20Sopenharmony_ci 4338c2ecf20Sopenharmony_ci|--CREATE 2**(-L) * (2/PI), SIGN(INARG)*2**(63), 4348c2ecf20Sopenharmony_ci|--2**L * (PIby2_1), 2**L * (PIby2_2) 4358c2ecf20Sopenharmony_ci 4368c2ecf20Sopenharmony_ci movel #0x00003FFE,%d2 | ...BIASED EXPO OF 2/PI 4378c2ecf20Sopenharmony_ci subl %d0,%d2 | ...BIASED EXPO OF 2**(-L)*(2/PI) 4388c2ecf20Sopenharmony_ci 4398c2ecf20Sopenharmony_ci movel #0xA2F9836E,FP_SCR1+4(%a6) 4408c2ecf20Sopenharmony_ci movel #0x4E44152A,FP_SCR1+8(%a6) 4418c2ecf20Sopenharmony_ci movew %d2,FP_SCR1(%a6) | ...FP_SCR1 is 2**(-L)*(2/PI) 4428c2ecf20Sopenharmony_ci 4438c2ecf20Sopenharmony_ci fmovex %fp0,%fp2 4448c2ecf20Sopenharmony_ci fmulx FP_SCR1(%a6),%fp2 4458c2ecf20Sopenharmony_ci|--WE MUST NOW FIND INT(FP2). SINCE WE NEED THIS VALUE IN 4468c2ecf20Sopenharmony_ci|--FLOATING POINT FORMAT, THE TWO FMOVE'S FMOVE.L FP <--> N 4478c2ecf20Sopenharmony_ci|--WILL BE TOO INEFFICIENT. THE WAY AROUND IT IS THAT 4488c2ecf20Sopenharmony_ci|--(SIGN(INARG)*2**63 + FP2) - SIGN(INARG)*2**63 WILL GIVE 4498c2ecf20Sopenharmony_ci|--US THE DESIRED VALUE IN FLOATING POINT. 4508c2ecf20Sopenharmony_ci 4518c2ecf20Sopenharmony_ci|--HIDE SIX CYCLES OF INSTRUCTION 4528c2ecf20Sopenharmony_ci movel %a1,%d2 4538c2ecf20Sopenharmony_ci swap %d2 4548c2ecf20Sopenharmony_ci andil #0x80000000,%d2 4558c2ecf20Sopenharmony_ci oril #0x5F000000,%d2 | ...D2 IS SIGN(INARG)*2**63 IN SGL 4568c2ecf20Sopenharmony_ci movel %d2,TWOTO63(%a6) 4578c2ecf20Sopenharmony_ci 4588c2ecf20Sopenharmony_ci movel %d0,%d2 4598c2ecf20Sopenharmony_ci addil #0x00003FFF,%d2 | ...BIASED EXPO OF 2**L * (PI/2) 4608c2ecf20Sopenharmony_ci 4618c2ecf20Sopenharmony_ci|--FP2 IS READY 4628c2ecf20Sopenharmony_ci fadds TWOTO63(%a6),%fp2 | ...THE FRACTIONAL PART OF FP1 IS ROUNDED 4638c2ecf20Sopenharmony_ci 4648c2ecf20Sopenharmony_ci|--HIDE 4 CYCLES OF INSTRUCTION; creating 2**(L)*Piby2_1 and 2**(L)*Piby2_2 4658c2ecf20Sopenharmony_ci movew %d2,FP_SCR2(%a6) 4668c2ecf20Sopenharmony_ci clrw FP_SCR2+2(%a6) 4678c2ecf20Sopenharmony_ci movel #0xC90FDAA2,FP_SCR2+4(%a6) 4688c2ecf20Sopenharmony_ci clrl FP_SCR2+8(%a6) | ...FP_SCR2 is 2**(L) * Piby2_1 4698c2ecf20Sopenharmony_ci 4708c2ecf20Sopenharmony_ci|--FP2 IS READY 4718c2ecf20Sopenharmony_ci fsubs TWOTO63(%a6),%fp2 | ...FP2 is N 4728c2ecf20Sopenharmony_ci 4738c2ecf20Sopenharmony_ci addil #0x00003FDD,%d0 4748c2ecf20Sopenharmony_ci movew %d0,FP_SCR3(%a6) 4758c2ecf20Sopenharmony_ci clrw FP_SCR3+2(%a6) 4768c2ecf20Sopenharmony_ci movel #0x85A308D3,FP_SCR3+4(%a6) 4778c2ecf20Sopenharmony_ci clrl FP_SCR3+8(%a6) | ...FP_SCR3 is 2**(L) * Piby2_2 4788c2ecf20Sopenharmony_ci 4798c2ecf20Sopenharmony_ci movel ENDFLAG(%a6),%d0 4808c2ecf20Sopenharmony_ci 4818c2ecf20Sopenharmony_ci|--We are now ready to perform (R+r) - N*P1 - N*P2, P1 = 2**(L) * Piby2_1 and 4828c2ecf20Sopenharmony_ci|--P2 = 2**(L) * Piby2_2 4838c2ecf20Sopenharmony_ci fmovex %fp2,%fp4 4848c2ecf20Sopenharmony_ci fmulx FP_SCR2(%a6),%fp4 | ...W = N*P1 4858c2ecf20Sopenharmony_ci fmovex %fp2,%fp5 4868c2ecf20Sopenharmony_ci fmulx FP_SCR3(%a6),%fp5 | ...w = N*P2 4878c2ecf20Sopenharmony_ci fmovex %fp4,%fp3 4888c2ecf20Sopenharmony_ci|--we want P+p = W+w but |p| <= half ulp of P 4898c2ecf20Sopenharmony_ci|--Then, we need to compute A := R-P and a := r-p 4908c2ecf20Sopenharmony_ci faddx %fp5,%fp3 | ...FP3 is P 4918c2ecf20Sopenharmony_ci fsubx %fp3,%fp4 | ...W-P 4928c2ecf20Sopenharmony_ci 4938c2ecf20Sopenharmony_ci fsubx %fp3,%fp0 | ...FP0 is A := R - P 4948c2ecf20Sopenharmony_ci faddx %fp5,%fp4 | ...FP4 is p = (W-P)+w 4958c2ecf20Sopenharmony_ci 4968c2ecf20Sopenharmony_ci fmovex %fp0,%fp3 | ...FP3 A 4978c2ecf20Sopenharmony_ci fsubx %fp4,%fp1 | ...FP1 is a := r - p 4988c2ecf20Sopenharmony_ci 4998c2ecf20Sopenharmony_ci|--Now we need to normalize (A,a) to "new (R,r)" where R+r = A+a but 5008c2ecf20Sopenharmony_ci|--|r| <= half ulp of R. 5018c2ecf20Sopenharmony_ci faddx %fp1,%fp0 | ...FP0 is R := A+a 5028c2ecf20Sopenharmony_ci|--No need to calculate r if this is the last loop 5038c2ecf20Sopenharmony_ci cmpil #0,%d0 5048c2ecf20Sopenharmony_ci bgt RESTORE 5058c2ecf20Sopenharmony_ci 5068c2ecf20Sopenharmony_ci|--Need to calculate r 5078c2ecf20Sopenharmony_ci fsubx %fp0,%fp3 | ...A-R 5088c2ecf20Sopenharmony_ci faddx %fp3,%fp1 | ...FP1 is r := (A-R)+a 5098c2ecf20Sopenharmony_ci bra LOOP 5108c2ecf20Sopenharmony_ci 5118c2ecf20Sopenharmony_ciRESTORE: 5128c2ecf20Sopenharmony_ci fmovel %fp2,N(%a6) 5138c2ecf20Sopenharmony_ci movel (%a7)+,%d2 5148c2ecf20Sopenharmony_ci fmovemx (%a7)+,%fp2-%fp5 5158c2ecf20Sopenharmony_ci 5168c2ecf20Sopenharmony_ci 5178c2ecf20Sopenharmony_ci movel ADJN(%a6),%d0 5188c2ecf20Sopenharmony_ci cmpil #4,%d0 5198c2ecf20Sopenharmony_ci 5208c2ecf20Sopenharmony_ci blt SINCONT 5218c2ecf20Sopenharmony_ci bras SCCONT 5228c2ecf20Sopenharmony_ci 5238c2ecf20Sopenharmony_ci .global ssincosd 5248c2ecf20Sopenharmony_cissincosd: 5258c2ecf20Sopenharmony_ci|--SIN AND COS OF X FOR DENORMALIZED X 5268c2ecf20Sopenharmony_ci 5278c2ecf20Sopenharmony_ci fmoves #0x3F800000,%fp1 5288c2ecf20Sopenharmony_ci bsr sto_cos |store cosine result 5298c2ecf20Sopenharmony_ci bra t_extdnrm 5308c2ecf20Sopenharmony_ci 5318c2ecf20Sopenharmony_ci .global ssincos 5328c2ecf20Sopenharmony_cissincos: 5338c2ecf20Sopenharmony_ci|--SET ADJN TO 4 5348c2ecf20Sopenharmony_ci movel #4,ADJN(%a6) 5358c2ecf20Sopenharmony_ci 5368c2ecf20Sopenharmony_ci fmovex (%a0),%fp0 | ...LOAD INPUT 5378c2ecf20Sopenharmony_ci 5388c2ecf20Sopenharmony_ci movel (%a0),%d0 5398c2ecf20Sopenharmony_ci movew 4(%a0),%d0 5408c2ecf20Sopenharmony_ci fmovex %fp0,X(%a6) 5418c2ecf20Sopenharmony_ci andil #0x7FFFFFFF,%d0 | ...COMPACTIFY X 5428c2ecf20Sopenharmony_ci 5438c2ecf20Sopenharmony_ci cmpil #0x3FD78000,%d0 | ...|X| >= 2**(-40)? 5448c2ecf20Sopenharmony_ci bges SCOK1 5458c2ecf20Sopenharmony_ci bra SCSM 5468c2ecf20Sopenharmony_ci 5478c2ecf20Sopenharmony_ciSCOK1: 5488c2ecf20Sopenharmony_ci cmpil #0x4004BC7E,%d0 | ...|X| < 15 PI? 5498c2ecf20Sopenharmony_ci blts SCMAIN 5508c2ecf20Sopenharmony_ci bra REDUCEX 5518c2ecf20Sopenharmony_ci 5528c2ecf20Sopenharmony_ci 5538c2ecf20Sopenharmony_ciSCMAIN: 5548c2ecf20Sopenharmony_ci|--THIS IS THE USUAL CASE, |X| <= 15 PI. 5558c2ecf20Sopenharmony_ci|--THE ARGUMENT REDUCTION IS DONE BY TABLE LOOK UP. 5568c2ecf20Sopenharmony_ci fmovex %fp0,%fp1 5578c2ecf20Sopenharmony_ci fmuld TWOBYPI,%fp1 | ...X*2/PI 5588c2ecf20Sopenharmony_ci 5598c2ecf20Sopenharmony_ci|--HIDE THE NEXT THREE INSTRUCTIONS 5608c2ecf20Sopenharmony_ci lea PITBL+0x200,%a1 | ...TABLE OF N*PI/2, N = -32,...,32 5618c2ecf20Sopenharmony_ci 5628c2ecf20Sopenharmony_ci 5638c2ecf20Sopenharmony_ci|--FP1 IS NOW READY 5648c2ecf20Sopenharmony_ci fmovel %fp1,N(%a6) | ...CONVERT TO INTEGER 5658c2ecf20Sopenharmony_ci 5668c2ecf20Sopenharmony_ci movel N(%a6),%d0 5678c2ecf20Sopenharmony_ci asll #4,%d0 5688c2ecf20Sopenharmony_ci addal %d0,%a1 | ...ADDRESS OF N*PIBY2, IN Y1, Y2 5698c2ecf20Sopenharmony_ci 5708c2ecf20Sopenharmony_ci fsubx (%a1)+,%fp0 | ...X-Y1 5718c2ecf20Sopenharmony_ci fsubs (%a1),%fp0 | ...FP0 IS R = (X-Y1)-Y2 5728c2ecf20Sopenharmony_ci 5738c2ecf20Sopenharmony_ciSCCONT: 5748c2ecf20Sopenharmony_ci|--continuation point from REDUCEX 5758c2ecf20Sopenharmony_ci 5768c2ecf20Sopenharmony_ci|--HIDE THE NEXT TWO 5778c2ecf20Sopenharmony_ci movel N(%a6),%d0 5788c2ecf20Sopenharmony_ci rorl #1,%d0 5798c2ecf20Sopenharmony_ci 5808c2ecf20Sopenharmony_ci cmpil #0,%d0 | ...D0 < 0 IFF N IS ODD 5818c2ecf20Sopenharmony_ci bge NEVEN 5828c2ecf20Sopenharmony_ci 5838c2ecf20Sopenharmony_ciNODD: 5848c2ecf20Sopenharmony_ci|--REGISTERS SAVED SO FAR: D0, A0, FP2. 5858c2ecf20Sopenharmony_ci 5868c2ecf20Sopenharmony_ci fmovex %fp0,RPRIME(%a6) 5878c2ecf20Sopenharmony_ci fmulx %fp0,%fp0 | ...FP0 IS S = R*R 5888c2ecf20Sopenharmony_ci fmoved SINA7,%fp1 | ...A7 5898c2ecf20Sopenharmony_ci fmoved COSB8,%fp2 | ...B8 5908c2ecf20Sopenharmony_ci fmulx %fp0,%fp1 | ...SA7 5918c2ecf20Sopenharmony_ci movel %d2,-(%a7) 5928c2ecf20Sopenharmony_ci movel %d0,%d2 5938c2ecf20Sopenharmony_ci fmulx %fp0,%fp2 | ...SB8 5948c2ecf20Sopenharmony_ci rorl #1,%d2 5958c2ecf20Sopenharmony_ci andil #0x80000000,%d2 5968c2ecf20Sopenharmony_ci 5978c2ecf20Sopenharmony_ci faddd SINA6,%fp1 | ...A6+SA7 5988c2ecf20Sopenharmony_ci eorl %d0,%d2 5998c2ecf20Sopenharmony_ci andil #0x80000000,%d2 6008c2ecf20Sopenharmony_ci faddd COSB7,%fp2 | ...B7+SB8 6018c2ecf20Sopenharmony_ci 6028c2ecf20Sopenharmony_ci fmulx %fp0,%fp1 | ...S(A6+SA7) 6038c2ecf20Sopenharmony_ci eorl %d2,RPRIME(%a6) 6048c2ecf20Sopenharmony_ci movel (%a7)+,%d2 6058c2ecf20Sopenharmony_ci fmulx %fp0,%fp2 | ...S(B7+SB8) 6068c2ecf20Sopenharmony_ci rorl #1,%d0 6078c2ecf20Sopenharmony_ci andil #0x80000000,%d0 6088c2ecf20Sopenharmony_ci 6098c2ecf20Sopenharmony_ci faddd SINA5,%fp1 | ...A5+S(A6+SA7) 6108c2ecf20Sopenharmony_ci movel #0x3F800000,POSNEG1(%a6) 6118c2ecf20Sopenharmony_ci eorl %d0,POSNEG1(%a6) 6128c2ecf20Sopenharmony_ci faddd COSB6,%fp2 | ...B6+S(B7+SB8) 6138c2ecf20Sopenharmony_ci 6148c2ecf20Sopenharmony_ci fmulx %fp0,%fp1 | ...S(A5+S(A6+SA7)) 6158c2ecf20Sopenharmony_ci fmulx %fp0,%fp2 | ...S(B6+S(B7+SB8)) 6168c2ecf20Sopenharmony_ci fmovex %fp0,SPRIME(%a6) 6178c2ecf20Sopenharmony_ci 6188c2ecf20Sopenharmony_ci faddd SINA4,%fp1 | ...A4+S(A5+S(A6+SA7)) 6198c2ecf20Sopenharmony_ci eorl %d0,SPRIME(%a6) 6208c2ecf20Sopenharmony_ci faddd COSB5,%fp2 | ...B5+S(B6+S(B7+SB8)) 6218c2ecf20Sopenharmony_ci 6228c2ecf20Sopenharmony_ci fmulx %fp0,%fp1 | ...S(A4+...) 6238c2ecf20Sopenharmony_ci fmulx %fp0,%fp2 | ...S(B5+...) 6248c2ecf20Sopenharmony_ci 6258c2ecf20Sopenharmony_ci faddd SINA3,%fp1 | ...A3+S(A4+...) 6268c2ecf20Sopenharmony_ci faddd COSB4,%fp2 | ...B4+S(B5+...) 6278c2ecf20Sopenharmony_ci 6288c2ecf20Sopenharmony_ci fmulx %fp0,%fp1 | ...S(A3+...) 6298c2ecf20Sopenharmony_ci fmulx %fp0,%fp2 | ...S(B4+...) 6308c2ecf20Sopenharmony_ci 6318c2ecf20Sopenharmony_ci faddx SINA2,%fp1 | ...A2+S(A3+...) 6328c2ecf20Sopenharmony_ci faddx COSB3,%fp2 | ...B3+S(B4+...) 6338c2ecf20Sopenharmony_ci 6348c2ecf20Sopenharmony_ci fmulx %fp0,%fp1 | ...S(A2+...) 6358c2ecf20Sopenharmony_ci fmulx %fp0,%fp2 | ...S(B3+...) 6368c2ecf20Sopenharmony_ci 6378c2ecf20Sopenharmony_ci faddx SINA1,%fp1 | ...A1+S(A2+...) 6388c2ecf20Sopenharmony_ci faddx COSB2,%fp2 | ...B2+S(B3+...) 6398c2ecf20Sopenharmony_ci 6408c2ecf20Sopenharmony_ci fmulx %fp0,%fp1 | ...S(A1+...) 6418c2ecf20Sopenharmony_ci fmulx %fp2,%fp0 | ...S(B2+...) 6428c2ecf20Sopenharmony_ci 6438c2ecf20Sopenharmony_ci 6448c2ecf20Sopenharmony_ci 6458c2ecf20Sopenharmony_ci fmulx RPRIME(%a6),%fp1 | ...R'S(A1+...) 6468c2ecf20Sopenharmony_ci fadds COSB1,%fp0 | ...B1+S(B2...) 6478c2ecf20Sopenharmony_ci fmulx SPRIME(%a6),%fp0 | ...S'(B1+S(B2+...)) 6488c2ecf20Sopenharmony_ci 6498c2ecf20Sopenharmony_ci movel %d1,-(%sp) |restore users mode & precision 6508c2ecf20Sopenharmony_ci andil #0xff,%d1 |mask off all exceptions 6518c2ecf20Sopenharmony_ci fmovel %d1,%FPCR 6528c2ecf20Sopenharmony_ci faddx RPRIME(%a6),%fp1 | ...COS(X) 6538c2ecf20Sopenharmony_ci bsr sto_cos |store cosine result 6548c2ecf20Sopenharmony_ci fmovel (%sp)+,%FPCR |restore users exceptions 6558c2ecf20Sopenharmony_ci fadds POSNEG1(%a6),%fp0 | ...SIN(X) 6568c2ecf20Sopenharmony_ci 6578c2ecf20Sopenharmony_ci bra t_frcinx 6588c2ecf20Sopenharmony_ci 6598c2ecf20Sopenharmony_ci 6608c2ecf20Sopenharmony_ciNEVEN: 6618c2ecf20Sopenharmony_ci|--REGISTERS SAVED SO FAR: FP2. 6628c2ecf20Sopenharmony_ci 6638c2ecf20Sopenharmony_ci fmovex %fp0,RPRIME(%a6) 6648c2ecf20Sopenharmony_ci fmulx %fp0,%fp0 | ...FP0 IS S = R*R 6658c2ecf20Sopenharmony_ci fmoved COSB8,%fp1 | ...B8 6668c2ecf20Sopenharmony_ci fmoved SINA7,%fp2 | ...A7 6678c2ecf20Sopenharmony_ci fmulx %fp0,%fp1 | ...SB8 6688c2ecf20Sopenharmony_ci fmovex %fp0,SPRIME(%a6) 6698c2ecf20Sopenharmony_ci fmulx %fp0,%fp2 | ...SA7 6708c2ecf20Sopenharmony_ci rorl #1,%d0 6718c2ecf20Sopenharmony_ci andil #0x80000000,%d0 6728c2ecf20Sopenharmony_ci faddd COSB7,%fp1 | ...B7+SB8 6738c2ecf20Sopenharmony_ci faddd SINA6,%fp2 | ...A6+SA7 6748c2ecf20Sopenharmony_ci eorl %d0,RPRIME(%a6) 6758c2ecf20Sopenharmony_ci eorl %d0,SPRIME(%a6) 6768c2ecf20Sopenharmony_ci fmulx %fp0,%fp1 | ...S(B7+SB8) 6778c2ecf20Sopenharmony_ci oril #0x3F800000,%d0 6788c2ecf20Sopenharmony_ci movel %d0,POSNEG1(%a6) 6798c2ecf20Sopenharmony_ci fmulx %fp0,%fp2 | ...S(A6+SA7) 6808c2ecf20Sopenharmony_ci 6818c2ecf20Sopenharmony_ci faddd COSB6,%fp1 | ...B6+S(B7+SB8) 6828c2ecf20Sopenharmony_ci faddd SINA5,%fp2 | ...A5+S(A6+SA7) 6838c2ecf20Sopenharmony_ci 6848c2ecf20Sopenharmony_ci fmulx %fp0,%fp1 | ...S(B6+S(B7+SB8)) 6858c2ecf20Sopenharmony_ci fmulx %fp0,%fp2 | ...S(A5+S(A6+SA7)) 6868c2ecf20Sopenharmony_ci 6878c2ecf20Sopenharmony_ci faddd COSB5,%fp1 | ...B5+S(B6+S(B7+SB8)) 6888c2ecf20Sopenharmony_ci faddd SINA4,%fp2 | ...A4+S(A5+S(A6+SA7)) 6898c2ecf20Sopenharmony_ci 6908c2ecf20Sopenharmony_ci fmulx %fp0,%fp1 | ...S(B5+...) 6918c2ecf20Sopenharmony_ci fmulx %fp0,%fp2 | ...S(A4+...) 6928c2ecf20Sopenharmony_ci 6938c2ecf20Sopenharmony_ci faddd COSB4,%fp1 | ...B4+S(B5+...) 6948c2ecf20Sopenharmony_ci faddd SINA3,%fp2 | ...A3+S(A4+...) 6958c2ecf20Sopenharmony_ci 6968c2ecf20Sopenharmony_ci fmulx %fp0,%fp1 | ...S(B4+...) 6978c2ecf20Sopenharmony_ci fmulx %fp0,%fp2 | ...S(A3+...) 6988c2ecf20Sopenharmony_ci 6998c2ecf20Sopenharmony_ci faddx COSB3,%fp1 | ...B3+S(B4+...) 7008c2ecf20Sopenharmony_ci faddx SINA2,%fp2 | ...A2+S(A3+...) 7018c2ecf20Sopenharmony_ci 7028c2ecf20Sopenharmony_ci fmulx %fp0,%fp1 | ...S(B3+...) 7038c2ecf20Sopenharmony_ci fmulx %fp0,%fp2 | ...S(A2+...) 7048c2ecf20Sopenharmony_ci 7058c2ecf20Sopenharmony_ci faddx COSB2,%fp1 | ...B2+S(B3+...) 7068c2ecf20Sopenharmony_ci faddx SINA1,%fp2 | ...A1+S(A2+...) 7078c2ecf20Sopenharmony_ci 7088c2ecf20Sopenharmony_ci fmulx %fp0,%fp1 | ...S(B2+...) 7098c2ecf20Sopenharmony_ci fmulx %fp2,%fp0 | ...s(a1+...) 7108c2ecf20Sopenharmony_ci 7118c2ecf20Sopenharmony_ci 7128c2ecf20Sopenharmony_ci 7138c2ecf20Sopenharmony_ci fadds COSB1,%fp1 | ...B1+S(B2...) 7148c2ecf20Sopenharmony_ci fmulx RPRIME(%a6),%fp0 | ...R'S(A1+...) 7158c2ecf20Sopenharmony_ci fmulx SPRIME(%a6),%fp1 | ...S'(B1+S(B2+...)) 7168c2ecf20Sopenharmony_ci 7178c2ecf20Sopenharmony_ci movel %d1,-(%sp) |save users mode & precision 7188c2ecf20Sopenharmony_ci andil #0xff,%d1 |mask off all exceptions 7198c2ecf20Sopenharmony_ci fmovel %d1,%FPCR 7208c2ecf20Sopenharmony_ci fadds POSNEG1(%a6),%fp1 | ...COS(X) 7218c2ecf20Sopenharmony_ci bsr sto_cos |store cosine result 7228c2ecf20Sopenharmony_ci fmovel (%sp)+,%FPCR |restore users exceptions 7238c2ecf20Sopenharmony_ci faddx RPRIME(%a6),%fp0 | ...SIN(X) 7248c2ecf20Sopenharmony_ci 7258c2ecf20Sopenharmony_ci bra t_frcinx 7268c2ecf20Sopenharmony_ci 7278c2ecf20Sopenharmony_ciSCBORS: 7288c2ecf20Sopenharmony_ci cmpil #0x3FFF8000,%d0 7298c2ecf20Sopenharmony_ci bgt REDUCEX 7308c2ecf20Sopenharmony_ci 7318c2ecf20Sopenharmony_ci 7328c2ecf20Sopenharmony_ciSCSM: 7338c2ecf20Sopenharmony_ci movew #0x0000,XDCARE(%a6) 7348c2ecf20Sopenharmony_ci fmoves #0x3F800000,%fp1 7358c2ecf20Sopenharmony_ci 7368c2ecf20Sopenharmony_ci movel %d1,-(%sp) |save users mode & precision 7378c2ecf20Sopenharmony_ci andil #0xff,%d1 |mask off all exceptions 7388c2ecf20Sopenharmony_ci fmovel %d1,%FPCR 7398c2ecf20Sopenharmony_ci fsubs #0x00800000,%fp1 7408c2ecf20Sopenharmony_ci bsr sto_cos |store cosine result 7418c2ecf20Sopenharmony_ci fmovel (%sp)+,%FPCR |restore users exceptions 7428c2ecf20Sopenharmony_ci fmovex X(%a6),%fp0 7438c2ecf20Sopenharmony_ci bra t_frcinx 7448c2ecf20Sopenharmony_ci 7458c2ecf20Sopenharmony_ci |end 746