|
|	ssin.sa 3.3 7/29/91
|
|	The entry point sSIN computes the sine of an input argument;
|	sCOS computes the cosine, and sSINCOS computes both. The
|	corresponding entry points with a "d" compute the same
|	function values for denormalized inputs.
|
|	Input: Double-extended number X in location pointed to
|		by address register a0.
|
|	Output: The function value sin(X) or cos(X) returned in Fp0 if SIN or
|		COS is requested. Otherwise, for SINCOS, sin(X) is returned
|		in Fp0, and cos(X) is returned in Fp1.
|
|	Modifies: Fp0 for SIN or COS; both Fp0 and Fp1 for SINCOS.
|
|	Accuracy and Monotonicity: The returned result is within 1 ulp in
|		64 significant bits, i.e. within 0.5001 ulp to 53 bits if the
|		result is subsequently rounded to double precision. The
|		result is provably monotonic in double precision.
|
|	Speed: The programs sSIN and sCOS take approximately 150 cycles for
|		input argument X such that |X| < 15Pi, which is the usual
|		situation. The speed for sSINCOS is approximately 190 cycles.
|
|	Algorithm:
|
|	SIN and COS:
|	1. If SIN is invoked, set AdjN := 0; otherwise, set AdjN := 1.
|
|	2. If |X| >= 15Pi or |X| < 2**(-40), go to 7.
|
|	3. Decompose X as X = N(Pi/2) + r where |r| <= Pi/4. Let
|		k = N mod 4, so in particular, k = 0, 1, 2, or 3. Overwrite
|		k by k := k + AdjN.
|
|	4. If k is even, go to 6.
|
|	5. (k is odd) Set j := (k-1)/2, sgn := (-1)**j. Return sgn*cos(r)
|		where cos(r) is approximated by an even polynomial in r,
|		1 + r*r*(B1+s*(B2+ ... + s*B8)),	s = r*r.
|		Exit.
|
|	6. (k is even) Set j := k/2, sgn := (-1)**j. Return sgn*sin(r)
|		where sin(r) is approximated by an odd polynomial in r
|		r + r*s*(A1+s*(A2+ ... + s*A7)),	s = r*r.
|		Exit.
|
|	7. If |X| > 1, go to 9.
|
|	8. (|X|<2**(-40)) If SIN is invoked, return X; otherwise return 1.
|
|	9. Overwrite X by X := X rem 2Pi. Now that |X| <= Pi, go back to 3.
|
|	SINCOS:
|	1. If |X| >= 15Pi or |X| < 2**(-40), go to 6.
|
|	2. Decompose X as X = N(Pi/2) + r where |r| <= Pi/4. Let
|		k = N mod 4, so in particular, k = 0, 1, 2, or 3.
|
|	3. If k is even, go to 5.
|
|	4. (k is odd) Set j1 := (k-1)/2, j2 := j1 (EOR) (k mod 2), i.e.
|		j1 exclusive or with the l.s.b. of k.
|		sgn1 := (-1)**j1, sgn2 := (-1)**j2.
|		SIN(X) = sgn1 * cos(r) and COS(X) = sgn2*sin(r) where
|		sin(r) and cos(r) are computed as odd and even polynomials
|		in r, respectively. Exit
|
|	5. (k is even) Set j1 := k/2, sgn1 := (-1)**j1.
|		SIN(X) = sgn1 * sin(r) and COS(X) = sgn1*cos(r) where
|		sin(r) and cos(r) are computed as odd and even polynomials
|		in r, respectively. Exit
|
|	6. If |X| > 1, go to 8.
|
|	7. (|X|<2**(-40)) SIN(X) = X and COS(X) = 1. Exit.
|
|	8. Overwrite X by X := X rem 2Pi. Now that |X| <= Pi, go back to 2.
|

|		Copyright (C) Motorola, Inc. 1990
|			All Rights Reserved
|
|	For details on the license for this file, please see the
|	file, README, in this same directory.
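|
|	Worked example of the SIN and COS steps above (illustrative only,
|	not part of the original package; N is taken as the nearest integer
|	to X*2/Pi, and all values are rounded to six decimal places):
|
|	SIN with X = 2.0, so AdjN = 0. N = round(1.273240) = 1 and
|	r = X - N*(Pi/2) = 0.429204. k = 1 is odd, so j = 0, sgn = +1,
|	and the result is cos(0.429204) = 0.909297, which matches sin(2.0).
|
|	COS with X = 2.0, so AdjN = 1. k = 1 + 1 = 2 is even, so j = 1,
|	sgn = -1, and the result is -sin(0.429204) = -0.416147, which
|	matches cos(2.0).
|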

|SSIN	idnt	2,1 | Motorola 040 Floating Point Software Package

	|section	8

#include "fpsp.h"

BOUNDS1:	.long 0x3FD78000,0x4004BC7E
TWOBYPI:	.long 0x3FE45F30,0x6DC9C883

SINA7:	.long 0xBD6AAA77,0xCCC994F5
SINA6:	.long 0x3DE61209,0x7AAE8DA1

SINA5:	.long 0xBE5AE645,0x2A118AE4
SINA4:	.long 0x3EC71DE3,0xA5341531

SINA3:	.long 0xBF2A01A0,0x1A018B59,0x00000000,0x00000000

SINA2:	.long 0x3FF80000,0x88888888,0x888859AF,0x00000000

SINA1:	.long 0xBFFC0000,0xAAAAAAAA,0xAAAAAA99,0x00000000

COSB8:	.long 0x3D2AC4D0,0xD6011EE3
COSB7:	.long 0xBDA9396F,0x9F45AC19

COSB6:	.long 0x3E21EED9,0x0612C972
COSB5:	.long 0xBE927E4F,0xB79D9FCF

COSB4:	.long 0x3EFA01A0,0x1A01D423,0x00000000,0x00000000

COSB3:	.long 0xBFF50000,0xB60B60B6,0x0B61D438,0x00000000

COSB2:	.long 0x3FFA0000,0xAAAAAAAA,0xAAAAAB5E
COSB1:	.long 0xBF000000

INVTWOPI: .long 0x3FFC0000,0xA2F9836E,0x4E44152A

TWOPI1:	.long 0x40010000,0xC90FDAA2,0x00000000,0x00000000
TWOPI2:	.long 0x3FDF0000,0x85A308D4,0x00000000,0x00000000

	|xref	PITBL

	.set	INARG,FP_SCR4

	.set	X,FP_SCR5
	.set	XDCARE,X+2
	.set	XFRAC,X+4

	.set	RPRIME,FP_SCR1
	.set	SPRIME,FP_SCR2

	.set	POSNEG1,L_SCR1
	.set	TWOTO63,L_SCR1

	.set	ENDFLAG,L_SCR2
	.set	N,L_SCR2

	.set	ADJN,L_SCR3

	| xref	t_frcinx
	|xref	t_extdnrm
	|xref	sto_cos

	.global	ssind
ssind:
|--SIN(X) = X FOR DENORMALIZED X
	bra		t_extdnrm

	.global	scosd
scosd:
|--COS(X) = 1 FOR DENORMALIZED X

	fmoves		#0x3F800000,%fp0
|
|	9D25B Fix: Sometimes the previous fmove.s sets fpsr bits
|
	fmovel		#0,%fpsr
|
	bra		t_frcinx

	.global	ssin
ssin:
|--SET ADJN TO 0
	movel		#0,ADJN(%a6)
	bras		SINBGN

	.global	scos
scos:
|--SET ADJN TO 1
	movel		#1,ADJN(%a6)

SINBGN:
|--SAVE FPCR, FP1. CHECK IF |X| IS TOO SMALL OR LARGE

	fmovex		(%a0),%fp0	| ...LOAD INPUT

	movel		(%a0),%d0
	movew		4(%a0),%d0
	fmovex		%fp0,X(%a6)
	andil		#0x7FFFFFFF,%d0		| ...COMPACTIFY X

	cmpil		#0x3FD78000,%d0		| ...|X| >= 2**(-40)?
	bges		SOK1
	bra		SINSM

SOK1:
	cmpil		#0x4004BC7E,%d0		| ...|X| < 15 PI?
	blts		SINMAIN
	bra		REDUCEX

SINMAIN:
|--THIS IS THE USUAL CASE, |X| < 15 PI.
|--THE ARGUMENT REDUCTION IS DONE BY TABLE LOOK UP.
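|--ILLUSTRATION (NOT PART OF THE ORIGINAL SOURCE; INFERRED FROM THE
|--INSTRUCTIONS THAT FOLLOW, SINCE PITBL ITSELF IS DEFINED ELSEWHERE):
|--EACH TABLE ENTRY APPEARS TO OCCUPY 16 BYTES, A 12-BYTE EXTENDED Y1
|--FOLLOWED BY A 4-BYTE SINGLE Y2, BECAUSE N IS SHIFTED LEFT BY 4 AND
|--THE ENTRY IS READ WITH FSUBX (POST-INCREMENT) AND THEN FSUBS.
|--UNDER THAT ASSUMPTION PITBL+0x200 = PITBL + 32*16 IS THE ENTRY FOR
|--N = 0. FOR EXAMPLE, N = 3 GIVES OFFSET 3*16 = 0x30, I.E. PITBL+0x230,
|--AND N = -2 GIVES OFFSET -0x20, I.E. PITBL+0x1E0.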
	fmovex		%fp0,%fp1
	fmuld		TWOBYPI,%fp1	| ...X*2/PI

|--HIDE THE NEXT THREE INSTRUCTIONS
	lea		PITBL+0x200,%a1 | ...TABLE OF N*PI/2, N = -32,...,32


|--FP1 IS NOW READY
	fmovel		%fp1,N(%a6)		| ...CONVERT TO INTEGER

	movel		N(%a6),%d0
	asll		#4,%d0
	addal		%d0,%a1	| ...A1 IS THE ADDRESS OF N*PIBY2
|				...WHICH IS IN TWO PIECES Y1 & Y2

	fsubx		(%a1)+,%fp0	| ...X-Y1
|--HIDE THE NEXT ONE
	fsubs		(%a1),%fp0	| ...FP0 IS R = (X-Y1)-Y2

SINCONT:
|--continuation from REDUCEX

|--GET N+ADJN AND SEE IF SIN(R) OR COS(R) IS NEEDED
	movel		N(%a6),%d0
	addl		ADJN(%a6),%d0	| ...SEE IF D0 IS ODD OR EVEN
	rorl		#1,%d0	| ...D0 WAS ODD IFF D0 IS NEGATIVE
	cmpil		#0,%d0
	blt		COSPOLY

SINPOLY:
|--LET J BE THE LEAST SIG. BIT OF D0, LET SGN := (-1)**J.
|--THEN WE RETURN	SGN*SIN(R). SGN*SIN(R) IS COMPUTED BY
|--R' + R'*S*(A1 + S(A2 + S(A3 + S(A4 + ... + SA7)))), WHERE
|--R' = SGN*R, S=R*R. THIS CAN BE REWRITTEN AS
|--R' + R'*S*( [A1+T(A3+T(A5+TA7))] + [S(A2+T(A4+TA6))])
|--WHERE T=S*S.
|--NOTE THAT A3 THROUGH A7 ARE STORED IN DOUBLE PRECISION
|--WHILE A1 AND A2 ARE IN DOUBLE-EXTENDED FORMAT.
	fmovex		%fp0,X(%a6)	| ...X IS R
	fmulx		%fp0,%fp0	| ...FP0 IS S
|---HIDE THE NEXT TWO WHILE WAITING FOR FP0
	fmoved		SINA7,%fp3
	fmoved		SINA6,%fp2
|--FP0 IS NOW READY
	fmovex		%fp0,%fp1
	fmulx		%fp1,%fp1	| ...FP1 IS T
|--HIDE THE NEXT TWO WHILE WAITING FOR FP1

	rorl		#1,%d0
	andil		#0x80000000,%d0
|				...LEAST SIG. BIT OF D0 IN SIGN POSITION
	eorl		%d0,X(%a6)	| ...X IS NOW R'= SGN*R

	fmulx		%fp1,%fp3	| ...TA7
	fmulx		%fp1,%fp2	| ...TA6

	faddd		SINA5,%fp3	| ...A5+TA7
	faddd		SINA4,%fp2	| ...A4+TA6

	fmulx		%fp1,%fp3	| ...T(A5+TA7)
	fmulx		%fp1,%fp2	| ...T(A4+TA6)

	faddd		SINA3,%fp3	| ...A3+T(A5+TA7)
	faddx		SINA2,%fp2	| ...A2+T(A4+TA6)

	fmulx		%fp3,%fp1	| ...T(A3+T(A5+TA7))

	fmulx		%fp0,%fp2	| ...S(A2+T(A4+TA6))
	faddx		SINA1,%fp1	| ...A1+T(A3+T(A5+TA7))
	fmulx		X(%a6),%fp0	| ...R'*S

	faddx		%fp2,%fp1	| ...[A1+T(A3+T(A5+TA7))]+[S(A2+T(A4+TA6))]
|--FP3 RELEASED, RESTORE NOW AND TAKE SOME ADVANTAGE OF HIDING
|--FP2 RELEASED, RESTORE NOW AND TAKE FULL ADVANTAGE OF HIDING


	fmulx		%fp1,%fp0		| ...SIN(R')-R'
|--FP1 RELEASED.

	fmovel		%d1,%FPCR		|restore users exceptions
	faddx		X(%a6),%fp0		|last inst - possible exception set
	bra		t_frcinx


COSPOLY:
|--LET J BE THE LEAST SIG. BIT OF D0, LET SGN := (-1)**J.
|--THEN WE RETURN	SGN*COS(R). SGN*COS(R) IS COMPUTED BY
|--SGN + S'*(B1 + S(B2 + S(B3 + S(B4 + ... + SB8)))), WHERE
|--S=R*R AND S'=SGN*S. THIS CAN BE REWRITTEN AS
|--SGN + S'*([B1+T(B3+T(B5+TB7))] + [S(B2+T(B4+T(B6+TB8)))])
|--WHERE T=S*S.
|--NOTE THAT B4 THROUGH B8 ARE STORED IN DOUBLE PRECISION
|--WHILE B2 AND B3 ARE IN DOUBLE-EXTENDED FORMAT, B1 IS -1/2
|--AND IS THEREFORE STORED AS SINGLE PRECISION.

	fmulx		%fp0,%fp0	| ...FP0 IS S
|---HIDE THE NEXT TWO WHILE WAITING FOR FP0
	fmoved		COSB8,%fp2
	fmoved		COSB7,%fp3
|--FP0 IS NOW READY
	fmovex		%fp0,%fp1
	fmulx		%fp1,%fp1	| ...FP1 IS T
|--HIDE THE NEXT TWO WHILE WAITING FOR FP1
	fmovex		%fp0,X(%a6)	| ...X IS S
	rorl		#1,%d0
	andil		#0x80000000,%d0
|			...LEAST SIG. BIT OF D0 IN SIGN POSITION

	fmulx		%fp1,%fp2	| ...TB8
|--HIDE THE NEXT TWO WHILE WAITING FOR THE XU
	eorl		%d0,X(%a6)	| ...X IS NOW S'= SGN*S
	andil		#0x80000000,%d0

	fmulx		%fp1,%fp3	| ...TB7
|--HIDE THE NEXT TWO WHILE WAITING FOR THE XU
	oril		#0x3F800000,%d0	| ...D0 IS SGN IN SINGLE
	movel		%d0,POSNEG1(%a6)

	faddd		COSB6,%fp2	| ...B6+TB8
	faddd		COSB5,%fp3	| ...B5+TB7

	fmulx		%fp1,%fp2	| ...T(B6+TB8)
	fmulx		%fp1,%fp3	| ...T(B5+TB7)

	faddd		COSB4,%fp2	| ...B4+T(B6+TB8)
	faddx		COSB3,%fp3	| ...B3+T(B5+TB7)

	fmulx		%fp1,%fp2	| ...T(B4+T(B6+TB8))
	fmulx		%fp3,%fp1	| ...T(B3+T(B5+TB7))

	faddx		COSB2,%fp2	| ...B2+T(B4+T(B6+TB8))
	fadds		COSB1,%fp1	| ...B1+T(B3+T(B5+TB7))

	fmulx		%fp2,%fp0	| ...S(B2+T(B4+T(B6+TB8)))
|--FP3 RELEASED, RESTORE NOW AND TAKE SOME ADVANTAGE OF HIDING
|--FP2 RELEASED.


	faddx		%fp1,%fp0
|--FP1 RELEASED

	fmulx		X(%a6),%fp0

	fmovel		%d1,%FPCR		|restore users exceptions
	fadds		POSNEG1(%a6),%fp0	|last inst - possible exception set
	bra		t_frcinx


SINBORS:
|--IF |X| > 15PI, WE USE THE GENERAL ARGUMENT REDUCTION.
|--IF |X| < 2**(-40), RETURN X OR 1.
	cmpil		#0x3FFF8000,%d0
	bgts		REDUCEX


SINSM:
	movel		ADJN(%a6),%d0
	cmpil		#0,%d0
	bgts		COSTINY

SINTINY:
	movew		#0x0000,XDCARE(%a6)	| ...JUST IN CASE
	fmovel		%d1,%FPCR		|restore users exceptions
	fmovex		X(%a6),%fp0		|last inst - possible exception set
	bra		t_frcinx


COSTINY:
	fmoves		#0x3F800000,%fp0

	fmovel		%d1,%FPCR		|restore users exceptions
	fsubs		#0x00800000,%fp0	|last inst - possible exception set
	bra		t_frcinx


REDUCEX:
|--WHEN REDUCEX IS USED, THE CODE WILL INEVITABLY BE SLOW.
|--THIS REDUCTION METHOD, HOWEVER, IS MUCH FASTER THAN USING
|--THE REMAINDER INSTRUCTION WHICH IS NOW IN SOFTWARE.

	fmovemx	%fp2-%fp5,-(%a7)	| ...save FP2 through FP5
	movel		%d2,-(%a7)
	fmoves		#0x00000000,%fp1
|--If compact form of abs(arg) in d0=$7ffeffff, argument is so large that
|--there is a danger of unwanted overflow in first LOOP iteration. In this
|--case, reduce argument by one remainder step to make subsequent reduction
|--safe.
	cmpil		#0x7ffeffff,%d0		|is argument dangerously large?
	bnes		LOOP
	movel		#0x7ffe0000,FP_SCR2(%a6)	|yes
|					;create 2**16383*PI/2
	movel		#0xc90fdaa2,FP_SCR2+4(%a6)
	clrl		FP_SCR2+8(%a6)
	ftstx		%fp0			|test sign of argument
	movel		#0x7fdc0000,FP_SCR3(%a6)	|create low half of 2**16383*
|					;PI/2 at FP_SCR3
	movel		#0x85a308d3,FP_SCR3+4(%a6)
	clrl		FP_SCR3+8(%a6)
	fblt		red_neg
	orw		#0x8000,FP_SCR2(%a6)	|positive arg
	orw		#0x8000,FP_SCR3(%a6)
red_neg:
	faddx		FP_SCR2(%a6),%fp0		|high part of reduction is exact
	fmovex		%fp0,%fp1		|save high result in fp1
	faddx		FP_SCR3(%a6),%fp0		|low part of reduction
	fsubx		%fp0,%fp1			|determine low component of result
	faddx		FP_SCR3(%a6),%fp1		|fp0/fp1 are reduced argument.

|--ON ENTRY, FP0 IS X, ON RETURN, FP0 IS X REM PI/2, |X| <= PI/4.
|--integer quotient will be stored in N
|--Intermediate remainder is 66 bits long; (R,r) in (FP0,FP1)

LOOP:
	fmovex		%fp0,INARG(%a6)	| ...+-2**K * F, 1 <= F < 2
	movew		INARG(%a6),%d0
	movel		%d0,%a1		| ...save a copy of D0
	andil		#0x00007FFF,%d0
	subil		#0x00003FFF,%d0	| ...D0 IS K
	cmpil		#28,%d0
	bles		LASTLOOP
CONTLOOP:
	subil		#27,%d0		| ...D0 IS L := K-27
	movel		#0,ENDFLAG(%a6)
	bras		WORK
LASTLOOP:
	clrl		%d0		| ...D0 IS L := 0
	movel		#1,ENDFLAG(%a6)

WORK:
|--FIND THE REMAINDER OF (R,r) W.R.T.	2**L * (PI/2). L IS SO CHOSEN
|--THAT	INT( X * (2/PI) / 2**(L) ) < 2**29.

|--CREATE 2**(-L) * (2/PI), SIGN(INARG)*2**(63),
|--2**L * (PIby2_1), 2**L * (PIby2_2)

	movel		#0x00003FFE,%d2	| ...BIASED EXPO OF 2/PI
	subl		%d0,%d2		| ...BIASED EXPO OF 2**(-L)*(2/PI)

	movel		#0xA2F9836E,FP_SCR1+4(%a6)
	movel		#0x4E44152A,FP_SCR1+8(%a6)
	movew		%d2,FP_SCR1(%a6)	| ...FP_SCR1 is 2**(-L)*(2/PI)

	fmovex		%fp0,%fp2
	fmulx		FP_SCR1(%a6),%fp2
|--WE MUST NOW FIND INT(FP2). SINCE WE NEED THIS VALUE IN
|--FLOATING POINT FORMAT, THE TWO FMOVE'S	FMOVE.L FP <--> N
|--WILL BE TOO INEFFICIENT. THE WAY AROUND IT IS THAT
|--(SIGN(INARG)*2**63	+	FP2) - SIGN(INARG)*2**63 WILL GIVE
|--US THE DESIRED VALUE IN FLOATING POINT.
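|--ILLUSTRATION (NOT PART OF THE ORIGINAL SOURCE; ASSUMES ROUND-TO-NEAREST
|--MODE AND EXTENDED ROUNDING PRECISION): A 64-BIT SIGNIFICAND GIVES THE
|--NUMBERS IN [2**63, 2**64) A SPACING OF EXACTLY 1, SO THE ADDITION
|--ROUNDS AWAY THE FRACTION. FOR FP2 = 5.3 AND A POSITIVE INARG,
|--2**63 + 5.3 ROUNDS TO 2**63 + 5, AND SUBTRACTING 2**63 LEAVES 5.0.
|--FOR FP2 = -5.3 AND A NEGATIVE INARG, -2**63 - 5.3 ROUNDS TO
|---(2**63 + 5), AND SUBTRACTING -2**63 LEAVES -5.0.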

|--HIDE SIX CYCLES OF INSTRUCTION
	movel		%a1,%d2
	swap		%d2
	andil		#0x80000000,%d2
	oril		#0x5F000000,%d2	| ...D2 IS SIGN(INARG)*2**63 IN SGL
	movel		%d2,TWOTO63(%a6)

	movel		%d0,%d2
	addil		#0x00003FFF,%d2	| ...BIASED EXPO OF 2**L * (PI/2)

|--FP2 IS READY
	fadds		TWOTO63(%a6),%fp2	| ...THE FRACTIONAL PART OF FP2 IS ROUNDED

|--HIDE 4 CYCLES OF INSTRUCTION; creating 2**(L)*Piby2_1 and 2**(L)*Piby2_2
	movew		%d2,FP_SCR2(%a6)
	clrw		FP_SCR2+2(%a6)
	movel		#0xC90FDAA2,FP_SCR2+4(%a6)
	clrl		FP_SCR2+8(%a6)		| ...FP_SCR2 is 2**(L) * Piby2_1

|--FP2 IS READY
	fsubs		TWOTO63(%a6),%fp2		| ...FP2 is N

	addil		#0x00003FDD,%d0
	movew		%d0,FP_SCR3(%a6)
	clrw		FP_SCR3+2(%a6)
	movel		#0x85A308D3,FP_SCR3+4(%a6)
	clrl		FP_SCR3+8(%a6)		| ...FP_SCR3 is 2**(L) * Piby2_2

	movel		ENDFLAG(%a6),%d0

|--We are now ready to perform (R+r) - N*P1 - N*P2, P1 = 2**(L) * Piby2_1 and
|--P2 = 2**(L) * Piby2_2
	fmovex		%fp2,%fp4
	fmulx		FP_SCR2(%a6),%fp4		| ...W = N*P1
	fmovex		%fp2,%fp5
	fmulx		FP_SCR3(%a6),%fp5		| ...w = N*P2
	fmovex		%fp4,%fp3
|--we want P+p = W+w but |p| <= half ulp of P
|--Then, we need to compute A := R-P and a := r-p
	faddx		%fp5,%fp3			| ...FP3 is P
	fsubx		%fp3,%fp4			| ...W-P

	fsubx		%fp3,%fp0			| ...FP0 is A := R - P
	faddx		%fp5,%fp4			| ...FP4 is p = (W-P)+w

	fmovex		%fp0,%fp3			| ...FP3 is A
	fsubx		%fp4,%fp1			| ...FP1 is a := r - p

|--Now we need to normalize (A,a) to "new (R,r)" where R+r = A+a but
|--|r| <= half ulp of R.
	faddx		%fp1,%fp0			| ...FP0 is R := A+a
|--No need to calculate r if this is the last loop
	cmpil		#0,%d0
	bgt		RESTORE

|--Need to calculate r
	fsubx		%fp0,%fp3			| ...A-R
	faddx		%fp3,%fp1			| ...FP1 is r := (A-R)+a
	bra		LOOP

RESTORE:
	fmovel		%fp2,N(%a6)
	movel		(%a7)+,%d2
	fmovemx	(%a7)+,%fp2-%fp5


	movel		ADJN(%a6),%d0
	cmpil		#4,%d0

	blt		SINCONT
	bras		SCCONT

	.global	ssincosd
ssincosd:
|--SIN AND COS OF X FOR DENORMALIZED X

	fmoves		#0x3F800000,%fp1
	bsr		sto_cos		|store cosine result
	bra		t_extdnrm

	.global	ssincos
ssincos:
|--SET ADJN TO 4
	movel		#4,ADJN(%a6)

	fmovex		(%a0),%fp0	| ...LOAD INPUT

	movel		(%a0),%d0
	movew		4(%a0),%d0
	fmovex		%fp0,X(%a6)
	andil		#0x7FFFFFFF,%d0		| ...COMPACTIFY X

	cmpil		#0x3FD78000,%d0		| ...|X| >= 2**(-40)?
	bges		SCOK1
	bra		SCSM

SCOK1:
	cmpil		#0x4004BC7E,%d0		| ...|X| < 15 PI?
	blts		SCMAIN
	bra		REDUCEX


SCMAIN:
|--THIS IS THE USUAL CASE, |X| < 15 PI.
|--THE ARGUMENT REDUCTION IS DONE BY TABLE LOOK UP.
	fmovex		%fp0,%fp1
	fmuld		TWOBYPI,%fp1	| ...X*2/PI

|--HIDE THE NEXT THREE INSTRUCTIONS
	lea		PITBL+0x200,%a1 | ...TABLE OF N*PI/2, N = -32,...,32


|--FP1 IS NOW READY
	fmovel		%fp1,N(%a6)		| ...CONVERT TO INTEGER

	movel		N(%a6),%d0
	asll		#4,%d0
	addal		%d0,%a1		| ...ADDRESS OF N*PIBY2, IN Y1, Y2

	fsubx		(%a1)+,%fp0	| ...X-Y1
	fsubs		(%a1),%fp0	| ...FP0 IS R = (X-Y1)-Y2

SCCONT:
|--continuation point from REDUCEX

|--HIDE THE NEXT TWO
	movel		N(%a6),%d0
	rorl		#1,%d0

	cmpil		#0,%d0		| ...D0 < 0 IFF N IS ODD
	bge		NEVEN

NODD:
|--REGISTERS SAVED SO FAR: D0, A0, FP2.

	fmovex		%fp0,RPRIME(%a6)
	fmulx		%fp0,%fp0	| ...FP0 IS S = R*R
	fmoved		SINA7,%fp1	| ...A7
	fmoved		COSB8,%fp2	| ...B8
	fmulx		%fp0,%fp1	| ...SA7
	movel		%d2,-(%a7)
	movel		%d0,%d2
	fmulx		%fp0,%fp2	| ...SB8
	rorl		#1,%d2
	andil		#0x80000000,%d2

	faddd		SINA6,%fp1	| ...A6+SA7
	eorl		%d0,%d2
	andil		#0x80000000,%d2
	faddd		COSB7,%fp2	| ...B7+SB8

	fmulx		%fp0,%fp1	| ...S(A6+SA7)
	eorl		%d2,RPRIME(%a6)
	movel		(%a7)+,%d2
	fmulx		%fp0,%fp2	| ...S(B7+SB8)
	rorl		#1,%d0
	andil		#0x80000000,%d0

	faddd		SINA5,%fp1	| ...A5+S(A6+SA7)
	movel		#0x3F800000,POSNEG1(%a6)
	eorl		%d0,POSNEG1(%a6)
	faddd		COSB6,%fp2	| ...B6+S(B7+SB8)

	fmulx		%fp0,%fp1	| ...S(A5+S(A6+SA7))
	fmulx		%fp0,%fp2	| ...S(B6+S(B7+SB8))
	fmovex		%fp0,SPRIME(%a6)

	faddd		SINA4,%fp1	| ...A4+S(A5+S(A6+SA7))
	eorl		%d0,SPRIME(%a6)
	faddd		COSB5,%fp2	| ...B5+S(B6+S(B7+SB8))

	fmulx		%fp0,%fp1	| ...S(A4+...)
	fmulx		%fp0,%fp2	| ...S(B5+...)

	faddd		SINA3,%fp1	| ...A3+S(A4+...)
	faddd		COSB4,%fp2	| ...B4+S(B5+...)

	fmulx		%fp0,%fp1	| ...S(A3+...)
	fmulx		%fp0,%fp2	| ...S(B4+...)

	faddx		SINA2,%fp1	| ...A2+S(A3+...)
	faddx		COSB3,%fp2	| ...B3+S(B4+...)

	fmulx		%fp0,%fp1	| ...S(A2+...)
	fmulx		%fp0,%fp2	| ...S(B3+...)

	faddx		SINA1,%fp1	| ...A1+S(A2+...)
	faddx		COSB2,%fp2	| ...B2+S(B3+...)

	fmulx		%fp0,%fp1	| ...S(A1+...)
	fmulx		%fp2,%fp0	| ...S(B2+...)



	fmulx		RPRIME(%a6),%fp1	| ...R'S(A1+...)
	fadds		COSB1,%fp0	| ...B1+S(B2...)
	fmulx		SPRIME(%a6),%fp0	| ...S'(B1+S(B2+...))

	movel		%d1,-(%sp)	|save users mode & precision
	andil		#0xff,%d1		|mask off all exceptions
	fmovel		%d1,%FPCR
	faddx		RPRIME(%a6),%fp1	| ...COS(X)
	bsr		sto_cos		|store cosine result
	fmovel		(%sp)+,%FPCR	|restore users exceptions
	fadds		POSNEG1(%a6),%fp0	| ...SIN(X)

	bra		t_frcinx


NEVEN:
|--REGISTERS SAVED SO FAR: FP2.

	fmovex		%fp0,RPRIME(%a6)
	fmulx		%fp0,%fp0	| ...FP0 IS S = R*R
	fmoved		COSB8,%fp1	| ...B8
	fmoved		SINA7,%fp2	| ...A7
	fmulx		%fp0,%fp1	| ...SB8
	fmovex		%fp0,SPRIME(%a6)
	fmulx		%fp0,%fp2	| ...SA7
	rorl		#1,%d0
	andil		#0x80000000,%d0
	faddd		COSB7,%fp1	| ...B7+SB8
	faddd		SINA6,%fp2	| ...A6+SA7
	eorl		%d0,RPRIME(%a6)
	eorl		%d0,SPRIME(%a6)
	fmulx		%fp0,%fp1	| ...S(B7+SB8)
	oril		#0x3F800000,%d0
	movel		%d0,POSNEG1(%a6)
	fmulx		%fp0,%fp2	| ...S(A6+SA7)

	faddd		COSB6,%fp1	| ...B6+S(B7+SB8)
	faddd		SINA5,%fp2	| ...A5+S(A6+SA7)

	fmulx		%fp0,%fp1	| ...S(B6+S(B7+SB8))
	fmulx		%fp0,%fp2	| ...S(A5+S(A6+SA7))

	faddd		COSB5,%fp1	| ...B5+S(B6+S(B7+SB8))
	faddd		SINA4,%fp2	| ...A4+S(A5+S(A6+SA7))

	fmulx		%fp0,%fp1	| ...S(B5+...)
	fmulx		%fp0,%fp2	| ...S(A4+...)

	faddd		COSB4,%fp1	| ...B4+S(B5+...)
	faddd		SINA3,%fp2	| ...A3+S(A4+...)

	fmulx		%fp0,%fp1	| ...S(B4+...)
	fmulx		%fp0,%fp2	| ...S(A3+...)

	faddx		COSB3,%fp1	| ...B3+S(B4+...)
	faddx		SINA2,%fp2	| ...A2+S(A3+...)

	fmulx		%fp0,%fp1	| ...S(B3+...)
	fmulx		%fp0,%fp2	| ...S(A2+...)

	faddx		COSB2,%fp1	| ...B2+S(B3+...)
	faddx		SINA1,%fp2	| ...A1+S(A2+...)

	fmulx		%fp0,%fp1	| ...S(B2+...)
	fmulx		%fp2,%fp0	| ...S(A1+...)



	fadds		COSB1,%fp1	| ...B1+S(B2+...)
	fmulx		RPRIME(%a6),%fp0	| ...R'S(A1+...)
	fmulx		SPRIME(%a6),%fp1	| ...S'(B1+S(B2+...))

	movel		%d1,-(%sp)	|save user's mode & precision
	andil		#0xff,%d1	|mask off all exceptions
	fmovel		%d1,%FPCR
	fadds		POSNEG1(%a6),%fp1	| ...COS(X)
	bsr		sto_cos		|store cosine result
	fmovel		(%sp)+,%FPCR	|restore user's exceptions
	faddx		RPRIME(%a6),%fp0	| ...SIN(X)

	bra		t_frcinx

SCBORS:
	cmpil		#0x3FFF8000,%d0
	bgt		REDUCEX


SCSM:
	movew		#0x0000,XDCARE(%a6)
	fmoves		#0x3F800000,%fp1

	movel		%d1,-(%sp)	|save user's mode & precision
	andil		#0xff,%d1	|mask off all exceptions
	fmovel		%d1,%FPCR
	fsubs		#0x00800000,%fp1
	bsr		sto_cos		|store cosine result
	fmovel		(%sp)+,%FPCR	|restore user's exceptions
	fmovex		X(%a6),%fp0
	bra		t_frcinx

	|end