arm/nwfpe/netwinder-fpe.rst

8c2ecf20Sopenharmony_ci=============
8c2ecf20Sopenharmony_ciCurrent State
8c2ecf20Sopenharmony_ci=============
8c2ecf20Sopenharmony_ci
8c2ecf20Sopenharmony_ciThe following describes the current state of the NetWinder's floating point
8c2ecf20Sopenharmony_ciemulator.
8c2ecf20Sopenharmony_ci
8c2ecf20Sopenharmony_ciIn the following nomenclature is used to describe the floating point
8c2ecf20Sopenharmony_ciinstructions.  It follows the conventions in the ARM manual.
8c2ecf20Sopenharmony_ci
8c2ecf20Sopenharmony_ci::
8c2ecf20Sopenharmony_ci
8c2ecf20Sopenharmony_ci  <S|D|E> = <single|double|extended>, no default
8c2ecf20Sopenharmony_ci  {P|M|Z} = {round to +infinity,round to -infinity,round to zero},
8c2ecf20Sopenharmony_ci            default = round to nearest
8c2ecf20Sopenharmony_ci
8c2ecf20Sopenharmony_ciNote: items enclosed in {} are optional.
8c2ecf20Sopenharmony_ci
8c2ecf20Sopenharmony_ciFloating Point Coprocessor Data Transfer Instructions (CPDT)
8c2ecf20Sopenharmony_ci------------------------------------------------------------
8c2ecf20Sopenharmony_ci
8c2ecf20Sopenharmony_ciLDF/STF - load and store floating
8c2ecf20Sopenharmony_ci
8c2ecf20Sopenharmony_ci<LDF|STF>{cond}<S|D|E> Fd, Rn
8c2ecf20Sopenharmony_ci<LDF|STF>{cond}<S|D|E> Fd, [Rn, #<expression>]{!}
8c2ecf20Sopenharmony_ci<LDF|STF>{cond}<S|D|E> Fd, [Rn], #<expression>
8c2ecf20Sopenharmony_ci
8c2ecf20Sopenharmony_ciThese instructions are fully implemented.
8c2ecf20Sopenharmony_ci
8c2ecf20Sopenharmony_ciLFM/SFM - load and store multiple floating
8c2ecf20Sopenharmony_ci
8c2ecf20Sopenharmony_ciForm 1 syntax:
8c2ecf20Sopenharmony_ci<LFM|SFM>{cond}<S|D|E> Fd, <count>, [Rn]
8c2ecf20Sopenharmony_ci<LFM|SFM>{cond}<S|D|E> Fd, <count>, [Rn, #<expression>]{!}
8c2ecf20Sopenharmony_ci<LFM|SFM>{cond}<S|D|E> Fd, <count>, [Rn], #<expression>
8c2ecf20Sopenharmony_ci
8c2ecf20Sopenharmony_ciForm 2 syntax:
8c2ecf20Sopenharmony_ci<LFM|SFM>{cond}<FD,EA> Fd, <count>, [Rn]{!}
8c2ecf20Sopenharmony_ci
8c2ecf20Sopenharmony_ciThese instructions are fully implemented.  They store/load three words
8c2ecf20Sopenharmony_cifor each floating point register into the memory location given in the
8c2ecf20Sopenharmony_ciinstruction.  The format in memory is unlikely to be compatible with
8c2ecf20Sopenharmony_ciother implementations, in particular the actual hardware.  Specific
8c2ecf20Sopenharmony_cimention of this is made in the ARM manuals.
8c2ecf20Sopenharmony_ci
8c2ecf20Sopenharmony_ciFloating Point Coprocessor Register Transfer Instructions (CPRT)
8c2ecf20Sopenharmony_ci----------------------------------------------------------------
8c2ecf20Sopenharmony_ci
8c2ecf20Sopenharmony_ciConversions, read/write status/control register instructions
8c2ecf20Sopenharmony_ci
8c2ecf20Sopenharmony_ciFLT{cond}<S,D,E>{P,M,Z} Fn, Rd          Convert integer to floating point
8c2ecf20Sopenharmony_ciFIX{cond}{P,M,Z} Rd, Fn                 Convert floating point to integer
8c2ecf20Sopenharmony_ciWFS{cond} Rd                            Write floating point status register
8c2ecf20Sopenharmony_ciRFS{cond} Rd                            Read floating point status register
8c2ecf20Sopenharmony_ciWFC{cond} Rd                            Write floating point control register
8c2ecf20Sopenharmony_ciRFC{cond} Rd                            Read floating point control register
8c2ecf20Sopenharmony_ci
8c2ecf20Sopenharmony_ciFLT/FIX are fully implemented.
8c2ecf20Sopenharmony_ci
8c2ecf20Sopenharmony_ciRFS/WFS are fully implemented.
8c2ecf20Sopenharmony_ci
8c2ecf20Sopenharmony_ciRFC/WFC are fully implemented.  RFC/WFC are supervisor only instructions, and
8c2ecf20Sopenharmony_cipresently check the CPU mode, and do an invalid instruction trap if not called
8c2ecf20Sopenharmony_cifrom supervisor mode.
8c2ecf20Sopenharmony_ci
8c2ecf20Sopenharmony_ciCompare instructions
8c2ecf20Sopenharmony_ci
8c2ecf20Sopenharmony_ciCMF{cond} Fn, Fm        Compare floating
8c2ecf20Sopenharmony_ciCMFE{cond} Fn, Fm       Compare floating with exception
8c2ecf20Sopenharmony_ciCNF{cond} Fn, Fm        Compare negated floating
8c2ecf20Sopenharmony_ciCNFE{cond} Fn, Fm       Compare negated floating with exception
8c2ecf20Sopenharmony_ci
8c2ecf20Sopenharmony_ciThese are fully implemented.
8c2ecf20Sopenharmony_ci
8c2ecf20Sopenharmony_ciFloating Point Coprocessor Data Instructions (CPDT)
8c2ecf20Sopenharmony_ci---------------------------------------------------
8c2ecf20Sopenharmony_ci
8c2ecf20Sopenharmony_ciDyadic operations:
8c2ecf20Sopenharmony_ci
8c2ecf20Sopenharmony_ciADF{cond}<S|D|E>{P,M,Z} Fd, Fn, <Fm,#value> - add
8c2ecf20Sopenharmony_ciSUF{cond}<S|D|E>{P,M,Z} Fd, Fn, <Fm,#value> - subtract
8c2ecf20Sopenharmony_ciRSF{cond}<S|D|E>{P,M,Z} Fd, Fn, <Fm,#value> - reverse subtract
8c2ecf20Sopenharmony_ciMUF{cond}<S|D|E>{P,M,Z} Fd, Fn, <Fm,#value> - multiply
8c2ecf20Sopenharmony_ciDVF{cond}<S|D|E>{P,M,Z} Fd, Fn, <Fm,#value> - divide
8c2ecf20Sopenharmony_ciRDV{cond}<S|D|E>{P,M,Z} Fd, Fn, <Fm,#value> - reverse divide
8c2ecf20Sopenharmony_ci
8c2ecf20Sopenharmony_ciThese are fully implemented.
8c2ecf20Sopenharmony_ci
8c2ecf20Sopenharmony_ciFML{cond}<S|D|E>{P,M,Z} Fd, Fn, <Fm,#value> - fast multiply
8c2ecf20Sopenharmony_ciFDV{cond}<S|D|E>{P,M,Z} Fd, Fn, <Fm,#value> - fast divide
8c2ecf20Sopenharmony_ciFRD{cond}<S|D|E>{P,M,Z} Fd, Fn, <Fm,#value> - fast reverse divide
8c2ecf20Sopenharmony_ci
8c2ecf20Sopenharmony_ciThese are fully implemented as well.  They use the same algorithm as the
8c2ecf20Sopenharmony_cinon-fast versions.  Hence, in this implementation their performance is
8c2ecf20Sopenharmony_ciequivalent to the MUF/DVF/RDV instructions.  This is acceptable according
8c2ecf20Sopenharmony_cito the ARM manual.  The manual notes these are defined only for single
8c2ecf20Sopenharmony_cioperands, on the actual FPA11 hardware they do not work for double or
8c2ecf20Sopenharmony_ciextended precision operands.  The emulator currently does not check
8c2ecf20Sopenharmony_cithe requested permissions conditions, and performs the requested operation.
8c2ecf20Sopenharmony_ci
8c2ecf20Sopenharmony_ciRMF{cond}<S|D|E>{P,M,Z} Fd, Fn, <Fm,#value> - IEEE remainder
8c2ecf20Sopenharmony_ci
8c2ecf20Sopenharmony_ciThis is fully implemented.
8c2ecf20Sopenharmony_ci
8c2ecf20Sopenharmony_ciMonadic operations:
8c2ecf20Sopenharmony_ci
8c2ecf20Sopenharmony_ciMVF{cond}<S|D|E>{P,M,Z} Fd, <Fm,#value> - move
8c2ecf20Sopenharmony_ciMNF{cond}<S|D|E>{P,M,Z} Fd, <Fm,#value> - move negated
8c2ecf20Sopenharmony_ci
8c2ecf20Sopenharmony_ciThese are fully implemented.
8c2ecf20Sopenharmony_ci
8c2ecf20Sopenharmony_ciABS{cond}<S|D|E>{P,M,Z} Fd, <Fm,#value> - absolute value
8c2ecf20Sopenharmony_ciSQT{cond}<S|D|E>{P,M,Z} Fd, <Fm,#value> - square root
8c2ecf20Sopenharmony_ciRND{cond}<S|D|E>{P,M,Z} Fd, <Fm,#value> - round
8c2ecf20Sopenharmony_ci
8c2ecf20Sopenharmony_ciThese are fully implemented.
8c2ecf20Sopenharmony_ci
8c2ecf20Sopenharmony_ciURD{cond}<S|D|E>{P,M,Z} Fd, <Fm,#value> - unnormalized round
8c2ecf20Sopenharmony_ciNRM{cond}<S|D|E>{P,M,Z} Fd, <Fm,#value> - normalize
8c2ecf20Sopenharmony_ci
8c2ecf20Sopenharmony_ciThese are implemented.  URD is implemented using the same code as the RND
8c2ecf20Sopenharmony_ciinstruction.  Since URD cannot return a unnormalized number, NRM becomes
8c2ecf20Sopenharmony_cia NOP.
8c2ecf20Sopenharmony_ci
8c2ecf20Sopenharmony_ciLibrary calls:
8c2ecf20Sopenharmony_ci
8c2ecf20Sopenharmony_ciPOW{cond}<S|D|E>{P,M,Z} Fd, Fn, <Fm,#value> - power
8c2ecf20Sopenharmony_ciRPW{cond}<S|D|E>{P,M,Z} Fd, Fn, <Fm,#value> - reverse power
8c2ecf20Sopenharmony_ciPOL{cond}<S|D|E>{P,M,Z} Fd, Fn, <Fm,#value> - polar angle (arctan2)
8c2ecf20Sopenharmony_ci
8c2ecf20Sopenharmony_ciLOG{cond}<S|D|E>{P,M,Z} Fd, <Fm,#value> - logarithm to base 10
8c2ecf20Sopenharmony_ciLGN{cond}<S|D|E>{P,M,Z} Fd, <Fm,#value> - logarithm to base e
8c2ecf20Sopenharmony_ciEXP{cond}<S|D|E>{P,M,Z} Fd, <Fm,#value> - exponent
8c2ecf20Sopenharmony_ciSIN{cond}<S|D|E>{P,M,Z} Fd, <Fm,#value> - sine
8c2ecf20Sopenharmony_ciCOS{cond}<S|D|E>{P,M,Z} Fd, <Fm,#value> - cosine
8c2ecf20Sopenharmony_ciTAN{cond}<S|D|E>{P,M,Z} Fd, <Fm,#value> - tangent
8c2ecf20Sopenharmony_ciASN{cond}<S|D|E>{P,M,Z} Fd, <Fm,#value> - arcsine
8c2ecf20Sopenharmony_ciACS{cond}<S|D|E>{P,M,Z} Fd, <Fm,#value> - arccosine
8c2ecf20Sopenharmony_ciATN{cond}<S|D|E>{P,M,Z} Fd, <Fm,#value> - arctangent
8c2ecf20Sopenharmony_ci
8c2ecf20Sopenharmony_ciThese are not implemented.  They are not currently issued by the compiler,
8c2ecf20Sopenharmony_ciand are handled by routines in libc.  These are not implemented by the FPA11
8c2ecf20Sopenharmony_cihardware, but are handled by the floating point support code.  They should
8c2ecf20Sopenharmony_cibe implemented in future versions.
8c2ecf20Sopenharmony_ci
8c2ecf20Sopenharmony_ciSignalling:
8c2ecf20Sopenharmony_ci
8c2ecf20Sopenharmony_ciSignals are implemented.  However current ELF kernels produced by Rebel.com
8c2ecf20Sopenharmony_cihave a bug in them that prevents the module from generating a SIGFPE.  This
8c2ecf20Sopenharmony_ciis caused by a failure to alias fp_current to the kernel variable
8c2ecf20Sopenharmony_cicurrent_set[0] correctly.
8c2ecf20Sopenharmony_ci
8c2ecf20Sopenharmony_ciThe kernel provided with this distribution (vmlinux-nwfpe-0.93) contains
8c2ecf20Sopenharmony_cia fix for this problem and also incorporates the current version of the
8c2ecf20Sopenharmony_ciemulator directly.  It is possible to run with no floating point module
8c2ecf20Sopenharmony_ciloaded with this kernel.  It is provided as a demonstration of the
8c2ecf20Sopenharmony_citechnology and for those who want to do floating point work that depends
8c2ecf20Sopenharmony_cion signals.  It is not strictly necessary to use the module.
8c2ecf20Sopenharmony_ci
8c2ecf20Sopenharmony_ciA module (either the one provided by Russell King, or the one in this
8c2ecf20Sopenharmony_cidistribution) can be loaded to replace the functionality of the emulator
8c2ecf20Sopenharmony_cibuilt into the kernel.