arm/nwfpe/todo.rst

8c2ecf20Sopenharmony_ciTODO LIST
8c2ecf20Sopenharmony_ci=========
8c2ecf20Sopenharmony_ci
8c2ecf20Sopenharmony_ci::
8c2ecf20Sopenharmony_ci
8c2ecf20Sopenharmony_ci  POW{cond}<S|D|E>{P,M,Z} Fd, Fn, <Fm,#value> - power
8c2ecf20Sopenharmony_ci  RPW{cond}<S|D|E>{P,M,Z} Fd, Fn, <Fm,#value> - reverse power
8c2ecf20Sopenharmony_ci  POL{cond}<S|D|E>{P,M,Z} Fd, Fn, <Fm,#value> - polar angle (arctan2)
8c2ecf20Sopenharmony_ci
8c2ecf20Sopenharmony_ci  LOG{cond}<S|D|E>{P,M,Z} Fd, <Fm,#value> - logarithm to base 10
8c2ecf20Sopenharmony_ci  LGN{cond}<S|D|E>{P,M,Z} Fd, <Fm,#value> - logarithm to base e
8c2ecf20Sopenharmony_ci  EXP{cond}<S|D|E>{P,M,Z} Fd, <Fm,#value> - exponent
8c2ecf20Sopenharmony_ci  SIN{cond}<S|D|E>{P,M,Z} Fd, <Fm,#value> - sine
8c2ecf20Sopenharmony_ci  COS{cond}<S|D|E>{P,M,Z} Fd, <Fm,#value> - cosine
8c2ecf20Sopenharmony_ci  TAN{cond}<S|D|E>{P,M,Z} Fd, <Fm,#value> - tangent
8c2ecf20Sopenharmony_ci  ASN{cond}<S|D|E>{P,M,Z} Fd, <Fm,#value> - arcsine
8c2ecf20Sopenharmony_ci  ACS{cond}<S|D|E>{P,M,Z} Fd, <Fm,#value> - arccosine
8c2ecf20Sopenharmony_ci  ATN{cond}<S|D|E>{P,M,Z} Fd, <Fm,#value> - arctangent
8c2ecf20Sopenharmony_ci
8c2ecf20Sopenharmony_ciThese are not implemented.  They are not currently issued by the compiler,
8c2ecf20Sopenharmony_ciand are handled by routines in libc.  These are not implemented by the FPA11
8c2ecf20Sopenharmony_cihardware, but are handled by the floating point support code.  They should
8c2ecf20Sopenharmony_cibe implemented in future versions.
8c2ecf20Sopenharmony_ci
8c2ecf20Sopenharmony_ciThere are a couple of ways to approach the implementation of these.  One
8c2ecf20Sopenharmony_cimethod would be to use accurate table methods for these routines.  I have
8c2ecf20Sopenharmony_cia couple of papers by S. Gal from IBM's research labs in Haifa, Israel that
8c2ecf20Sopenharmony_ciseem to promise extreme accuracy (in the order of 99.8%) and reasonable speed.
8c2ecf20Sopenharmony_ciThese methods are used in GLIBC for some of the transcendental functions.
8c2ecf20Sopenharmony_ci
8c2ecf20Sopenharmony_ciAnother approach, which I know little about is CORDIC.  This stands for
8c2ecf20Sopenharmony_ciCoordinate Rotation Digital Computer, and is a method of computing
8c2ecf20Sopenharmony_citranscendental functions using mostly shifts and adds and a few
8c2ecf20Sopenharmony_cimultiplications and divisions.  The ARM excels at shifts and adds,
8c2ecf20Sopenharmony_ciso such a method could be promising, but requires more research to
8c2ecf20Sopenharmony_cidetermine if it is feasible.
8c2ecf20Sopenharmony_ci
8c2ecf20Sopenharmony_ciRounding Methods
8c2ecf20Sopenharmony_ci----------------
8c2ecf20Sopenharmony_ci
8c2ecf20Sopenharmony_ciThe IEEE standard defines 4 rounding modes.  Round to nearest is the
8c2ecf20Sopenharmony_cidefault, but rounding to + or - infinity or round to zero are also allowed.
8c2ecf20Sopenharmony_ciMany architectures allow the rounding mode to be specified by modifying bits
8c2ecf20Sopenharmony_ciin a control register.  Not so with the ARM FPA11 architecture.  To change
8c2ecf20Sopenharmony_cithe rounding mode one must specify it with each instruction.
8c2ecf20Sopenharmony_ci
8c2ecf20Sopenharmony_ciThis has made porting some benchmarks difficult.  It is possible to
8c2ecf20Sopenharmony_ciintroduce such a capability into the emulator.  The FPCR contains
8c2ecf20Sopenharmony_cibits describing the rounding mode.  The emulator could be altered to
8c2ecf20Sopenharmony_ciexamine a flag, which if set forced it to ignore the rounding mode in
8c2ecf20Sopenharmony_cithe instruction, and use the mode specified in the bits in the FPCR.
8c2ecf20Sopenharmony_ci
8c2ecf20Sopenharmony_ciThis would require a method of getting/setting the flag, and the bits
8c2ecf20Sopenharmony_ciin the FPCR.  This requires a kernel call in ArmLinux, as WFC/RFC are
8c2ecf20Sopenharmony_cisupervisor only instructions.  If anyone has any ideas or comments I
8c2ecf20Sopenharmony_ciwould like to hear them.
8c2ecf20Sopenharmony_ci
8c2ecf20Sopenharmony_ciNOTE:
8c2ecf20Sopenharmony_ci pulled out from some docs on ARM floating point, specifically
8c2ecf20Sopenharmony_ci for the Acorn FPE, but not limited to it:
8c2ecf20Sopenharmony_ci
8c2ecf20Sopenharmony_ci The floating point control register (FPCR) may only be present in some
8c2ecf20Sopenharmony_ci implementations: it is there to control the hardware in an implementation-
8c2ecf20Sopenharmony_ci specific manner, for example to disable the floating point system.  The user
8c2ecf20Sopenharmony_ci mode of the ARM is not permitted to use this register (since the right is
8c2ecf20Sopenharmony_ci reserved to alter it between implementations) and the WFC and RFC
8c2ecf20Sopenharmony_ci instructions will trap if tried in user mode.
8c2ecf20Sopenharmony_ci
8c2ecf20Sopenharmony_ci Hence, the answer is yes, you could do this, but then you will run a high
8c2ecf20Sopenharmony_ci risk of becoming isolated if and when hardware FP emulation comes out
8c2ecf20Sopenharmony_ci
8c2ecf20Sopenharmony_ci		-- Russell.