FP16

Header-only library for conversion to/from half-precision floating point formats

Features

Supports IEEE and ARM alternative half-precision floating-point format
- Property converts infinities and NaNs
- Properly converts denormal numbers, even on systems without denormal support
Header-only library, no installation or build required
Compatible with C99 and C++11
Fully covered with unit tests and microbenchmarks

Acknowledgements

The library is developed by Marat Dukhan of Georgia Tech. FP16 is a research project at Richard Vuduc's HPC Garage lab in the Georgia Institute of Technology, College of Computing, School of Computational Science and Engineering.

This material is based upon work supported by the U.S. National Science Foundation (NSF) Award Number 1339745. Any opinions, findings and conclusions or recommendations expressed in this material are those of the authors and do not necessarily reflect those of NSF.

Name		Name	Last commit message	Last commit date
Latest commit History 71 Commits
.github/workflows		.github/workflows
bench		bench
cmake		cmake
include		include
test		test
third-party		third-party
.gitignore		.gitignore
CMakeLists.txt		CMakeLists.txt
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

FP16

Features

Acknowledgements

About

Releases

Packages

Languages

License

brendandahl/FP16

Folders and files

Latest commit

History

Repository files navigation

FP16

Features

Acknowledgements

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages