
Releases: KernelTuner/kernel_tuner

Version 1.1.3

21 May 12:21

This release contains a number of small bugfixes and adds support for Nvidia Blackwell GPUs.

Full Changelog: 1.1.2...1.1.3

Version 1.1.2

08 Apr 08:48

This release would not have been necessary if I had not forgotten to increment the version number on the previous release that I made 20 minutes ago. Alas, we all make mistakes sometimes.

Version 1.1.1

08 Apr 08:25
ece0719

The sole purpose of this release is to support NumPy 2.0 and newer. The main motivation is to get the examples and tutorial notebooks working again on Google Colab.

Full Changelog: 1.1.0...1.1.1

Version 1.1.0

04 Apr 10:10
85da990

This release integrates many smaller changes that have been made over the past year.

The most significant new features are:

  • The NCUObserver to include performance metrics from the Nvidia Profiler during tuning
  • TegraObserver to read/set clock frequencies, power and temperature on Nvidia Jetson GPUs

In addition, a lot of work has been put into several backends, including OpenACC, the compiler backend, and the HIP backend.

Thanks to everyone who contributed to Kernel Tuner in the past year!

Full Changelog: 1.0...1.1.0

Version 1.0

04 Apr 20:03

Finally, the Version 1.0 release is here! The software has been stable and ready for production use for quite some time, and after about half a year in beta, we are confident that the current version deserves to mark the first major release of Kernel Tuner.

Version 1.0 integrates a lot of new functionality, including blazing fast search space construction, support for tuning HIP kernels on AMD GPUs, new functionality for mixed precision and accuracy tuning, experimental support for tuning OpenACC programs, a conda package installer for Kernel Tuner, and many more changes and additions.
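
To make "search space construction" concrete: a search space is the set of all combinations of tunable parameter values that satisfy the user's restrictions. The sketch below is a naive brute-force illustration in plain Python, not Kernel Tuner's implementation (which, per the changelog, relies on a new parser and a revamped python-constraint). The helper `build_search_space` and the example parameter names and restriction are assumptions made for illustration only.

```python
from itertools import product


def build_search_space(tune_params, restrictions):
    """Enumerate every parameter combination that passes all restrictions.

    Hypothetical brute-force helper for illustration, not a Kernel Tuner API.
    """
    names = list(tune_params)
    space = []
    # Walk the full Cartesian product of all parameter values.
    for values in product(*(tune_params[n] for n in names)):
        config = dict(zip(names, values))
        # Keep the configuration only if every restriction holds.
        if all(check(config) for check in restrictions):
            space.append(config)
    return space


# Illustrative tunable parameters (names and values are made up).
tune_params = {
    "block_size_x": [16, 32, 64],
    "block_size_y": [1, 2, 4],
}
# Illustrative restriction: at most 128 threads per block.
restrictions = [lambda p: p["block_size_x"] * p["block_size_y"] <= 128]

space = build_search_space(tune_params, restrictions)
print(len(space), "valid configurations out of 9")
```

Brute force like this scales with the product of all parameter ranges, which is why a faster construction method matters once the number of parameters and restrictions grows.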

I would like to thank everyone involved in the development of Kernel Tuner over the past years! Special thanks to the Kernel Tuner developer team for their continued support of the project!

From the Changelog

  • HIP backend to support tuning HIP kernels on AMD GPUs
  • Experimental features for mixed-precision and accuracy tuning
  • Experimental features for OpenACC tuning
  • Major speedup due to new parser and using revamped python-constraint for searchspace building
  • Implemented ability to use PySMT and ATF for searchspace building
  • Added Poetry for dependency and build management
  • Switched from setup.py and setup.cfg to pyproject.toml for centralized metadata, added relevant tests
  • Updated GitHub Action workflows to use Poetry
  • Updated dependencies, most notably NumPy is no longer version-locked as scikit-opt is no longer a dependency
  • Documentation now uses pyproject.toml metadata, minor fixes and changes to be compatible with updated dependencies
  • Set up Nox for testing on all supported Python versions in isolated environments
  • Added linting information, VS Code settings and recommendations
  • Discontinued use of OrderedDict, as all dictionaries in the Python versions used are already ordered
  • Dropped Python 3.7 support
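
The OrderedDict item above rests on a language guarantee: since Python 3.7, the built-in dict preserves insertion order. A minimal illustration follows; the parameter names are made up. One difference worth knowing when making the same switch elsewhere is that equality between two OrderedDicts is order-sensitive, while equality between plain dicts is not.

```python
from collections import OrderedDict

# Build a plain dict one key at a time (illustrative names).
params = {}
params["block_size_x"] = 128
params["tile_size"] = 4
params["use_shared_mem"] = 1

# Plain dicts iterate in insertion order (guaranteed since Python 3.7).
keys = list(params)
print(keys)

# Equality differs: OrderedDict compares order, dict does not.
a = {"x": 1, "y": 2}
b = {"y": 2, "x": 1}
same_as_dict = a == b
same_as_ordered = OrderedDict(a) == OrderedDict(b)
print(same_as_dict, same_as_ordered)
```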

Full Changelog: 0.4.5...1.0

Version 1.0.0b6

07 Dec 08:19
Pre-release

This is a beta release for early access to the new features. Not intended for production use.

The release contains:

  • Inclusion of tests in the source package, as requested in #225
  • Updated dependencies

Version 1.0.0b5

01 Nov 14:11
Pre-release

This is a beta release for early access to the new features. Not intended for production use.

The release contains:

  • Expanded documentation on backends by @benvanwerkhoven in #213
  • A fix for an issue that could cause incorrect conversion to Constraint
  • Extended tests to detect this
  • Bump urllib3 from 2.0.6 to 2.0.7 by @dependabot in #222
  • Updated dependencies

Full Changelog: 1.0.0b4...1.0.0b5

Version 1.0.0b4

22 Oct 14:11
Pre-release

This is a beta release for early access to the new features. Not intended for production use.

This release contains several improvements:

  • nvidia-ml-py added to tutorial extra dependencies.
  • Additional checks for coherent Poetry configuration and warning in case of outdated development environment.
  • Updated dependencies.

Version 1.0.0b3

12 Oct 13:02
Pre-release

This is a beta release for early access to the new features. Not intended for production use.

This version contains several bugfixes:

  • Fix snap_to_nearest on non-numeric parameters by @stijnh in #221
  • Fixed an issue where some restrictions would not be recognized by the old check_restrictions function.
  • Fixed an issue where bayes_opt would not handle pruned parameters correctly.
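
The first fix above concerns snapping a proposed value to the nearest allowed one when a parameter is not numeric. The release notes do not describe how it was fixed, so the sketch below only illustrates why non-numeric values need a separate code path: distance is undefined for strings. The helper `snap_to_nearest_value` and its fallback rule are assumptions for illustration, not Kernel Tuner's code.

```python
from numbers import Number


def snap_to_nearest_value(value, allowed):
    """Return the entry of `allowed` closest to `value`.

    Hypothetical helper, not Kernel Tuner's code. Numeric values snap by
    absolute distance; anything else is kept only if it is already allowed,
    otherwise the first allowed value is used (an arbitrary fallback).
    """
    if value in allowed:
        return value
    numeric = [v for v in allowed if isinstance(v, Number)]
    if isinstance(value, Number) and numeric:
        return min(numeric, key=lambda v: abs(v - value))
    # No meaningful distance exists for non-numeric values.
    return allowed[0]


snapped_numeric = snap_to_nearest_value(40, [16, 32, 64])
snapped_known = snap_to_nearest_value("float", ["float", "double"])
snapped_unknown = snap_to_nearest_value("half", ["float", "double"])
print(snapped_numeric, snapped_known, snapped_unknown)
```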

Full Changelog: 1.0.0b2...1.0.0b3

Version 1.0.0b2

11 Oct 16:37
Pre-release

This is a beta release for early access to the new features. Not intended for production use.

Full Changelog: 1.0.0b1...1.0.0b2