Lecture 4


Profiling and optimizing


Development Tools for Scientific Computing - SISSA, 2024-2025

Pasquale Claudio Africa, Dario Coscia
20 Feb 2025

Outline

  1. Profiling and optimizing
  2. Performance boosting

Part of these notes is re-adapted from this lecture and this lecture (license).

Profiling and optimizing

Why can Python be slow?

When developing computer programs today, high-level, human-readable programming languages are typically used, which are then translated into the actual machine instructions that processors execute. This translation can occur in two primary ways:

  • Compiled languages: The code is translated into machine language prior to execution using a compiler. This method generally results in more efficient execution but limits the flexibility of the program during runtime. The compilation process itself can be time-consuming, which may slow down the rapid testing and development cycle.
  • Interpreted languages: The code is translated on-the-fly during execution by an interpreter. While this allows for more dynamic and flexible program behavior during runtime, it typically sacrifices performance.

Python falls into the category of interpreted languages, which facilitates rapid development due to its flexibility. However, this very flexibility often comes with the trade-off of decreased performance.

Dynamic typing

Python is dynamically typed, meaning variables are only assigned a type at runtime when they are first assigned a value. This dynamic nature makes it challenging for the Python interpreter to optimize code execution as efficiently as a compiler, which can perform extensive analyses and optimizations beforehand. Although advancements in just-in-time (JIT) compilation techniques have improved runtime optimizations, Python's inherent dynamism still significantly impacts its performance.
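A small illustration of what dynamic typing means in practice:

x = 42        # x currently refers to an int...
x = "hello"   # ...and now to a str: types belong to objects, not to variables.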

Flexible data structures

Python’s built-in data structures like lists and dictionaries are highly flexible, accommodating a wide variety of data types. However, this flexibility makes them less efficient for certain tasks, such as extensive numerical computations. The generic nature of these structures means there is considerable overhead involved when they are used for processing uniform data types.

Python lists vs. NumPy arrays
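As a minimal sketch of the difference (the array size and repetition count are arbitrary), one can time squaring every element of a Python list against the same operation on a NumPy array:

import numpy as np
import timeit

lst = list(range(100000))
arr = np.arange(100000)

# Squaring a Python list requires looping over generic, boxed objects.
t_list = timeit.timeit(lambda: [x ** 2 for x in lst], number=100)

# The same operation on a NumPy array runs in compiled code over uniform data.
t_arr = timeit.timeit(lambda: arr ** 2, number=100)

print(f"list: {t_list:.3f} s, array: {t_arr:.3f} s")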

Profiling

Once your code is working reliably, you can start thinking about optimizing it.

Warning: Always measure the code before you start optimization. Don't base your optimization on theoretical considerations, otherwise you might be surprised.

Profiling is a key technique in software development used to analyze a program's execution to identify resource-intensive parts. It aids in pinpointing performance bottlenecks and provides insights into the runtime behavior of code components. Profiling involves various tools that measure aspects such as execution time, function call frequency, and resource usage. This process helps developers focus their optimization efforts effectively, enhancing the performance and efficiency of applications, especially in complex systems where issues may not be immediately obvious.

time

An easy way to time a piece of code is to use the time module:

import time
import numpy as np

start_time = time.time()
# Code to profile.
a = np.arange(1000)
a = a ** 2
end_time = time.time()

print(f"Runtime: {end_time - start_time:.4f} seconds")

# Runtime: 0.0001 seconds
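Note that time.time() returns wall-clock time, whose resolution can be limited; for measuring durations, the monotonic time.perf_counter() is generally preferable:

import time

start_time = time.perf_counter()  # Monotonic, high-resolution timer.
# Code to profile.
end_time = time.perf_counter()

print(f"Runtime: {end_time - start_time:.4f} seconds")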

Timeit

In a Jupyter notebook, the %timeit magic command is a convenient way to time a small piece of code:

import numpy as np

a = np.arange(1000)

%timeit a ** 2
# 1.4 µs ± 25.1 ns per loop

For long-running calls, consider using %time instead of %timeit; it's less precise but faster.
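Outside Jupyter, the same functionality is available from the standard-library timeit module, e.g. from the command line:

python -m timeit -s "import numpy as np; a = np.arange(1000)" "a ** 2"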

cProfile (1/2)

For more complex code, use the built-in Python profilers cProfile or profile:

# walk.py
import numpy as np

def step():
    import random
    return 1. if random.random() > .5 else -1.

def walk(n):
    x = np.zeros(n)
    dx = 1. / n
    for i in range(n - 1):
        x_new = x[i] + dx * step()
        if x_new > 5e-3:
            x[i + 1] = 0.
        else:
            x[i + 1] = x_new
    return x

if __name__ == "__main__":
    n = 100000
    x = walk(n)

cProfile (2/2)

python -m cProfile -s time walk.py

The -s switch sorts the results, here by internal time; other sort keys include function name, cumulative time, etc. However, this prints a lot of output, which is difficult to read.

To save the profile to a file, use:

python -m cProfile -o walk.prof walk.py

The output file can be inspected with the pstats module or with profile visualization tools like Snakeviz or profile-viewer.
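For example, with pstats:

import pstats

stats = pstats.Stats("walk.prof")
stats.strip_dirs().sort_stats("time").print_stats(10)  # 10 most expensive functions.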

Similar functionality is available in interactive IPython or Jupyter sessions with the magic command %%prun.

Line-profiler

While cProfile indicates which function takes the most time, it does not provide a line-by-line breakdown. For that, you can use line_profiler. For line-profiling source files from the command line, add the @profile decorator to the functions of interest and run:

kernprof -l -v walk.py

In Jupyter:

%load_ext line_profiler
%lprun -f walk -f step walk(10000)

Line-profiler: sample output

Wrote profile results to walk.py.lprof
Timer unit: 1e-06 s

Total time: 0.113249 s
File: walk.py
Function: step at line 4

Line #      Hits         Time  Per Hit   % Time  Line Contents
==============================================================
   4                                           @profile
   5                                           def step():
   6     99999      57528.0      0.6     50.8      import random
   7     99999      55721.0      0.6     49.2      return 1. if random.random() > .5 else -1.

Based on this output, can you spot a mistake which is affecting performance?

Performance optimization

In software performance optimization, strategies can be broadly categorized into:

  • Algorithm optimization: Focuses on improving the efficiency of the algorithms used, often by selecting more suitable data structures or algorithms that reduce computational complexity. This typically involves refining the logical structure of the code to perform fewer operations or more efficient ones.

  • CPU optimization: Aims to enhance the way a program uses the processor's resources to increase execution speed. Techniques might include parallel processing, vectorization, or tuning the code to better fit the CPU's cache and pipelining features.

  • Memory optimization: Deals with reducing the program's footprint in RAM to prevent memory leaks, reduce paging, and improve cache utilization of data. This can speed up the program by minimizing memory access times.

Algorithm optimization (1/2)

The first step is to review the underlying algorithm. Consider if there are more efficient mathematical approaches or operations that could improve performance.

Example: Singular Value Decomposition (SVD)

import numpy as np
from scipy import linalg
data = np.random.random((4000,100))

%timeit np.linalg.svd(data)
# 1.09 s ± 19.7 ms per loop (mean ± std. dev. of 7 runs, 1 loop each)

%timeit linalg.svd(data)
# 1.03 s ± 24.9 ms per loop (mean ± std. dev. of 7 runs, 1 loop each)

%timeit np.linalg.svd(data, full_matrices=False)
# 23.8 ms ± 3.06 ms per loop (mean ± std. dev. of 7 runs, 10 loops each)

%timeit linalg.svd(data, full_matrices=False)
# 21.2 ms ± 716 µs per loop (mean ± std. dev. of 7 runs, 10 loops each)

Algorithm optimization (2/2)

Example: Fibonacci sequence

# Recursion.
def fib_rec(n):
    if n < 2:
        return n
    return fib_rec(n-2) + fib_rec(n-1)

# Iteration.
def fib_iter(n):
    a, b = 0, 1
    for i in range(n):
        a, b = a + b, a
    return a

# Caching.
def fib_cached(n, cache={}):
    if n < 2:
        return n
    try:
        val = cache[n]
    except KeyError:
        val = fib_cached(n-2) + fib_cached(n-1)
        cache[n] = val
    return val
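The same caching idea is provided out of the box by the standard library; a variant using functools.lru_cache:

from functools import lru_cache

@lru_cache(maxsize=None)
def fib_lru(n):
    if n < 2:
        return n
    return fib_lru(n - 2) + fib_lru(n - 1)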

CPU usage optimization (1/2)

Vectorization

In NumPy, vectorization means replacing explicit Python loops with whole-array operations that execute in compiled code, often exploiting SIMD instructions so that multiple operations are performed per clock cycle.

# Loop version.
import numpy as np
a = np.arange(1000)
a_dif = np.zeros(999, np.int64)
for i in range(1, len(a)):
    a_dif[i-1] = a[i] - a[i-1]

# 564 µs ± 25.2 µs per loop (mean ± std. dev. of 7 runs, 1 loop each)

# Vectorized version.
import numpy as np
a = np.arange(1000)
a_dif = a[1:] - a[:-1]

# 2.12 µs ± 25.8 ns per loop (mean ± std. dev. of 7 runs, 100,000 loops each)
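For this particular operation, NumPy also provides a dedicated function: a_dif = np.diff(a).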

Vectorizing more complex functions

# Using np.vectorize.
import numpy as np
import math

def f(x, y):
    return math.pow(x, 3.0) + 4 * math.sin(y)

f_numpy = np.vectorize(f)

x = np.ones(10000, dtype=np.int8)
f_numpy(x, x)

# Using numba.vectorize.
import numba # More about Numba later!

@numba.vectorize
def f(x, y):
    return x ** 3 + 4 * np.sin(y)

x = np.ones(10000, dtype=np.int8)
f(x, x)
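Note that np.vectorize is essentially a convenience wrapper around a Python loop and does not by itself improve performance, whereas numba.vectorize compiles the function into a true NumPy ufunc.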

Memory usage optimization

Broadcasting

NumPy operations can be optimized using broadcasting, which allows operations between arrays of different sizes but compatible shapes.

Example (1/3):

import numpy as np
a = np.array([1, 2, 3])
b = 4
result = a + b
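# result: array([5, 6, 7])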


Example (2/3):

import numpy as np
a = np.array([[0, 0, 0],[10, 10, 10],[20, 20, 20],[30, 30, 30]])
b = np.array([1, 2, 3])
a + b
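# array([[ 1,  2,  3],
#        [11, 12, 13],
#        [21, 22, 23],
#        [31, 32, 33]])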


Example (3/3):

import numpy as np
a = np.array([0, 10, 20, 30])
b = np.array([1, 2, 3])
a + b # ValueError: operands could not be broadcast together!
a[:, None] + b # Or: a[:, np.newaxis] + b

Memory usage optimization

Cache effects

NumPy arrays are stored in contiguous blocks of memory, and CPU caches make sequential access much faster than large-stride access. The axis along which an array is traversed therefore strongly affects performance:

import numpy as np

# A matrix stored row-wise (C order).
A = np.zeros((10000, 10000), order='C')

%timeit A.sum(axis=0)
# 1 loops, best of 3: 3.89 s per loop

%timeit A.sum(axis=1)
# 1 loops, best of 3: 188 ms per loop

A.strides
# (80000, 8)
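The effect depends on the memory layout; with Fortran (column-major) order, the fast and slow axes swap, as in this sketch:

A = np.zeros((10000, 10000), order='F')

A.strides
# (8, 80000)

# Now it is A.sum(axis=0) that traverses contiguous memory and is fast.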

Memory usage optimization

Temporary arrays

Be mindful of temporary arrays: NumPy creates one for each intermediate result of an expression, which can consume significant memory for large arrays.

import numpy as np

a = np.random.random((1024, 1024, 50))
b = np.random.random((1024, 1024, 50))

# Two temporary arrays will be created.
c = 2.0 * a - 4.5 * b

# Four temporary arrays will be created, two of which are due to the unnecessary parentheses.
c = (2.0 * a - 4.5 * b) + (np.sin(a) + np.cos(b))

# Solution: apply the operations one at a time for really large arrays.
c = 2.0 * a
c = c - 4.5 * b
c = c + np.sin(a)
c = c + np.cos(b)
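Going further, in-place operators reuse the memory of c instead of allocating a new output array at each step (the right-hand sides, e.g. 4.5 * b, still each create one temporary):

c = 2.0 * a
c -= 4.5 * b
c += np.sin(a)
c += np.cos(b)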

Memory usage optimization

Numexpr

  • Evaluating complex expressions one operation at a time can also lead to suboptimal performance.
  • The Numexpr package provides fast evaluation of array expressions:

import numpy as np
import numexpr as ne

x = np.random.random((10000000, 1))
y = np.random.random((10000000, 1))

%timeit y = ((0.25 * x + 0.75) * x - 1.5) * x - 2
%timeit y = ne.evaluate("((0.25 * x + 0.75) * x - 1.5) * x - 2")

  • Numexpr tries to use multiple threads (numexpr.set_num_threads(nthreads)).
  • Supported operations: +, -, *, /, **, sin, cos, tan, exp, log, sqrt.
  • Speedups compared to NumPy are typically between 0.95x and 4x.
  • Works best on arrays that do not fit in the CPU cache.

Performance boosting

Performance boosting

After benchmarking and optimizing your code, the next step involves leveraging libraries such as Cython and Numba for pre-compiling functions that are crucial for performance.

Pre-compiling Python

For a majority of applications, employing libraries like NumPy or Pandas suffices. Yet, for certain high-load tasks, it becomes beneficial to pre-compile resource-intensive functions. Cython and Numba are widely favored for these tasks, particularly for their effective handling of NumPy arrays.

Pre-compiling Python

Cython

Cython is a Python dialect that allows C function calls and type declarations on variables and class attributes. Under Cython, your code is translated into optimized C/C++ code and compiled into Python extension modules. This conversion is performed by the cython command-line utility, which produces a .c file from a .pyx (or .py) file; the .c file then needs to be compiled with a C compiler into a .so library, which can subsequently be imported directly into a Python program. Moreover, Cython can be used directly from Jupyter notebooks via the %%cython magic command. For a comprehensive understanding of its capabilities, refer to the official Cython documentation.

Demo: integrating a function in Cython

Consider a basic Python function that approximates an integral with a simple Riemann sum.
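A pure-Python version, mirroring the Numba implementation shown later in this lecture:

import numpy as np

def f(x):
    return x ** 2 - x

def integrate_f(a, b, N):
    # Approximate the integral of f over [a, b] with N rectangles.
    s = 0
    dx = (b - a) / N
    for i in range(N):
        s += f(a + i * dx)
    return s * dx

def apply_integrate_f(col_a, col_b, col_N):
    # Apply integrate_f element-wise to three columns.
    n = len(col_N)
    res = np.empty(n)
    for i in range(n):
        res[i] = integrate_f(col_a[i], col_b[i], col_N[i])
    return res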

This function, when applied to data columns within a DataFrame, can be timed for execution performance:

import numpy as np
import pandas as pd

df = pd.DataFrame({"a": np.random.randn(1000),
                   "b": np.random.randn(1000),
                   "N": np.random.randint(100, 1000, (1000))})

%timeit apply_integrate_f(df['a'], df['b'], df['N'])
# 321 ms ± 10.7 ms per loop (mean ± std. dev. of 7 runs, 1 loop each)

Cython

By translating the function into Cython, and applying data type annotations, you can markedly enhance its execution speed.

In order to use Cython, we need to import the Cython extension:

%load_ext cython

As a first cythonization step we add the cython magic command with the -a, --annotate flag, i.e. %%cython -a, to the top of the Jupyter code cell.

Cython (1/3)

In the annotated output, yellow coloring marks lines with Python interaction: the deeper the yellow, the more the line relies on the Python interpreter.

Our task is to remove as much yellow as possible by static typing, i.e. explicitly declaring arguments, parameters, variables and functions.

Cython (2/3)

We can start by simply compiling the code using Cython without any changes:

%%cython
# The pure-Python numerical integration code from before, unchanged.

%timeit apply_integrate_f_cython(df['a'], df['b'], df['N'])
# 276 ms ± 20.2 ms per loop (mean ± std. dev. of 7 runs, 1 loop each)

Simply compiling the unchanged code with Cython gives us roughly a 15% increase in performance (321 ms → 276 ms).

Cython (3/3)

We can add data and function type annotations:
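A possible typed version (a sketch modeled on the pandas "Enhancing performance" guide; the exact declarations on the original slide may differ):

%%cython -a
cimport numpy as np
import numpy as np

cdef double f_typed(double x):
    return x ** 2 - x

cpdef double integrate_f_typed(double a, double b, int N):
    cdef int i
    cdef double s = 0
    cdef double dx = (b - a) / N
    for i in range(N):
        s += f_typed(a + i * dx)
    return s * dx

cpdef np.ndarray[double] apply_integrate_f_typed(np.ndarray[double] col_a,
                                                 np.ndarray[double] col_b,
                                                 np.ndarray[np.int64_t] col_N):
    cdef int i
    cdef int n = len(col_N)
    cdef np.ndarray[double] res = np.empty(n)
    for i in range(n):
        res[i] = integrate_f_typed(col_a[i], col_b[i], col_N[i])
    return res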

Now it is over 400 times faster than the original Python implementation, and all we have done is to add type declarations! If we add the -a annotation flag we indeed see much less Python interaction in the code.

Numba (1/3)

An alternative to Cython is Numba, a Just-In-Time compiler that translates a subset of Python and NumPy code into fast machine code. Numba is particularly adept at speeding up functions that utilize NumPy arrays, and works by decorating such functions with the @jit decorator to indicate they should be JIT compiled. This approach is especially beneficial for functions requiring high performance on large data sets or intensive computations.

Example: using Numba for function optimization

A simple example involves a function designed to perform a computational task on array data:

from numba import jit
import numpy as np

@jit
def compute_array(data):
    # Hypothetical example computation: sum of squares via an explicit loop,
    # the kind of code Numba compiles well.
    total = 0.0
    for x in data:
        total += x * x
    return total

Here, the use of Numba's @jit decorator helps in compiling this function into optimized machine code, drastically reducing the computation time especially after the initial compilation.

Numba (2/3)

import numpy as np
import numba

@numba.jit
def f_numba(x):
    return x ** 2 - x

@numba.jit
def integrate_f_numba(a, b, N):
    s = 0
    dx = (b - a) / N
    for i in range(N):
        s += f_numba(a + i * dx)
    return s * dx

@numba.jit
def apply_integrate_f_numba(col_a, col_b, col_N):
    n = len(col_N)
    res = np.empty(n,dtype=np.float64)
    for i in range(n):
        res[i] = integrate_f_numba(col_a[i], col_b[i], col_N[i])
    return res
    
# Try passing Pandas Series.
%timeit apply_integrate_f_numba(df['a'],df['b'],df['N'])
# 6.02 ms ± 56.5 µs per loop (mean ± std. dev. of 7 runs, 1 loop each)

# Try passing NumPy array.
%timeit apply_integrate_f_numba(df['a'].to_numpy(),df['b'].to_numpy(),df['N'].to_numpy())
# 625 µs ± 697 ns per loop (mean ± std. dev. of 7 runs, 1,000 loops each)

Numba (3/3)

Numba supports compilation of Python to run on either CPU or GPU hardware and is designed to integrate with the Python scientific software stack. The optimized machine code is generated by the LLVM compiler infrastructure.

Numba is best at accelerating functions that apply numerical operations to NumPy arrays. When used with Pandas, pass the underlying NumPy array of a Series or DataFrame (via to_numpy()) into the function. If you try to @jit a function that contains unsupported Python or NumPy code, compilation will fall back to object mode, which will most likely be very slow. If you would prefer that Numba throw an error in such a case, use e.g. @numba.jit(nopython=True) or @numba.njit.
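For example:

import numba

@numba.njit  # Equivalent to @numba.jit(nopython=True).
def f_strict(x):
    return x ** 2 - x  # Compilation fails loudly if anything here is unsupported.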

Cython vs. Numba

  • The performance between the two is often comparable, though it can vary based on the versions of Python, Cython, Numba, and NumPy being used.
  • Numba tends to be simpler to implement, requiring just the addition of the @jit decorator.
  • Cython offers more stability and has been around longer, whereas Numba is evolving more rapidly.
  • Numba supports GPU acceleration, which can be a deciding factor for specific applications.
  • Cython allows the compilation of any Python code and can directly interact with C libraries, unlike Numba, which has some limitations.
  • Using Numba necessitates having the LLVM toolchain, whereas Cython needs only a C compiler.

Conclusions

NumPy excels within its scope. For straightforward tasks or smaller datasets, neither Numba nor Cython typically offers a significant performance boost over NumPy. However, for more intricate operations, these tools can be incredibly effective.

Binding C++ and Python: pybind11

pybind11:

  • Modern, relevant, and practical for industry demands.
  • Header-only library, which simplifies the build process.
  • Lightweight, and easy to use.
  • Balances ease of use with powerful features.
  • Generates more pythonic bindings compared to alternatives.
  • Suitable for a range of projects, enhancing problem-solving skills.

⚠️ Note: pybind11 may require more manual work for complex bindings.

Creating bindings for a custom type (1/2)

Let's now look at a more complex example where we'll create bindings for a custom C++ data structure named Pet.

#include <pybind11/pybind11.h>
#include <string>

namespace py = pybind11;

struct Pet {
    Pet(const std::string &name) : name(name) { }
    void set_name(const std::string &name_) { name = name_; }
    const std::string &get_name() const { return name; }

    std::string name;
};

PYBIND11_MODULE(example, m) {
    py::class_<Pet>(m, "Pet")
        .def(py::init<const std::string &>())
        .def("set_name", &Pet::set_name)
        .def("get_name", &Pet::get_name);
}
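Assuming the code above is saved as example.cpp, it can be compiled into a Python extension module on Linux/macOS with the command suggested by the pybind11 documentation:

c++ -O3 -Wall -shared -std=c++11 -fPIC $(python3 -m pybind11 --includes) example.cpp -o example$(python3-config --extension-suffix)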

Creating bindings for a custom type (2/2)

import example

p = example.Pet("Molly")
print(p)
# <example.Pet object at 0x10cd98060>

p.get_name()
# 'Molly'

p.set_name("Charly")
p.get_name()
# 'Charly'