#ifndef NDA_HAVE_DEVICE
// ...
template <typename A, typename X, typename Y>
// ...
  EXPECTS(a.extent(1) == x.extent(0));
  EXPECTS(a.extent(0) == y.extent(0));
  for (int i = 0; i < a.extent(0); ++i) {
    // ...
    for (int k = 0; k < a.extent(1); ++k) y(i) += alpha * a(i, k) * x(k);
// ...
template <Matrix A, MemoryVector X, MemoryVector Y>
// ...
  auto to_mat = []<Matrix Z>(Z const &z) -> decltype(auto) {
    // ...
    return std::get<0>(z.a);
  // ...
  auto &mat = to_mat(a);
  // ...
  using mat_type = decltype(mat);
  // ...
  EXPECTS(mat.extent(1) == x.extent(0));
  EXPECTS(mat.extent(0) == y.extent(0));
  EXPECTS(mat.indexmap().min_stride() == 1);
  EXPECTS(x.indexmap().min_stride() == 1);
  EXPECTS(y.indexmap().min_stride() == 1);
  // ...
  auto [m, n] = mat.shape();
  // ...
#if defined(NDA_HAVE_DEVICE)
  device::gemv(op_a, m, n, alpha, mat.data(), get_ld(mat), x.data(), x.indexmap().strides()[0], beta, y.data(), y.indexmap().strides()[0]);
  // ...
  f77::gemv(op_a, m, n, alpha, mat.data(), get_ld(mat), x.data(), x.indexmap().strides()[0], beta, y.data(), y.indexmap().strides()[0]);
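
The generic fallback at the top of the listing checks that the extents match and then accumulates `alpha * a(i, k) * x(k)` into `y(i)`. The following is a minimal standalone sketch of that loop, not the nda implementation. `DenseMatrix` and `gemv_sketch` are hypothetical names introduced for illustration. The listing omits the line between the two loops; the sketch assumes `y(i)` is scaled by `beta` there, as the standard gemv operation y <- alpha * A * x + beta * y requires.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical stand-in for an nda matrix: dense, row-major storage.
struct DenseMatrix {
  std::size_t rows, cols;
  std::vector<double> data; // expected size: rows * cols
  double operator()(std::size_t i, std::size_t k) const { return data[i * cols + k]; }
};

// Computes y <- alpha * a * x + beta * y with plain loops.
inline void gemv_sketch(double alpha, DenseMatrix const &a, std::vector<double> const &x,
                        double beta, std::vector<double> &y) {
  // Same dimension checks as the EXPECTS lines in the listing.
  assert(a.cols == x.size());
  assert(a.rows == y.size());
  for (std::size_t i = 0; i < a.rows; ++i) {
    // Assumption: y(i) is scaled by beta before the accumulation.
    y[i] *= beta;
    for (std::size_t k = 0; k < a.cols; ++k) y[i] += alpha * a(i, k) * x[k];
  }
}
```

A loop of this kind works for any element type that supports multiplication and addition, which is presumably why it serves as the fallback for value types that BLAS does not handle.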
Provides definitions and type traits involving the different memory address spaces supported by nda.
void swap(nda::basic_array_view< V1, R1, LP1, A1, AP1, OP1 > &a, nda::basic_array_view< V2, R2, LP2, A2, AP2, OP2 > &b)=delete
std::swap is deleted for nda::basic_array_view.
Provides a C++ interface for various BLAS routines.
Check if a given type is a matrix, i.e. an nda::ArrayOfRank<2>.
Check if a given type is a memory matrix, i.e. an nda::MemoryArrayOfRank<2>.
Provides concepts for the nda library.
Provides GPU and non-GPU specific functionality.
constexpr bool have_same_value_type_v
Constexpr variable that is true if all types in As have the same value type as A0.
std::decay_t< decltype(get_first_element(std::declval< A const >()))> get_value_t
Get the value type of an array/view or a scalar type.
int get_ld(A const &a)
Get the leading dimension in LAPACK jargon of an nda::MemoryMatrix.
static constexpr bool has_C_layout
Constexpr variable that is true if the given nda::Array type has a C memory layout.
void gemv_generic(get_value_t< A > alpha, A const &a, X const &x, get_value_t< A > beta, Y &&y)
Generic nda::blas::gemv implementation for types not supported by BLAS/LAPACK.
static constexpr bool is_conj_array_expr
Constexpr variable that is true if the given type is a conjugate lazy expression.
void gemv(get_value_t< A > alpha, A const &a, X const &x, get_value_t< A > beta, Y &&y)
Interface to the BLAS gemv routine.
static constexpr bool has_F_layout
Constexpr variable that is true if the given nda::Array type has a Fortran memory layout.
const char get_op
Variable template that determines the BLAS matrix operation tag ('N','T','C') based on the given bool...
static constexpr bool have_compatible_addr_space
Constexpr variable that is true if all given types have compatible address spaces.
static constexpr bool have_device_compatible_addr_space
Constexpr variable that is true if all given types have an address space compatible with Device.
void compile_error_no_gpu()
Trigger a compilation error in case GPU specific functionality is used without configuring the projec...
constexpr bool is_blas_lapack_v
Alias for nda::is_double_or_complex_v.
Macros used in the nda library.
Provides type traits for the nda library.
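
The descriptions of `has_C_layout`, `get_op` and `get_ld` suggest that the memory layout of the matrix determines the operation tag and leading dimension handed to the BLAS backend, but the listing omits the lines that compute `op_a`. A common technique for passing a C-layout (row-major) matrix to a column-major gemv is to treat the buffer as the transposed matrix, swap the two dimensions, and request the transposed operation. The sketch below illustrates that general technique only; `ref_gemv` and `gemv_row_major` are hypothetical names, and whether nda proceeds in exactly this way is an assumption.

```cpp
#include <cassert>
#include <vector>

// Reference column-major gemv for doubles: y <- alpha * op(A) * x + beta * y.
// A is m x n in column-major order with leading dimension lda >= m.
// op is 'N' (use A) or 'T' (use the transpose of A).
inline void ref_gemv(char op, int m, int n, double alpha, double const *a, int lda,
                     double const *x, int incx, double beta, double *y, int incy) {
  assert(op == 'N' || op == 'T');
  int const len_y = (op == 'N') ? m : n;
  int const len_x = (op == 'N') ? n : m;
  for (int i = 0; i < len_y; ++i) {
    double acc = 0.0;
    for (int k = 0; k < len_x; ++k) {
      // Element (i, k) of op(A) in column-major storage.
      double const a_ik = (op == 'N') ? a[i + k * lda] : a[k + i * lda];
      acc += a_ik * x[k * incx];
    }
    y[i * incy] = alpha * acc + beta * y[i * incy];
  }
}

// Hypothetical wrapper for a contiguous row-major rows x cols matrix.
// The same buffer read column-major is the cols x rows transpose with
// leading dimension cols, so the dimensions are swapped and 'T' is requested.
inline void gemv_row_major(int rows, int cols, double alpha, double const *a,
                           double const *x, double beta, double *y) {
  ref_gemv('T', cols, rows, alpha, a, cols, x, 1, beta, y, 1);
}
```

With this approach no data is copied: only the operation tag, the order of the dimensions, and the leading dimension change.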