BUG: core: fix ilp64 blas dot/vdot/... for strides > int32 max by pv · Pull Request #17121 · numpy/numpy

pv · 2020-08-20T18:42:44Z

Fix overlooked int cast when HAVE_BLAS_ILP64 is defined.
It was supposed to cast to CBLAS_INT, not int.
Also add a regression test.

Fixes #17111

charris · 2020-08-20T23:16:10Z

My machine requires more than this for the allocation.

EDIT: Needs 32 GiB

charris · 2020-08-20T23:17:48Z

I think you should leave out the multiplication by itemsize here.

Maybe use int32_max + 2 instead,

I think no: the BLAS incx = stride//itemsize, and that needs to overflow to trigger this bug.
I can use itemsize*(int32_max + 2) but the test does fail also with the +1.

Ah crap, obviously you're right. No idea what I was thinking.

Fix overlooked int cast when HAVE_BLAS_ILP64 is defined. It was supposed to cast to CBLAS_INT, not int. Also add a regression test. Move blas_stride() to npy_cblas.h Replace npy_is_aligned by modulo; we're going to call BLAS so no need to micro-optimize integer division here.

pv · 2020-08-21T19:52:44Z

Updated. Obvious mistakes fixed, and moved blas_strides to npy_cblas.h.

eric-wieser · 2020-08-21T20:03:51Z

-
 /*
 * Define a chunksize for CBLAS. CBLAS counts in integers.
 */


Should this definition move file too, since it's also blas-related?

Looks like its used in exactly one file, and that file uses CBLAS_INT to store the result.

Not sure. I thought the chunking was mostly related to our *_dot loops, so I left it here.

I think we should move it, if you're happy to make another update.

Uses: https://github.com/numpy/numpy/search?q=NPY_CBLAS_CHUNK&unscoped_q=NPY_CBLAS_CHUNK

Note CBLAS_INT on the same line in all cases - so the callers are already including both headers (and the headers are internal not public)

Moved + fixed the definition. No need to have it differ from intp max except when that's larger than CBLAS_INT max

eric-wieser · 2020-08-21T20:33:53Z

So just to be clear - this is a change for when HAVE_BLAS_ILP64 is defined and NPY_MAX_INTP > INT_MAX.

Previously: NPY_CBLAS_CHUNK = (NPY_MAX_INT64 / 2 + 1)

Now: NPY_CBLAS_CHUNK = NPY_MAX_INTP

I don't have any understanding of the purpose of the /2 + 1, which makes me worry about removing it.

Yes, it changes it to depend only on the CBLAS_INT maximum value. Switching to the smaller threshold for INTP>INT with ilp64 I think is a bug (but with no consequences, nobody has arrays of 2^62 elements). With 32-bit BLAS, there is no change here.

I suspect the INT_MAX/2 + 1 is chosen instead of INT_MAX because some 32-bit BLAS implementations have bugs that makes them fail if the number of elements gets too close to the maximum integer. However, for 64-bit blas, this is a theoretical concern as nobody can run BLAS with that large n.

Actually, 46dd681 seems to clarify it's because that makes the chunksize a power of two.
So I think this is OK. (Restored the comment saying so back.)

I think is a bug (but with no consequences, nobody has arrays of 2^62 elements)

However, for 64-bit blas, this is a theoretical concern as nobody can run BLAS with that large n.

I think this was what I was missing. I'd add that the compiler can optimize out the comparison if we use NPY_MAX_INTP, which is an argument for leaving the bug in.

If you could pull in the power of two comment from that commit, that would be great. Either way, I'm no longer worried by this.

charris · 2020-08-21T23:29:42Z

The error is unrelated. @mattip We have been getting web errors trying to fetch pypy, could you take a look?

mattip · 2020-08-22T18:47:51Z

@charris: seems to have been a temporary outage.

mattip · 2020-08-22T18:49:20Z

Thanks @pv

pv mentioned this pull request Aug 20, 2020

blas_stride has the wrong return type with HAVE_BLAS_ILP64 #17111

Closed

pv added the 00 - Bug label Aug 20, 2020

charris added 09 - Backport-Candidate PRs tagged should be backported component: numpy._core labels Aug 20, 2020

charris added this to the 1.19.2 release milestone Aug 20, 2020

charris reviewed Aug 20, 2020

View reviewed changes

eric-wieser reviewed Aug 20, 2020

View reviewed changes

Comment thread numpy/core/src/multiarray/common.h Outdated

pv force-pushed the fix-ilp64-stride branch from 510c4fc to f241d5f Compare August 21, 2020 19:48

pv force-pushed the fix-ilp64-stride branch from f241d5f to 9cebb29 Compare August 21, 2020 19:49

eric-wieser reviewed Aug 21, 2020

View reviewed changes

Comment thread numpy/core/src/common/npy_cblas.h Outdated

pv force-pushed the fix-ilp64-stride branch from b895334 to 9ecf482 Compare August 21, 2020 20:24

eric-wieser reviewed Aug 21, 2020

View reviewed changes

MAINT: npy_cblas.h: redefine NPY_CBLAS_CHUNK in terms of CBLAS_INT_MAX

51bb217

pv force-pushed the fix-ilp64-stride branch from 9ecf482 to 51bb217 Compare August 21, 2020 20:56

eric-wieser approved these changes Aug 21, 2020

View reviewed changes

mattip merged commit 97f9fcb into numpy:master Aug 22, 2020

charris mentioned this pull request Sep 3, 2020

BUG: core: fix ilp64 blas dot/vdot/... for strides > int32 max #17243

Merged

charris removed the 09 - Backport-Candidate PRs tagged should be backported label Sep 3, 2020

charris removed this from the 1.19.2 release milestone Sep 3, 2020

Uh oh!

Uh oh!

Conversation

pv commented Aug 20, 2020

Uh oh!

charris Aug 20, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

pv Aug 21, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

pv commented Aug 21, 2020

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

eric-wieser Aug 21, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

pv Aug 21, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

charris commented Aug 21, 2020

Uh oh!

mattip commented Aug 22, 2020

Uh oh!

mattip commented Aug 22, 2020

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

charris Aug 20, 2020 •

edited

Loading

pv Aug 21, 2020 •

edited

Loading

eric-wieser Aug 21, 2020 •

edited

Loading

pv Aug 21, 2020 •

edited

Loading