-
Notifications
You must be signed in to change notification settings - Fork 13.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Ensure non-empty buffers for large vectored I/O #138879
base: master
Are you sure you want to change the base?
Conversation
Since this involves what we guarantee for these methods, I want to make sure T-libs-api sees it too: r? libs-api The original question that led to this patch was whether these APIs are guaranteed to return |
d4bc926
to
3321ce6
Compare
I've added a test for this case. The second assertion fails before this patch. |
3321ce6
to
1e21e57
Compare
☔ The latest upstream changes (presumably #139101) made this pull request unmergeable. Please resolve the merge conflicts. |
1e21e57
to
82ebd46
Compare
This test is falling for me on macOS on the first assertion, after a few hours of investigation, I came to the conclusion that this is happening because the new method ( If I'm correct, and if returning ok-zero if Another option I though of is maybe letting one empty IoSlice in the POSIX reference I read: https://pubs.opengroup.org/onlinepubs/9799919799/functions/writev.html EDIT: Looks like |
Thanks for such a helpful diagnosis! Turns out I also encounter that same error locally for the first assertion, also on macOS. With this patch, it fixes it. I decided to slice it to one empty buffer and still invoke Also, what you bring up illustrates a larger, preexisting issue. Should we allow zero-length buffer slices or expose the platform's behavior? I'd argue, given that Rust smooths over the case of too-large buffer slices, that this should be similarly handled. We could even handle both cases together by branching on |
82ebd46
to
dc3625d
Compare
I'd argue that Rust should smooth things over whenever possible if it directly affects an public API that is meant to work in a platform independent way, like |
dc3625d
to
de0672f
Compare
`readv` and `writev` are constrained by a platform-specific upper bound on the number of buffers which can be passed. Currently, `read_vectored` and `write_vectored` implementations simply truncate to this limit when larger. However, when the only non-empty buffers are at indices above this limit, they will erroneously return `Ok(0)`. Instead, slice the buffers starting at the first non-empty buffer. This trades a conditional move for a branch, so it's barely a penalty in the common case. The new method `limit_slices` on `IoSlice` and `IoSliceMut` may be generally useful to users like `advance_slices` is, but I have left it as `pub(crate)` for now.
POSIX requires at least one buffer passed to readv and writev, but we allow the user to pass an empty slice of buffers. In this case, return a zero-length read or write.
de0672f
to
9d3dc92
Compare
I implemented the empty slice case, but I'm not satisfied with the approach, so it's a separate commit for now. I needed to implement it as a macro, because I couldn't figure out how to get this sort of pattern to compile. When given an empty slice, it needs to conjure up a use std::io::{IoSlice, IoSliceMut};
fn replace<'a>(bufs: &mut &[IoSlice<'a>], bufs_mut: &mut &mut [IoSliceMut<'a>]) {
*bufs = &[IoSlice::new(&[])];
*bufs_mut = &mut [IoSliceMut::new(&mut [])];
}
|
I suspect not all platforms have the restriction that |
My current feeling is that this is unnecessary for Windows specifically. Having a slice of 4,294,967,295 empty buffers followed by one or more non-empty buffers isn't a particularly realistic scenario and is almost certainly a user error. We could document the limit but I'm not so sure it's worth having special handling for it. |
That certainly seems reasonable, as it would require 64 GiB to create a slice of that many buffers. I think the only useful case would be if Windows forbids a 0-length slice of buffers. I'll have to look through Windows docs for that. |
readv
andwritev
are constrained by a platform-specific upper bound on the number of buffers which can be passed. Currently,read_vectored
andwrite_vectored
implementations simply truncate to this limit when larger. However, when the only non-empty buffers are at indices above this limit, they will erroneously returnOk(0)
.Instead, slice the buffers starting at the first non-empty buffer. This trades a conditional move for a branch, so it's barely a penalty in the common case.
The new method
limit_slices
onIoSlice
andIoSliceMut
may be generally useful to users likeadvance_slices
is, but I have left it aspub(crate)
for now.