Add `Span` struct (replacing `StrRange`). Spans represent read-only access to a contiguous array, resembling `std::span`. #100293

Ivorforce · 2024-12-11T22:42:53Z

It is currently difficult to run algorithms on String, Vector, LocalVector data ranges or plain C arrays without copying the data first. This leads to inefficiencies.

This PR adds the Span class. A span represents a view into an array (it's a pointer and size).
With Span, it will be easier to implement functions with more agnosticism as to the memory storage, helping to reduce unnecessary copies. Additionally, Span will help bridge the gap between LocalVector and Vector, helping to address godotengine/godot-proposals#5144.

Span is similar to StrView which it replaces, but meant for more than just strings. It is furthermore similar to VectorView which will be replaced in a future PR.

Example

The String.path_to implementation currently has the following lines of code:

godot/core/string/ustring.cpp

Lines 5108 to 5109 in c2e4ae7

    
           Vector<String> src_dirs = src.substr(1, src.length() - 2).split("/"); 
        
           Vector<String> dst_dirs = dst.substr(1, dst.length() - 2).split("/");

Here, the String is first subselected through substr, causing a copy to be made.
Then, it is split through a split call, through which multiple copies of regions of the string are made.
None of the regions are modified in the process - the copies are made merely because it is not possible to avoid them.

An optimized implementation could look like this:

LocalVector<Span<char32_t>> src_dirs = spans::split(src.span().subspan(1, src.length() - 2), U'/'); 
LocalVector<Span<char32_t>> dst_dirs = spans::split(dst.span().subspan(1, dst.length() - 2), U'/');

In this implementation, no copies of the string need to be made, because all Spans used here are views into the original string (spans::split and subspan would need to be added as functions).

Discussion

Span is named after std::span of C++20 std::span.
Span is const-only because a need for mutable spans is not obvious yet. It is easier to use if it is const by default. If mutable spans are needed in the future, it can be proposed then.

Ivorforce · 2024-12-12T00:51:35Z

Looks like the builds are failing.
The Windows failure is a repeat of a problem I had in #99806 already. It can be fixed by touching the file to wipe caches.
I have no idea what the Linux problem is about though, since tests are completing on my own machine.
Edit: Nevermind, I got it.

RandomShaper · 2024-12-12T06:40:16Z

There's a VectorView somewhere in the rendering code (in rendering_device_commons.h, I think) that seems to have the same goal. Also, I barely remember there's a proposal around related to this.

Ivorforce · 2024-12-12T09:41:49Z

There's a VectorView somewhere in the rendering code (in rendering_device_commons.h, I think) that seems to have the same goal. Also, I barely remember there's a proposal around related to this.

Oh yeah, I see it.
I think it would be best to consolidate that into BufferView once merged, in a follow-up PR.

Ivorforce · 2024-12-12T09:56:21Z

To avoid the MSVC compiler issue (which I have previously determined to be a cache issue), I am touching feed_effects.h by inserting an explicit include to a well-known file.
I think it may be caused by the only include of the file being a generated file, possibly leading to the cache resolver running into a race condition when compiling the file. I hope this change can fix the problem for good (though we won't know for sure until future PRs since touching the file already invalidates the cache, fixing the issue).

Ivorforce · 2024-12-12T20:22:40Z

I renamed BufferView -> Span since that's what C++20 calls the concept. And I shrunk the implementation to be bare-bones as we can still add the rest of what we need later.

core/templates/span.h

Repiteo

I'd encourage adding tests as well. Seeing as you're making a new template, you'd be free to make several functions constexpr for compile-time sanity checks:

TEST_CASE("[Span] Constant Validators") {
	constexpr Span<uint8_t> span_empty;
	static_assert(span_empty.size() == 0);
	static_assert(span_empty.is_empty());

	constexpr uint8_t byte = 0;
	constexpr Span<uint8_t> span_byte = Span<uint8_t>(byte, 1);
	static_assert(span_byte.size() == 1);
	static_assert(!span_byte.is_empty());
}

core/templates/span.h

Ivorforce · 2024-12-17T22:18:41Z

I'd encourage adding tests as well. Seeing as you're making a new template, you'd be free to make several functions constexpr for compile-time sanity checks:

TEST_CASE("[Span] Constant Validators") {
	constexpr Span<uint8_t> span_empty;
	static_assert(span_empty.size() == 0);
	static_assert(span_empty.is_empty());

	constexpr uint8_t byte = 0;
	constexpr Span<uint8_t> span_byte = Span<uint8_t>(byte, 1);
	static_assert(span_byte.size() == 1);
	static_assert(!span_byte.is_empty());
}

I like this test! I'll add some more as well.
I'm also realizing, i can make every single function currently in Span constexpr. Not just great for the compiler, but it also means we don't need a single runtime test (yet) 😄

Ivorforce · 2024-12-20T10:38:11Z

Nice, looks like my include fix from #100434 has fixed the feed_effects compile issues for good. No more cache problems!

Ivorforce · 2024-12-20T11:05:30Z

Hrm... Wondering again if it wouldn't be possible to just have const Span<char> propagate constness to the contained type. It would just require a few const cast hacks. I'll have a look later and see if I can make it work.
Edit: It's not possible; it would require const Span<T> const_span(const T *ptr, size_t size) to return const Span, but the constness of the return value is ignored on the caller's side. That's a shame!

Ivorforce · 2025-01-16T18:53:24Z

I've updated the PR according to feedback from the godot core meeting:

Span will hold only const pointers for now. If mutable pointers are needed, they should be separately proposed.
operator[] subscripts will do a sanity bounds check.
find, rfind and count will not be hosted by Span. Instead, they should be hosted by a separate header (to be moved in another PR).
Add implicit conversion from Vector, String, CharString and Char16String to Span, to aid transition.

VectorView will be replaced with Span in a future PR.

The PR is ready for review again.

kiroxas · 2025-01-16T18:59:33Z

core/templates/span.h

+template <typename T>
+class Span {
+	const T *_ptr = nullptr;
+	uint32_t _len = 0;


This is free to make this 64bit as you'll have 32 bit padding right now

On 32-bit systems the current implementation should be size 64 align 32, and on 64-bit systems it should be size 96 align 64, no?

on 32 bit, you're just 64, but I'm not sure 4.x is compiled to 32 bits ? And on 64 you have 64bits of pointer, 32 bits of integer, and then you need to pad to be aligned to the max alignment of your fields (which is 64 due to the pointer)

then you need to pad to be aligned to the max alignment of your fields (which is 64 due to the pointer)

Padding comes before the struct, not after. If you have a Span as a variable, then a uint32_t as the next one, it can fit right after _len without padding because it aligns to the 32 bit left by Span. If I were to make it uint64_t it would take 32 more bits.

No, that's not right. Padding is at the end of the structure (or between the elements) never at the start. And if you declare a Span on the stack, it will pad, the next uin32_t variable will not be directly after your _len field. Except if you #pragma pack(push, 1) so that the compiler does not put padding bytes, but this is not the case here.

Another problem to think about in advance here by the way is that we might have a caller that uses something like:

for (u32 n=0; n<span.size(); n++)

This will end up happening, this is partly why variable size types are such a nightmare.

So it will afaik end up promoting n to 64 bit each time in e.g. a tight loop, just to make this comparison. (This might be free, or might not..)

For these reasons actually I would reconsider fixing size as u32 (similar to LocalVector), and using padding for 64 bit, as it will make usage far easier to reason about and remove platform effects.

In 99.9% of cases, u32 gives 4gb range for 8 bit values, and more when addressing larger T sizes. For those cases that require more than that, they could use e.g. a 64 bit span?

Just my personal thoughts though.

Another knock on effect:

The moment you make size able to hold 64 bit, everywhere that stores this has to account for this possibility, or else have error handling for > 32 bit. This means knock on structures end up having to hold 64 bit instead of 32 bit.

I'm on your side, as I tend to prefer fixed size to make the layout clearer and not platform dependant. This i why I wrote that :

I like uint64_t better BUT ...

But size_t seems to be used a lot in the codebase already....

So it will afaik end up promoting n to 64 bit each time in e.g. a tight loop, just to make this comparison. (This might be free, or might not..)

When I look at assembly for this (and take this with a grain of salt, as some different usage may cause different behaviour), at my surprise, it is totally not free, as it forces one additional move ( godbolt ). It makes sense when you think about it, as it have to do the inc on a 32 bit register for overflow reasons, then move it to a 64 bits register for the comparison. Not a big deal, but it has a cost, so we should take it into consideration.

But size_t seems to be used a lot in the codebase already....

size_t is (correctly) used where OS and API calls return a size_t, however historically we haven't (to my knowledge) used it much internally, for containers etc. But bear in mind I'm more familiar with 3.x.

I can see it used in lru.h (which may be new), but we should consider whether it should be used there.

Let me summarize my current thoughts.
32 bit uint would put 64-bit systems at a disadvantage because > 4gb buffers (at 1 byte objects) would not be supported. Such buffers are admittedly rare, but do happen in my line of work (which is not games admittedly).
64 bit uint would be perfect on 64 bit systems, but slightly slow and potentially unsafe on 32 bit systems.
size_t is somewhat fishy because it's different sizes on different systems.

I'm still on the size_t side because it's fast, safe, and while we have no guarantees C++ side, we do know it to be correct for all major platforms we expect Godot to ship on. It's also used by all C++ size types (sizeof(array), malloc, ...) and STL container types (std::span, std::array...).

core/templates/span.h

…ccess to a contiguous array, resembling `std::span`.

Ivorforce · 2025-01-16T20:53:59Z

servers/rendering/renderer_rd/effects/sort_effects.h

@@ -31,6 +31,7 @@
 #ifndef SORT_EFFECTS_RD_H
 #define SORT_EFFECTS_RD_H

+#include "servers/rendering/renderer_rd/shader_rd.h"


Quick fix due to build failing because of erroneous cache restores.

I have #100293 (comment) that including a non-generated file explicitly solves such cache issues for good.

Repiteo

Codestyle checks out & the use of more modern C++ concepts is a welcome addition

Ivorforce marked this pull request as ready for review December 11, 2024 22:43

Ivorforce requested a review from a team as a code owner December 11, 2024 22:43

Ivorforce force-pushed the buffer-view branch 3 times, most recently from e3be56f to b8c9998 Compare December 11, 2024 23:18

Ivorforce force-pushed the buffer-view branch from b8c9998 to 9f5d7f0 Compare December 12, 2024 00:56

Ivorforce mentioned this pull request Dec 12, 2024

Optimize String.count and String.countn by avoiding repeated reallocations. #100294

Merged

AThousandShips added enhancement topic:core labels Dec 12, 2024

AThousandShips added this to the 4.x milestone Dec 12, 2024

Ivorforce force-pushed the buffer-view branch from 9f5d7f0 to 0a2ce2d Compare December 12, 2024 09:53

Ivorforce requested a review from a team as a code owner December 12, 2024 09:53

Ivorforce force-pushed the buffer-view branch from 0a2ce2d to 3888e73 Compare December 12, 2024 20:21

Ivorforce changed the title ~~Rename StrRange -> BufferView. Move find, rfind and contains functions from CowData to BufferView.~~ Rename StrRange -> Span. Move find, rfind and contains functions from CowData to Span. Dec 12, 2024

Ivorforce force-pushed the buffer-view branch 2 times, most recently from 825c23a to e20f020 Compare December 12, 2024 20:30

This was referenced Dec 12, 2024

Harmonize Vector and LocalVector godotengine/godot-proposals#5144

Open

Consolidate RenderingDeviceDriver's VectorView into Span. #100338

Closed

Ivorforce force-pushed the buffer-view branch 2 times, most recently from d787cfb to 191f22a Compare December 13, 2024 14:31

hpvb reviewed Dec 17, 2024

View reviewed changes

core/templates/span.h Outdated Show resolved Hide resolved

Ivorforce force-pushed the buffer-view branch 2 times, most recently from 967b48e to 49fda0d Compare December 17, 2024 16:11

Repiteo reviewed Dec 17, 2024

View reviewed changes

core/templates/span.h Show resolved Hide resolved

core/templates/span.h Outdated Show resolved Hide resolved

Ivorforce force-pushed the buffer-view branch from 49fda0d to f42523f Compare December 17, 2024 22:26

Ivorforce requested a review from a team as a code owner December 17, 2024 22:26

Ivorforce force-pushed the buffer-view branch from f42523f to f5794e1 Compare December 17, 2024 22:27

Ivorforce mentioned this pull request Dec 18, 2024

Add ptrw() to LocalVector, to bring it in-line with Vector and String. #100555

Open

Ivorforce force-pushed the buffer-view branch 2 times, most recently from 9979811 to d7465a5 Compare December 20, 2024 09:26

This was referenced Jan 4, 2025

Add String::concat and string.extend functions to core for efficient String concatenation. #99929

Open

Make use of latin1 encoding explicit in gdextension_interface.cpp. #101352

Open

Ivorforce force-pushed the buffer-view branch from d7465a5 to bf0325b Compare January 16, 2025 18:48

Ivorforce force-pushed the buffer-view branch from bf0325b to a7dee20 Compare January 16, 2025 18:57

kiroxas approved these changes Jan 16, 2025

View reviewed changes

Ivorforce changed the title ~~Rename StrRange -> Span. Move find, rfind and contains functions from CowData to Span.~~ Add Span struct (replacing StrRange). Spans represent read-only access to a contiguous array, resembling std::span. Jan 16, 2025

Ivorforce force-pushed the buffer-view branch from a7dee20 to 61e2cdd Compare January 16, 2025 19:42

clayjohn requested review from hpvb and RandomShaper January 16, 2025 20:03

Add Span struct (replacing StrRange). Spans represent read-only a…

2e27f3c

…ccess to a contiguous array, resembling `std::span`.

Ivorforce force-pushed the buffer-view branch from 61e2cdd to 2e27f3c Compare January 16, 2025 20:53

Ivorforce commented Jan 16, 2025

View reviewed changes

clayjohn modified the milestones: 4.x, 4.5 Jan 30, 2025

Ivorforce mentioned this pull request Feb 8, 2025

Find_Sequence Function for Arrays, and Array-Like classes godotengine/godot-proposals#11722

Open

Repiteo approved these changes Mar 3, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add `Span` struct (replacing `StrRange`). Spans represent read-only access to a contiguous array, resembling `std::span`. #100293

Add `Span` struct (replacing `StrRange`). Spans represent read-only access to a contiguous array, resembling `std::span`. #100293

Ivorforce commented Dec 11, 2024 •

edited

Loading

Ivorforce commented Dec 12, 2024 •

edited

Loading

RandomShaper commented Dec 12, 2024

Ivorforce commented Dec 12, 2024

Ivorforce commented Dec 12, 2024 •

edited

Loading

Ivorforce commented Dec 12, 2024 •

edited

Loading

Repiteo left a comment

Ivorforce commented Dec 17, 2024 •

edited

Loading

Ivorforce commented Dec 20, 2024

Ivorforce commented Dec 20, 2024 •

edited

Loading

Ivorforce commented Jan 16, 2025 •

edited

Loading

kiroxas Jan 16, 2025

Ivorforce Jan 16, 2025 •

edited

Loading

kiroxas Jan 16, 2025

Ivorforce Jan 16, 2025 •

edited

Loading

kiroxas Jan 16, 2025

lawnjelly Jan 21, 2025 •

edited

Loading

lawnjelly Jan 21, 2025 •

edited

Loading

kiroxas Jan 21, 2025 •

edited

Loading

lawnjelly Jan 21, 2025

Ivorforce Jan 21, 2025 •

edited

Loading

Ivorforce Jan 16, 2025 •

edited

Loading

Repiteo left a comment

	Vector<String> src_dirs = src.substr(1, src.length() - 2).split("/");
	Vector<String> dst_dirs = dst.substr(1, dst.length() - 2).split("/");

Add Span struct (replacing StrRange). Spans represent read-only access to a contiguous array, resembling std::span. #100293

Are you sure you want to change the base?

Add Span struct (replacing StrRange). Spans represent read-only access to a contiguous array, resembling std::span. #100293

Conversation

Ivorforce commented Dec 11, 2024 • edited Loading

Example

Discussion

Ivorforce commented Dec 12, 2024 • edited Loading

RandomShaper commented Dec 12, 2024

Ivorforce commented Dec 12, 2024

Ivorforce commented Dec 12, 2024 • edited Loading

Ivorforce commented Dec 12, 2024 • edited Loading

Repiteo left a comment

Choose a reason for hiding this comment

Ivorforce commented Dec 17, 2024 • edited Loading

Ivorforce commented Dec 20, 2024

Ivorforce commented Dec 20, 2024 • edited Loading

Ivorforce commented Jan 16, 2025 • edited Loading

kiroxas Jan 16, 2025

Choose a reason for hiding this comment

Ivorforce Jan 16, 2025 • edited Loading

Choose a reason for hiding this comment

kiroxas Jan 16, 2025

Choose a reason for hiding this comment

Ivorforce Jan 16, 2025 • edited Loading

Choose a reason for hiding this comment

kiroxas Jan 16, 2025

Choose a reason for hiding this comment

lawnjelly Jan 21, 2025 • edited Loading

Choose a reason for hiding this comment

lawnjelly Jan 21, 2025 • edited Loading

Choose a reason for hiding this comment

Another knock on effect:

kiroxas Jan 21, 2025 • edited Loading

Choose a reason for hiding this comment

lawnjelly Jan 21, 2025

Choose a reason for hiding this comment

Ivorforce Jan 21, 2025 • edited Loading

Choose a reason for hiding this comment

Ivorforce Jan 16, 2025 • edited Loading

Choose a reason for hiding this comment

Repiteo left a comment

Choose a reason for hiding this comment

Add `Span` struct (replacing `StrRange`). Spans represent read-only access to a contiguous array, resembling `std::span`. #100293

Add `Span` struct (replacing `StrRange`). Spans represent read-only access to a contiguous array, resembling `std::span`. #100293

Ivorforce commented Dec 11, 2024 •

edited

Loading

Ivorforce commented Dec 12, 2024 •

edited

Loading

Ivorforce commented Dec 12, 2024 •

edited

Loading

Ivorforce commented Dec 12, 2024 •

edited

Loading

Ivorforce commented Dec 17, 2024 •

edited

Loading

Ivorforce commented Dec 20, 2024 •

edited

Loading

Ivorforce commented Jan 16, 2025 •

edited

Loading

Ivorforce Jan 16, 2025 •

edited

Loading

Ivorforce Jan 16, 2025 •

edited

Loading

lawnjelly Jan 21, 2025 •

edited

Loading

lawnjelly Jan 21, 2025 •

edited

Loading

kiroxas Jan 21, 2025 •

edited

Loading

Ivorforce Jan 21, 2025 •

edited

Loading

Ivorforce Jan 16, 2025 •

edited

Loading