Skip to content

StringWa.rs! #1

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 29 commits into
base: main
Choose a base branch
from
Open
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
29 commits
Select commit Hold shift + click to select a range
bc52fcf
Docs: Extend to StringWa.rs
ashvardanian Dec 6, 2024
c842fe9
Add: Levenshtein benchmarks
ashvardanian Dec 6, 2024
b906b91
Add: Hashing benchmarks
ashvardanian Dec 6, 2024
bd23a21
Add: Placeholder for TF-IDF
ashvardanian Dec 8, 2024
7a0fc0b
Improve: More hashing backends
ashvardanian Mar 2, 2025
39dbeb0
Docs: File-level documentation
ashvardanian Mar 9, 2025
fd5cb5f
Add: StringZilla version logging
ashvardanian Mar 9, 2025
11c6f1a
Improve: Cycling through data over indexing
ashvardanian Mar 9, 2025
7fba12f
Add: Sorting drafts
ashvardanian Mar 9, 2025
934af12
Improve: Unified design
ashvardanian Mar 14, 2025
c0ef9ea
Make: Drop legacy helpers
ashvardanian Mar 14, 2025
2bf46ad
Improve: Put slowest hash in the end
ashvardanian Mar 14, 2025
3dc097d
Add: Native substring search benchmarks
ashvardanian Mar 14, 2025
ebe8af9
Docs: Hashing throughput
ashvardanian Mar 14, 2025
5d5469e
Improve: Rename files
ashvardanian Mar 14, 2025
65cc43d
Add: Byteset search benchmarks
ashvardanian Mar 14, 2025
a4d8247
Make: New dependencies
ashvardanian Mar 14, 2025
bc15e90
Add: New byteset benchmarks
ashvardanian Mar 14, 2025
567d39c
Improve: Naming hash benchmarks
ashvardanian Mar 14, 2025
380ca6d
Add: Sequence-sorting benchmarks
ashvardanian Mar 14, 2025
a43ad43
Add: Incremental hashing benchmarks
ashvardanian Mar 15, 2025
849fd99
Docs: New Intel Sapphire Rapids results
ashvardanian Mar 15, 2025
1e246db
Fix: `arrow::LargeStringArray` to avoid overflow
ashvardanian Mar 15, 2025
8aff49f
Docs: Formatting
ashvardanian Mar 16, 2025
05b56bf
Improve: Switch to binary strings
ashvardanian Mar 24, 2025
b42ae25
Fix: Skip empty tokens
ashvardanian Mar 24, 2025
848da95
Add: `memmem` iterators
ashvardanian Mar 24, 2025
1176473
Improve: Style as search benchmarks
ashvardanian Mar 24, 2025
38cefd6
Add: Memory-system benchmarks
ashvardanian Mar 24, 2025
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
28 changes: 28 additions & 0 deletions .vscode/settings.json
Original file line number Diff line number Diff line change
@@ -0,0 +1,28 @@
{
"cSpell.words": [
"ahash",
"Bioinformatics",
"bstr",
"byteset",
"bytesets",
"bytesum",
"corasick",
"Dataframe",
"gxhash",
"lexsort",
"Melem",
"memchr",
"memmem",
"MergeSort",
"QuickSort",
"Needleman",
"rapidfuzz",
"rfind",
"Skylake",
"stringwars",
"stringzilla",
"strstr",
"tfidf",
"Wunsch"
]
}
Loading