Avoid duplicate utf8 parsing #22

Open
jduck wants to merge 1 commit into jonhoo:main from jduck:remove-dupe-utf8-parse

Conversation

@jduck

jduck commented Jan 19, 2026

I spent 2 hours yelling at my screen about this one when I watched your 10-hour
YouTube video today. Hope this change helps get the runtime under 1 second.

-jduck

@arnabanimesh
Contributor

Can you share the benchmark numbers here?

@jduck
Author

jduck commented Feb 5, 2026

I can't, because I don't have the dataset that Jon was using. I tested with the dataset in the repo, and it was a slight improvement.

@d2weber

d2weber commented Mar 5, 2026

As far as I understand, the BTreeMap is needed so that the output will be sorted by station names.

@jduck
Author

jduck commented Apr 5, 2026

> As far as I understand, the BTreeMap is needed so that the output will be sorted by station names.

Shouldn't the data already be sorted? This is only in the print() function, which should not need to convert the string from UTF-8 AGAIN.
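
For anyone following the thread, here is a minimal sketch of the idea under discussion. It is not the code from this PR or from the repo: `Stat` and `render` are hypothetical names, and the output format is made up for illustration. It relies on one fact about Rust: `str` compares by byte value, so a `BTreeMap` keyed by the raw `&[u8]` name iterates in the same order as one keyed by `&str`. That means the sorted view can be built from bytes, with each name validated as UTF-8 only once, when it is printed.

```rust
use std::collections::{BTreeMap, HashMap};

/// Hypothetical per-station summary; the real struct in the repo may differ.
#[derive(Debug, Clone, Copy, PartialEq)]
struct Stat {
    min: i16,
    max: i16,
}

/// Builds sorted output from byte-keyed stats (hypothetical helper).
/// Each station name is validated as UTF-8 exactly once, at print time.
fn render(stats: &HashMap<Vec<u8>, Stat>) -> String {
    // Keys are raw bytes. Byte-wise ordering of valid UTF-8 matches `str`
    // ordering, so no string conversion is needed just to sort.
    let sorted: BTreeMap<&[u8], &Stat> =
        stats.iter().map(|(k, v)| (k.as_slice(), v)).collect();

    let mut out = String::from("{");
    for (i, (name, stat)) in sorted.into_iter().enumerate() {
        if i > 0 {
            out.push_str(", ");
        }
        // The only UTF-8 validation for this station name.
        let name = std::str::from_utf8(name).expect("station name is valid UTF-8");
        out.push_str(&format!("{}={}/{}", name, stat.min, stat.max));
    }
    out.push('}');
    out
}

fn main() {
    let mut stats = HashMap::new();
    stats.insert(b"Oslo".to_vec(), Stat { min: -50, max: 120 });
    stats.insert("Zürich".as_bytes().to_vec(), Stat { min: -10, max: 300 });
    stats.insert(b"Abha".to_vec(), Stat { min: 100, max: 400 });
    println!("{}", render(&stats));
}
```

Whether this helps in practice depends on how many stations there are compared to input rows, so it would still need measuring on the full dataset.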

3 participants