[Feature] Monitor token consumption #806

riedgar-ms · 2024-05-07T14:44:08Z

Count tokens fed into & retrieved from the underlying LLM. This is just for Tranformer and LLamaCpp models at the current time.

This hooks into get_logits() which is the method which concrete Engines must implement. We don't attempt to distinguish between 'forced' and 'prompt' tokens. Between non-unique tokenisations, a variety of tokenisers and token healing, trying to count tokens exactly is decidedly non-trivial.

guidance/models/transformers/_transformers.py

guidance/models/_model.py

tests/library/test_gen.py

codecov-commenter · 2024-05-07T15:18:44Z

Codecov Report

Attention: Patch coverage is 81.25000% with 3 lines in your changes are missing coverage. Please review.

Project coverage is 62.14%. Comparing base (7b4d85f) to head (d860cb2).

Files	Patch %	Lines
guidance/models/transformers/_transformers.py	60.00%	2 Missing ⚠️
guidance/models/_model.py	80.00%	1 Missing ⚠️

❗ Your organization needs to install the Codecov GitHub app to enable full functionality.

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #806      +/-   ##
==========================================
+ Coverage   56.72%   62.14%   +5.42%     
==========================================
  Files          56       57       +1     
  Lines        4159     4174      +15     
==========================================
+ Hits         2359     2594     +235     
+ Misses       1800     1580     -220

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

…trics-01

paulbkoch · 2024-05-09T21:41:43Z

LGTM

riedgar-ms added 7 commits May 7, 2024 08:11

Create the basic class for holding metrics

0b9b43b

Put in, along with a very basic test

6faf8db

Another test to watch the metrics

bbbec17

Getting things close to working.....

25f42bf

Remove minor hangover

230f782

Another oversight

bdd80e7

Add a comment

392a479

riedgar-ms commented May 7, 2024

View reviewed changes

guidance/models/transformers/_transformers.py Outdated Show resolved Hide resolved

riedgar-ms commented May 7, 2024

View reviewed changes

guidance/models/_model.py Outdated Show resolved Hide resolved

riedgar-ms commented May 7, 2024

View reviewed changes

tests/library/test_gen.py Outdated Show resolved Hide resolved

riedgar-ms requested review from Harsha-Nori, paulbkoch and slundberg May 7, 2024 14:50

Need to be able to reset the metrics on the Model

76d533e

riedgar-ms added 15 commits May 8, 2024 08:13

Thinking about another metric

34881c9

Figure out how to call tokeniser

96de164

Reformat

2fa6521

Do some renaming

c3a0c6b

Some more output

ff46ec1

Trying to count forced tokens

2faa583

Try following things through

f8de7c8

I don't think I need these bits

822f8e1

Tweak where stats are grabbed

5c50051

Tidy up tests

67e21c6

Remove extra

bcc269f

Try to figure out if syntax makes a difference

b728b0f

Latest attempt to get consistent token results

9f330c3

Rethink the metrics

216a5de

Add a reset method

a083f1b

riedgar-ms added 7 commits May 9, 2024 06:31

Undo another change

66a3b05

Fix tests

6af11ba

Better name

b25381e

Merge remote-tracking branch 'upstream/main' into riedgar-ms/model-me…

f557f4d

…trics-01

Don't have things for common_chat_testing yet

268d4a0

Hook metrics into llamacpp

4d851b0

Fix test

d860cb2

riedgar-ms changed the title ~~[WIP] [Feature] Monitor token consumption~~ [Feature] Monitor token consumption May 9, 2024

paulbkoch merged commit 5ad2304 into guidance-ai:main May 9, 2024
124 checks passed

riedgar-ms deleted the riedgar-ms/model-metrics-01 branch August 26, 2024 16:20

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Feature] Monitor token consumption #806

[Feature] Monitor token consumption #806

riedgar-ms commented May 7, 2024 •

edited

Loading

codecov-commenter commented May 7, 2024 •

edited

Loading

paulbkoch commented May 9, 2024 •

edited

Loading

[Feature] Monitor token consumption #806

[Feature] Monitor token consumption #806

Conversation

riedgar-ms commented May 7, 2024 • edited Loading

codecov-commenter commented May 7, 2024 • edited Loading

Codecov Report

paulbkoch commented May 9, 2024 • edited Loading

riedgar-ms commented May 7, 2024 •

edited

Loading

codecov-commenter commented May 7, 2024 •

edited

Loading

paulbkoch commented May 9, 2024 •

edited

Loading