Interactive generation script by younesbelkada · Pull Request #53 · bigscience-workshop/bigscience

younesbelkada · 2022-07-08T12:02:53Z

Add small arguments that are accepted by accelerate for better performance
in the previous script we were offloading to the disk which takes a lot of time

cc @Muennighoff

Co-authored-by: Thomas Wang <[email protected]> Co-authored-by: Narsil <[email protected]>

- remove eval - remove pipeline & json from import

Co-authored-by: Niklas Muennighoff <[email protected]>

Muennighoff · 2022-07-12T17:00:38Z

I can't find any documentation on max_cpu_memory - Does this kwarg exist?

Traceback (most recent call last):
  File "generate.py", line 64, in <module>
    main()
  File "generate.py", line 41, in main
    model = AutoModelForCausalLM.from_pretrained(
  File "/gpfswork/rech/six/commun/conda/muennighoffmodelconv/lib/python3.8/site-packages/transformers/models/auto/auto_factory.py", line 446, in from_pretrained
    return model_class.from_pretrained(pretrained_model_name_or_path, *model_args, config=config, **kwargs)
  File "/gpfswork/rech/six/commun/conda/muennighoffmodelconv/lib/python3.8/site-packages/transformers/modeling_utils.py", line 2070, in from_pretrained
    model = cls(config, *model_args, **model_kwargs)
TypeError: __init__() got an unexpected keyword argument 'max_cpu_memory'
bash-4.4$ pip show accelerate
Name: accelerate
Version: 0.11.0.dev0
Summary: Accelerate
Home-page: https://github.com/huggingface/accelerate
Author: The HuggingFace team
Author-email: [email protected]
License: Apache
Location: /gpfsssd/worksf/projects/rech/six/commun/conda/muennighoffmodelconv/lib/python3.8/site-packages
Requires: psutil, torch, packaging, pyyaml, numpy
Required-by:
bash-4.4$ pip show transformers
Name: transformers
Version: 4.21.0.dev0
Summary: State-of-the-art Machine Learning for JAX, PyTorch and TensorFlow
Home-page: https://github.com/huggingface/transformers
Author: The Hugging Face team (past and future) with the help of all our contributors (https://github.com/huggingface/transformers/graphs/contributors)
Author-email: [email protected]

Muennighoff · 2022-07-12T17:13:21Z

Also I'm pretty sure max_memory cannot be a string, but has to be a dictionary

Muennighoff · 2022-07-12T17:29:29Z

Just writing one line and CTRL+C (w/o Enter) yields the below for me. I think there is some batching issue.

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "generate.py", line 64, in <module>
    main()
  File "generate.py", line 59, in main
    output = generate_from_text(model, text, tokenizer, max_length=args.generate_max_length, greedy=args.greedy, top_k=args.top_k)
  File "generate.py", line 25, in generate_from_text
    greedy_output = model.generate(
  File "/gpfswork/rech/six/commun/conda/muennighoffmodelconv/lib/python3.8/site-packages/torch/autograd/grad_mode.py", line 27, in decorate_context
    return func(*args, **kwargs)
  File "/gpfswork/rech/six/commun/conda/muennighoffmodelconv/lib/python3.8/site-packages/transformers/generation_utils.py", line 1288, in generate
    return self.greedy_search(
  File "/gpfswork/rech/six/commun/conda/muennighoffmodelconv/lib/python3.8/site-packages/transformers/generation_utils.py", line 1683, in greedy_search
    outputs = self(
  File "/gpfswork/rech/six/commun/conda/muennighoffmodelconv/lib/python3.8/site-packages/torch/nn/modules/module.py", line 1130, in _call_impl
    return forward_call(*input, **kwargs)
  File "/gpfswork/rech/six/commun/conda/muennighoffmodelconv/lib/python3.8/site-packages/accelerate/hooks.py", line 148, in new_forward
    output = old_forward(*args, **kwargs)
  File "/gpfswork/rech/six/commun/conda/muennighoffmodelconv/lib/python3.8/site-packages/transformers/models/bloom/modeling_bloom.py", line 821, in forward
    transformer_outputs = self.transformer(
  File "/gpfswork/rech/six/commun/conda/muennighoffmodelconv/lib/python3.8/site-packages/torch/nn/modules/module.py", line 1130, in _call_impl
    return forward_call(*input, **kwargs)
  File "/gpfswork/rech/six/commun/conda/muennighoffmodelconv/lib/python3.8/site-packages/accelerate/hooks.py", line 148, in new_forward
    output = old_forward(*args, **kwargs)
  File "/gpfswork/rech/six/commun/conda/muennighoffmodelconv/lib/python3.8/site-packages/transformers/models/bloom/modeling_bloom.py", line 639, in forward
    input_ids = input_ids.view(-1, input_shape[-1])
RuntimeError: cannot reshape tensor of 0 elements into shape [-1, 0] because the unspecified dimension size -1 can be any value and is ambiguous

Muennighoff · 2022-07-13T08:07:19Z

Opened a PR with some changes: younesbelkada#1

younesbelkada and others added 17 commits June 10, 2022 15:37

first try gen script

8b2c770

add args

62fcebb

correct tokenizer

1782c2d

add import torch

80df231

few nits

0d11810

new line

5899ed1

print decoded output

609b880

add small nit

877f76e

fix small nit

560b3fc

forward contrib credits

5e2d2a2

Co-authored-by: Thomas Wang <[email protected]> Co-authored-by: Narsil <[email protected]>

rm useless file

7c6c171

small nits

d6dd676

- remove eval - remove pipeline & json from import

Update evaluation/generation/generate.py

21eca50

Co-authored-by: Niklas Muennighoff <[email protected]>

Update evaluation/generation/generate.py

839622c

Co-authored-by: Niklas Muennighoff <[email protected]>

commit suggestions

ac16dd7

Update evaluation/generation/generate.py

7191802

fix small accelerate nits

5759518

younesbelkada force-pushed the fix_generate branch from 5848c7f to 5759518 Compare July 8, 2022 12:03

Merge branch 'master' into fix_generate

e726b15

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Interactive generation script#53

Interactive generation script#53
younesbelkada wants to merge 18 commits intobigscience-workshop:masterfrom
younesbelkada:fix_generate

younesbelkada commented Jul 8, 2022 •

edited

Loading

Uh oh!

Muennighoff commented Jul 12, 2022

Uh oh!

Muennighoff commented Jul 12, 2022

Uh oh!

Muennighoff commented Jul 12, 2022

Uh oh!

Muennighoff commented Jul 13, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

younesbelkada commented Jul 8, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Muennighoff commented Jul 12, 2022

Uh oh!

Muennighoff commented Jul 12, 2022

Uh oh!

Muennighoff commented Jul 12, 2022

Uh oh!

Muennighoff commented Jul 13, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

younesbelkada commented Jul 8, 2022 •

edited

Loading