Skip to content

Commit 5d8eb93

Browse files
authored
chore: Fix multiple typos (#28574)
1 parent 8189977 commit 5d8eb93

File tree

5 files changed

+5
-5
lines changed

5 files changed

+5
-5
lines changed

examples/research_projects/codeparrot/README.md

+1-1
Original file line numberDiff line numberDiff line change
@@ -50,7 +50,7 @@ The raw dataset contains many duplicates. We deduplicated and filtered the datas
5050
- fraction of alphanumeric characters < 0.25
5151
- containing the word "auto-generated" or similar in the first 5 lines
5252
- filtering with a probability of 0.7 of files with a mention of "test file" or "configuration file" or similar in the first 5 lines
53-
- filtering with a probability of 0.7 of files with high occurence of the keywords "test " or "config"
53+
- filtering with a probability of 0.7 of files with high occurrence of the keywords "test " or "config"
5454
- filtering with a probability of 0.7 of files without a mention of the keywords `def` , `for`, `while` and `class`
5555
- filtering files that use the assignment operator `=` less than 5 times
5656
- filtering files with ratio between number of characters and number of tokens after tokenization < 1.5 (the average ratio is 3.6)

examples/research_projects/jax-projects/README.md

+1-1
Original file line numberDiff line numberDiff line change
@@ -1153,7 +1153,7 @@ In the following, we will describe how to do so using a standard console, but yo
11531153
2. Once you've installed the google cloud sdk, you should set your account by running the following command. Make sure that `<your-email-address>` corresponds to the gmail address you used to sign up for this event.
11541154

11551155
```bash
1156-
$ gcloud config set account <your-email-adress>
1156+
$ gcloud config set account <your-email-address>
11571157
```
11581158

11591159
3. Let's also make sure the correct project is set in case your email is used for multiple gcloud projects:

examples/research_projects/jax-projects/big_bird/README.md

+1-1
Original file line numberDiff line numberDiff line change
@@ -57,4 +57,4 @@ wget https://huggingface.co/datasets/vasudevgupta/natural-questions-validation/r
5757
python3 evaluate.py
5858
```
5959

60-
You can find our checkpoint on HuggingFace Hub ([see this](https://huggingface.co/vasudevgupta/flax-bigbird-natural-questions)). In case you are interested in PyTorch BigBird fine-tuning, you can refer to [this repositary](https://github.com/thevasudevgupta/bigbird).
60+
You can find our checkpoint on HuggingFace Hub ([see this](https://huggingface.co/vasudevgupta/flax-bigbird-natural-questions)). In case you are interested in PyTorch BigBird fine-tuning, you can refer to [this repository](https://github.com/thevasudevgupta/bigbird).

examples/research_projects/jax-projects/model_parallel/README.md

+1-1
Original file line numberDiff line numberDiff line change
@@ -27,7 +27,7 @@ To adapt the script for other models, we need to also change the `ParitionSpec`
2727

2828
TODO: Add more explantion.
2929

30-
Before training, let's prepare our model first. To be able to shard the model, the sharded dimention needs to be a multiple of devices it'll be sharded on. But GPTNeo's vocab size is 50257, so we need to resize the embeddings accordingly.
30+
Before training, let's prepare our model first. To be able to shard the model, the sharded dimension needs to be a multiple of devices it'll be sharded on. But GPTNeo's vocab size is 50257, so we need to resize the embeddings accordingly.
3131

3232
```python
3333
from transformers import FlaxGPTNeoForCausalLM, GPTNeoConfig

examples/research_projects/mlm_wwm/README.md

+1-1
Original file line numberDiff line numberDiff line change
@@ -95,4 +95,4 @@ python run_mlm_wwm.py \
9595

9696
**Note1:** On TPU, you should the flag `--pad_to_max_length` to make sure all your batches have the same length.
9797

98-
**Note2:** And if you have any questions or something goes wrong when runing this code, don't hesitate to pin @wlhgtc.
98+
**Note2:** And if you have any questions or something goes wrong when running this code, don't hesitate to pin @wlhgtc.

0 commit comments

Comments
 (0)