Dataset statistics
Number of variables | 9 |
---|---|
Number of observations | 41002 |
Missing cells | 8591 |
Missing cells (%) | 2.3% |
Duplicate rows | 45 |
Duplicate rows (%) | 0.1% |
Total size in memory | 18.4 MiB |
Average record size in memory | 469.8 B |
Variable types
Text | 6 |
---|---|
Categorical | 2 |
Numeric | 1 |
Dataset has 45 (0.1%) duplicate rows | Duplicates |
Name has 850 (2.1%) missing values | Missing |
Opponent has 960 (2.3%) missing values | Missing |
W/L has 850 (2.1%) missing values | Missing |
Method has 1862 (4.5%) missing values | Missing |
Competition has 870 (2.1%) missing values | Missing |
Weight has 1301 (3.2%) missing values | Missing |
Stage has 1048 (2.6%) missing values | Missing |
Year has 850 (2.1%) missing values | Missing |
Reproduction
Analysis started | 2024-07-28 16:11:17.272738 |
---|---|
Analysis finished | 2024-07-28 16:11:19.436010 |
Duration | 2.16 seconds |
Software version | ydata-profiling vv4.9.0 |
Download configuration | config.json |
URL Tag
Text
Distinct | 1374 |
---|---|
Distinct (%) | 3.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.4 MiB |
Length
Max length | 27 |
---|---|
Median length | 25 |
Mean length | 13.50361 |
Min length | 6 |
Characters and Unicode
Total characters | 553675 |
---|---|
Distinct characters | 30 |
Distinct categories | 4 ? |
Distinct scripts | 2 ? |
Distinct blocks | 2 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
Unique | 831 ? |
---|---|
Unique (%) | 2.0% |
Sample
1st row | aarae-alexander |
---|---|
2nd row | aaron-johnson |
3rd row | aaron-johnson |
4th row | aaron-johnson |
5th row | aaron-johnson |
Value | Count | Frequency (%) |
adam-wardzinski | 349 | 0.8% |
gianni-grippo | 335 | 0.8% |
erberth-santos | 309 | 0.8% |
fellipe-andrew | 304 | 0.7% |
thiago-macedo | 266 | 0.6% |
felipe-cesar | 260 | 0.6% |
renato-cardoso | 258 | 0.6% |
jaime-canuto | 252 | 0.6% |
joao-miyao | 244 | 0.6% |
jackson-sousa | 242 | 0.6% |
Other values (1442) | 38318 |
Most occurring characters
Value | Count | Frequency (%) |
a | 71473 | |
e | 46949 | 8.5% |
i | 45513 | 8.2% |
r | 44950 | 8.1% |
o | 43496 | 7.9% |
- | 41287 | 7.5% |
n | 34198 | 6.2% |
s | 29862 | 5.4% |
l | 27897 | 5.0% |
d | 18997 | 3.4% |
Other values (20) | 149053 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 512252 | |
Dash Punctuation | 41287 | 7.5% |
Space Separator | 135 | < 0.1% |
Other Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
a | 71473 | |
e | 46949 | 9.2% |
i | 45513 | 8.9% |
r | 44950 | 8.8% |
o | 43496 | 8.5% |
n | 34198 | 6.7% |
s | 29862 | 5.8% |
l | 27897 | 5.4% |
d | 18997 | 3.7% |
c | 17787 | 3.5% |
Other values (17) | 131130 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 41287 |
Space Separator
Value | Count | Frequency (%) |
135 |
Other Punctuation
Value | Count | Frequency (%) |
/ | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 512252 | |
Common | 41423 | 7.5% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
a | 71473 | |
e | 46949 | 9.2% |
i | 45513 | 8.9% |
r | 44950 | 8.8% |
o | 43496 | 8.5% |
n | 34198 | 6.7% |
s | 29862 | 5.8% |
l | 27897 | 5.4% |
d | 18997 | 3.7% |
c | 17787 | 3.5% |
Other values (17) | 131130 |
Common
Value | Count | Frequency (%) |
- | 41287 | |
135 | 0.3% | |
/ | 1 | < 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 553673 | |
None | 2 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
a | 71473 | |
e | 46949 | 8.5% |
i | 45513 | 8.2% |
r | 44950 | 8.1% |
o | 43496 | 7.9% |
- | 41287 | 7.5% |
n | 34198 | 6.2% |
s | 29862 | 5.4% |
l | 27897 | 5.0% |
d | 18997 | 3.4% |
Other values (19) | 149051 |
None
Value | Count | Frequency (%) |
ã | 2 |
Name
Text
MISSING
 
Distinct | 534 |
---|---|
Distinct (%) | 1.3% |
Missing | 850 |
Missing (%) | 2.1% |
Memory size | 2.4 MiB |
Length
Max length | 24 |
---|---|
Median length | 20 |
Mean length | 13.481969 |
Min length | 8 |
Characters and Unicode
Total characters | 541328 |
---|---|
Distinct characters | 27 |
Distinct categories | 2 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
Unique | 1 ? |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | aaron-johnson |
---|---|
2nd row | aaron-johnson |
3rd row | aaron-johnson |
4th row | aaron-johnson |
5th row | aaron-johnson |
Value | Count | Frequency (%) |
adam-wardzinski | 349 | 0.9% |
gianni-grippo | 335 | 0.8% |
erberth-santos | 309 | 0.8% |
fellipe-andrew | 304 | 0.8% |
thiago-macedo | 266 | 0.7% |
felipe-cesar | 260 | 0.6% |
renato-cardoso | 258 | 0.6% |
jaime-canuto | 252 | 0.6% |
joao-miyao | 244 | 0.6% |
jackson-sousa | 242 | 0.6% |
Other values (524) | 37333 |
Most occurring characters
Value | Count | Frequency (%) |
a | 69900 | |
e | 45907 | 8.5% |
i | 44500 | 8.2% |
r | 43884 | 8.1% |
o | 42433 | 7.8% |
- | 40433 | 7.5% |
n | 33459 | 6.2% |
s | 29238 | 5.4% |
l | 27213 | 5.0% |
d | 18583 | 3.4% |
Other values (17) | 145778 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 500895 | |
Dash Punctuation | 40433 | 7.5% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
a | 69900 | |
e | 45907 | 9.2% |
i | 44500 | 8.9% |
r | 43884 | 8.8% |
o | 42433 | 8.5% |
n | 33459 | 6.7% |
s | 29238 | 5.8% |
l | 27213 | 5.4% |
d | 18583 | 3.7% |
c | 17366 | 3.5% |
Other values (16) | 128412 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 40433 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 500895 | |
Common | 40433 | 7.5% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
a | 69900 | |
e | 45907 | 9.2% |
i | 44500 | 8.9% |
r | 43884 | 8.8% |
o | 42433 | 8.5% |
n | 33459 | 6.7% |
s | 29238 | 5.8% |
l | 27213 | 5.4% |
d | 18583 | 3.7% |
c | 17366 | 3.5% |
Other values (16) | 128412 |
Common
Value | Count | Frequency (%) |
- | 40433 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 541328 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
a | 69900 | |
e | 45907 | 8.5% |
i | 44500 | 8.2% |
r | 43884 | 8.1% |
o | 42433 | 7.8% |
- | 40433 | 7.5% |
n | 33459 | 6.2% |
s | 29238 | 5.4% |
l | 27213 | 5.0% |
d | 18583 | 3.4% |
Other values (17) | 145778 |
Opponent
Text
MISSING
 
Distinct | 9610 |
---|---|
Distinct (%) | 24.0% |
Missing | 960 |
Missing (%) | 2.3% |
Memory size | 2.7 MiB |
Length
Max length | 48 |
---|---|
Median length | 32 |
Mean length | 19.856251 |
Min length | 3 |
Characters and Unicode
Total characters | 795084 |
---|---|
Distinct characters | 77 |
Distinct categories | 7 ? |
Distinct scripts | 2 ? |
Distinct blocks | 3 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
Unique | 5701 ? |
---|---|
Unique (%) | 14.2% |
Sample
1st row | Quentin Rosensweig |
---|---|
2nd row | Neiman Gracie |
3rd row | Richie MartinezRichie Martinez |
4th row | Leo NogueiraLeo Nogueira |
5th row | Romulo AzevedoRomulo Azevedo |
Value | Count | Frequency (%) |
lucas | 888 | 0.9% |
gabriel | 816 | 0.8% |
pedro | 606 | 0.6% |
oliveira | 601 | 0.6% |
silva | 535 | 0.5% |
sousa | 500 | 0.5% |
matheus | 498 | 0.5% |
felipe | 489 | 0.5% |
rafael | 478 | 0.5% |
santos | 468 | 0.5% |
Other values (8414) | 94429 |
Most occurring characters
Value | Count | Frequency (%) |
a | 89902 | 11.3% |
e | 64340 | 8.1% |
o | 63771 | 8.0% |
60266 | 7.6% | |
i | 59477 | 7.5% |
r | 53243 | 6.7% |
n | 46183 | 5.8% |
s | 34768 | 4.4% |
l | 33734 | 4.2% |
u | 23758 | 3.0% |
Other values (67) | 265642 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 611608 | |
Uppercase Letter | 121656 | 15.3% |
Space Separator | 60266 | 7.6% |
Other Punctuation | 1397 | 0.2% |
Dash Punctuation | 155 | < 0.1% |
Final Punctuation | 1 | < 0.1% |
Initial Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
a | 89902 | |
e | 64340 | |
o | 63771 | |
i | 59477 | |
r | 53243 | |
n | 46183 | 7.6% |
s | 34768 | 5.7% |
l | 33734 | 5.5% |
u | 23758 | 3.9% |
t | 22004 | 3.6% |
Other values (31) | 120428 |
Uppercase Letter
Value | Count | Frequency (%) |
M | 13052 | 10.7% |
A | 10013 | 8.2% |
R | 9070 | 7.5% |
S | 8302 | 6.8% |
C | 8084 | 6.6% |
L | 8025 | 6.6% |
J | 7557 | 6.2% |
G | 7543 | 6.2% |
B | 6224 | 5.1% |
D | 5394 | 4.4% |
Other values (19) | 38392 |
Other Punctuation
Value | Count | Frequency (%) |
. | 1389 | |
' | 6 | 0.4% |
, | 2 | 0.1% |
Space Separator
Value | Count | Frequency (%) |
60266 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 155 |
Final Punctuation
Value | Count | Frequency (%) |
” | 1 |
Initial Punctuation
Value | Count | Frequency (%) |
“ | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 733264 | |
Common | 61820 | 7.8% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
a | 89902 | 12.3% |
e | 64340 | 8.8% |
o | 63771 | 8.7% |
i | 59477 | 8.1% |
r | 53243 | 7.3% |
n | 46183 | 6.3% |
s | 34768 | 4.7% |
l | 33734 | 4.6% |
u | 23758 | 3.2% |
t | 22004 | 3.0% |
Other values (60) | 242084 |
Common
Value | Count | Frequency (%) |
60266 | ||
. | 1389 | 2.2% |
- | 155 | 0.3% |
' | 6 | < 0.1% |
, | 2 | < 0.1% |
” | 1 | < 0.1% |
“ | 1 | < 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 794940 | |
None | 142 | < 0.1% |
Punctuation | 2 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
a | 89902 | 11.3% |
e | 64340 | 8.1% |
o | 63771 | 8.0% |
60266 | 7.6% | |
i | 59477 | 7.5% |
r | 53243 | 6.7% |
n | 46183 | 5.8% |
s | 34768 | 4.4% |
l | 33734 | 4.2% |
u | 23758 | 3.0% |
Other values (47) | 265498 |
None
Value | Count | Frequency (%) |
ç | 20 | |
é | 18 | |
ã | 18 | |
ł | 17 | |
á | 17 | |
ó | 13 | |
í | 10 | |
ú | 7 | 4.9% |
ä | 6 | 4.2% |
ô | 4 | 2.8% |
Other values (8) | 12 |
Punctuation
Value | Count | Frequency (%) |
” | 1 | |
“ | 1 |
W/L
Categorical
MISSING
 
Distinct | 3 |
---|---|
Distinct (%) | < 0.1% |
Missing | 850 |
Missing (%) | 2.1% |
Memory size | 2.0 MiB |
W | |
---|---|
L | |
D | 348 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Characters and Unicode
Total characters | 40152 |
---|---|
Distinct characters | 3 |
Distinct categories | 1 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | L |
---|---|
2nd row | L |
3rd row | L |
4th row | L |
5th row | L |
Common Values
Value | Count | Frequency (%) |
W | 28162 | |
L | 11642 | |
D | 348 | 0.8% |
(Missing) | 850 | 2.1% |
Length
Histogram of lengths of the category
Common Values (Plot)
Value | Count | Frequency (%) |
w | 28162 | |
l | 11642 | |
d | 348 | 0.9% |
Most occurring characters
Value | Count | Frequency (%) |
W | 28162 | |
L | 11642 | |
D | 348 | 0.9% |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 40152 |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
W | 28162 | |
L | 11642 | |
D | 348 | 0.9% |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 40152 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
W | 28162 | |
L | 11642 | |
D | 348 | 0.9% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 40152 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
W | 28162 | |
L | 11642 | |
D | 348 | 0.9% |
Method
Text
MISSING
 
Distinct | 478 |
---|---|
Distinct (%) | 1.2% |
Missing | 1862 |
Missing (%) | 4.5% |
Memory size | 2.2 MiB |
Length
Max length | 22 |
---|---|
Median length | 19 |
Mean length | 9.5642054 |
Min length | 2 |
Characters and Unicode
Total characters | 374343 |
---|---|
Distinct characters | 70 |
Distinct categories | 8 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
Unique | 152 ? |
---|---|
Unique (%) | 0.4% |
Sample
1st row | Inside heel hook |
---|---|
2nd row | RNC |
3rd row | Heel hook |
4th row | Points |
5th row | Cross choke |
Value | Count | Frequency (%) |
pts | 14962 | |
choke | 4110 | 5.7% |
adv | 3505 | 4.8% |
referee | 3341 | 4.6% |
decision | 3341 | 4.6% |
2x0 | 3040 | 4.2% |
points | 2989 | 4.1% |
from | 2505 | 3.5% |
back | 2500 | 3.5% |
armbar | 2433 | 3.4% |
Other values (382) | 29584 |
Most occurring characters
Value | Count | Frequency (%) |
33170 | 8.9% | |
e | 30327 | 8.1% |
s | 24704 | 6.6% |
t | 22142 | 5.9% |
o | 21461 | 5.7% |
P | 18312 | 4.9% |
i | 16739 | 4.5% |
r | 16098 | 4.3% |
x | 15050 | 4.0% |
: | 14962 | 4.0% |
Other values (60) | 161378 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 239962 | |
Uppercase Letter | 50066 | 13.4% |
Space Separator | 33170 | 8.9% |
Decimal Number | 31628 | 8.4% |
Other Punctuation | 18468 | 4.9% |
Dash Punctuation | 1047 | 0.3% |
Open Punctuation | 1 | < 0.1% |
Close Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
e | 30327 | |
s | 24704 | |
t | 22142 | 9.2% |
o | 21461 | 8.9% |
i | 16739 | 7.0% |
r | 16098 | 6.7% |
x | 15050 | 6.3% |
a | 12634 | 5.3% |
n | 12615 | 5.3% |
k | 9665 | 4.0% |
Other values (16) | 58527 |
Uppercase Letter
Value | Count | Frequency (%) |
P | 18312 | |
A | 6268 | 12.5% |
C | 5072 | 10.1% |
R | 4941 | 9.9% |
D | 3804 | 7.6% |
T | 2135 | 4.3% |
N | 1602 | 3.2% |
S | 1431 | 2.9% |
K | 1280 | 2.6% |
B | 1125 | 2.2% |
Other values (15) | 4096 | 8.2% |
Decimal Number
Value | Count | Frequency (%) |
0 | 11617 | |
2 | 7995 | |
4 | 3516 | 11.1% |
1 | 2437 | 7.7% |
6 | 1689 | 5.3% |
3 | 1509 | 4.8% |
5 | 1000 | 3.2% |
8 | 788 | 2.5% |
7 | 595 | 1.9% |
9 | 482 | 1.5% |
Other Punctuation
Value | Count | Frequency (%) |
: | 14962 | |
, | 3187 | 17.3% |
/ | 316 | 1.7% |
. | 2 | < 0.1% |
' | 1 | < 0.1% |
Space Separator
Value | Count | Frequency (%) |
33170 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 1047 |
Open Punctuation
Value | Count | Frequency (%) |
( | 1 |
Close Punctuation
Value | Count | Frequency (%) |
) | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 290028 | |
Common | 84315 | 22.5% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
e | 30327 | 10.5% |
s | 24704 | 8.5% |
t | 22142 | 7.6% |
o | 21461 | 7.4% |
P | 18312 | 6.3% |
i | 16739 | 5.8% |
r | 16098 | 5.6% |
x | 15050 | 5.2% |
a | 12634 | 4.4% |
n | 12615 | 4.3% |
Other values (41) | 99946 |
Common
Value | Count | Frequency (%) |
33170 | ||
: | 14962 | |
0 | 11617 | 13.8% |
2 | 7995 | 9.5% |
4 | 3516 | 4.2% |
, | 3187 | 3.8% |
1 | 2437 | 2.9% |
6 | 1689 | 2.0% |
3 | 1509 | 1.8% |
- | 1047 | 1.2% |
Other values (9) | 3186 | 3.8% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 374343 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
33170 | 8.9% | |
e | 30327 | 8.1% |
s | 24704 | 6.6% |
t | 22142 | 5.9% |
o | 21461 | 5.7% |
P | 18312 | 4.9% |
i | 16739 | 4.5% |
r | 16098 | 4.3% |
x | 15050 | 4.0% |
: | 14962 | 4.0% |
Other values (60) | 161378 |
Competition
Text
MISSING
 
Distinct | 1775 |
---|---|
Distinct (%) | 4.4% |
Missing | 870 |
Missing (%) | 2.1% |
Memory size | 2.3 MiB |
Length
Max length | 19 |
---|---|
Median length | 16 |
Mean length | 11.33492 |
Min length | 3 |
Characters and Unicode
Total characters | 454893 |
---|---|
Distinct characters | 71 |
Distinct categories | 7 ? |
Distinct scripts | 2 ? |
Distinct blocks | 2 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
Unique | 314 ? |
---|---|
Unique (%) | 0.8% |
Sample
1st row | Kakuto 5 |
---|---|
2nd row | NoGi Pan Ams |
3rd row | Kakuto Challenge |
4th row | Atlanta W. Open |
5th row | UAEJJF NYC Pro |
Value | Count | Frequency (%) |
open | 7224 | 8.2% |
world | 5231 | 6.0% |
american | 3731 | 4.3% |
champ | 3536 | 4.0% |
pro | 3448 | 3.9% |
adcc | 3267 | 3.7% |
pan | 3205 | 3.7% |
nogi | 2577 | 2.9% |
slam | 1976 | 2.3% |
grand | 1971 | 2.2% |
Other values (1144) | 51590 |
Most occurring characters
Value | Count | Frequency (%) |
47624 | 10.5% | |
a | 35612 | 7.8% |
r | 29361 | 6.5% |
o | 27714 | 6.1% |
n | 27039 | 5.9% |
i | 23595 | 5.2% |
e | 21340 | 4.7% |
l | 19717 | 4.3% |
C | 15666 | 3.4% |
p | 15283 | 3.4% |
Other values (61) | 191942 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 266829 | |
Uppercase Letter | 129075 | |
Space Separator | 47624 | 10.5% |
Other Punctuation | 6260 | 1.4% |
Decimal Number | 5052 | 1.1% |
Dash Punctuation | 51 | < 0.1% |
Math Symbol | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
a | 35612 | |
r | 29361 | |
o | 27714 | |
n | 27039 | |
i | 23595 | |
e | 21340 | |
l | 19717 | |
p | 15283 | 5.7% |
m | 12050 | 4.5% |
s | 11309 | 4.2% |
Other values (19) | 43809 |
Uppercase Letter
Value | Count | Frequency (%) |
C | 15666 | |
A | 13434 | |
O | 12307 | |
P | 9985 | 7.7% |
G | 9566 | 7.4% |
W | 9324 | 7.2% |
S | 9282 | 7.2% |
N | 8701 | 6.7% |
D | 6463 | 5.0% |
B | 5751 | 4.5% |
Other values (16) | 28596 |
Decimal Number
Value | Count | Frequency (%) |
2 | 1460 | |
1 | 1205 | |
3 | 485 | 9.6% |
4 | 398 | 7.9% |
6 | 329 | 6.5% |
5 | 295 | 5.8% |
7 | 241 | 4.8% |
8 | 240 | 4.8% |
0 | 235 | 4.7% |
9 | 164 | 3.2% |
Other Punctuation
Value | Count | Frequency (%) |
. | 6257 | |
: | 2 | < 0.1% |
/ | 1 | < 0.1% |
Space Separator
Value | Count | Frequency (%) |
47624 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 51 |
Math Symbol
Value | Count | Frequency (%) |
+ | 2 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 395904 | |
Common | 58989 | 13.0% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
a | 35612 | 9.0% |
r | 29361 | 7.4% |
o | 27714 | 7.0% |
n | 27039 | 6.8% |
i | 23595 | 6.0% |
e | 21340 | 5.4% |
l | 19717 | 5.0% |
C | 15666 | 4.0% |
p | 15283 | 3.9% |
A | 13434 | 3.4% |
Other values (45) | 167143 |
Common
Value | Count | Frequency (%) |
47624 | ||
. | 6257 | 10.6% |
2 | 1460 | 2.5% |
1 | 1205 | 2.0% |
3 | 485 | 0.8% |
4 | 398 | 0.7% |
6 | 329 | 0.6% |
5 | 295 | 0.5% |
7 | 241 | 0.4% |
8 | 240 | 0.4% |
Other values (6) | 455 | 0.8% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 454881 | |
None | 12 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
47624 | 10.5% | |
a | 35612 | 7.8% |
r | 29361 | 6.5% |
o | 27714 | 6.1% |
n | 27039 | 5.9% |
i | 23595 | 5.2% |
e | 21340 | 4.7% |
l | 19717 | 4.3% |
C | 15666 | 3.4% |
p | 15283 | 3.4% |
Other values (58) | 191930 |
None
Value | Count | Frequency (%) |
ã | 8 | |
ú | 2 | 16.7% |
ó | 2 | 16.7% |
Weight
Text
MISSING
 
Distinct | 109 |
---|---|
Distinct (%) | 0.3% |
Missing | 1301 |
Missing (%) | 3.2% |
Memory size | 2.0 MiB |
Length
Max length | 6 |
---|---|
Median length | 4 |
Mean length | 3.9092718 |
Min length | 1 |
Characters and Unicode
Total characters | 155202 |
---|---|
Distinct characters | 29 |
Distinct categories | 6 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
Unique | 12 ? |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | ABS |
---|---|
2nd row | 94KG |
3rd row | ABS |
4th row | 94KG |
5th row | 94KG |
Value | Count | Frequency (%) |
abs | 8421 | |
70kg | 2970 | 7.5% |
82kg | 2836 | 7.1% |
76kg | 2670 | 6.7% |
88kg | 2460 | 6.2% |
77kg | 2325 | 5.9% |
94kg | 2284 | 5.8% |
85kg | 1551 | 3.9% |
64kg | 1227 | 3.1% |
100kg | 1095 | 2.8% |
Other values (95) | 11863 |
Most occurring characters
Value | Count | Frequency (%) |
K | 31189 | |
G | 31186 | |
7 | 13521 | |
8 | 10425 | 6.7% |
6 | 8568 | 5.5% |
A | 8475 | 5.5% |
S | 8451 | 5.4% |
B | 8430 | 5.4% |
0 | 8164 | 5.3% |
9 | 7148 | 4.6% |
Other values (19) | 19645 |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 89901 | |
Decimal Number | 65287 | |
Lowercase Letter | 10 | < 0.1% |
Other Punctuation | 2 | < 0.1% |
Math Symbol | 1 | < 0.1% |
Space Separator | 1 | < 0.1% |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
K | 31189 | |
G | 31186 | |
A | 8475 | 9.4% |
S | 8451 | 9.4% |
B | 8430 | 9.4% |
O | 1975 | 2.2% |
W | 54 | 0.1% |
U | 37 | < 0.1% |
L | 36 | < 0.1% |
F | 24 | < 0.1% |
Other values (4) | 44 | < 0.1% |
Decimal Number
Value | Count | Frequency (%) |
7 | 13521 | |
8 | 10425 | |
6 | 8568 | |
0 | 8164 | |
9 | 7148 | |
2 | 4234 | 6.5% |
4 | 4211 | 6.4% |
5 | 3999 | 6.1% |
1 | 3998 | 6.1% |
3 | 1019 | 1.6% |
Lowercase Letter
Value | Count | Frequency (%) |
g | 6 | |
k | 4 |
Other Punctuation
Value | Count | Frequency (%) |
/ | 2 |
Math Symbol
Value | Count | Frequency (%) |
+ | 1 |
Space Separator
Value | Count | Frequency (%) |
1 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 89911 | |
Common | 65291 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
K | 31189 | |
G | 31186 | |
A | 8475 | 9.4% |
S | 8451 | 9.4% |
B | 8430 | 9.4% |
O | 1975 | 2.2% |
W | 54 | 0.1% |
U | 37 | < 0.1% |
L | 36 | < 0.1% |
F | 24 | < 0.1% |
Other values (6) | 54 | 0.1% |
Common
Value | Count | Frequency (%) |
7 | 13521 | |
8 | 10425 | |
6 | 8568 | |
0 | 8164 | |
9 | 7148 | |
2 | 4234 | 6.5% |
4 | 4211 | 6.4% |
5 | 3999 | 6.1% |
1 | 3998 | 6.1% |
3 | 1019 | 1.6% |
Other values (3) | 4 | < 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 155202 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
K | 31189 | |
G | 31186 | |
7 | 13521 | |
8 | 10425 | 6.7% |
6 | 8568 | 5.5% |
A | 8475 | 5.5% |
S | 8451 | 5.4% |
B | 8430 | 5.4% |
0 | 8164 | 5.3% |
9 | 7148 | 4.6% |
Other values (19) | 19645 |
Stage
Categorical
MISSING
 
Distinct | 44 |
---|---|
Distinct (%) | 0.1% |
Missing | 1048 |
Missing (%) | 2.6% |
Memory size | 2.0 MiB |
SF | |
---|---|
4F | |
F | |
R1 | |
SPF | |
Other values (39) |
Length
Max length | 4 |
---|---|
Median length | 2 |
Mean length | 1.8837663 |
Min length | 1 |
Characters and Unicode
Total characters | 75264 |
---|---|
Distinct characters | 23 |
Distinct categories | 3 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
Unique | 6 ? |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | SPF |
---|---|
2nd row | SF |
3rd row | SF |
4th row | SF |
5th row | SF |
Common Values
Value | Count | Frequency (%) |
SF | 10034 | |
4F | 8718 | |
F | 8129 | |
R1 | 4325 | |
SPF | 2177 | 5.3% |
8F | 2088 | 5.1% |
RR | 1497 | 3.7% |
R2 | 1178 | 2.9% |
RPC | 515 | 1.3% |
3RD | 492 | 1.2% |
Other values (34) | 801 | 2.0% |
(Missing) | 1048 | 2.6% |
Length
Histogram of lengths of the category
Value | Count | Frequency (%) |
sf | 10034 | |
4f | 8718 | |
f | 8129 | |
r1 | 4325 | |
spf | 2177 | 5.4% |
8f | 2088 | 5.2% |
rr | 1497 | 3.7% |
r2 | 1178 | 2.9% |
rpc | 515 | 1.3% |
3rd | 492 | 1.2% |
Other values (34) | 806 | 2.0% |
Most occurring characters
Value | Count | Frequency (%) |
F | 31158 | |
S | 12345 | 16.4% |
R | 9969 | 13.2% |
4 | 8786 | 11.7% |
1 | 4388 | 5.8% |
P | 2948 | 3.9% |
8 | 2107 | 2.8% |
2 | 1184 | 1.6% |
3 | 748 | 1.0% |
D | 651 | 0.9% |
Other values (13) | 980 | 1.3% |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 57984 | |
Decimal Number | 17275 | 23.0% |
Space Separator | 5 | < 0.1% |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
F | 31158 | |
S | 12345 | 21.3% |
R | 9969 | 17.2% |
P | 2948 | 5.1% |
D | 651 | 1.1% |
C | 565 | 1.0% |
G | 230 | 0.4% |
L | 50 | 0.1% |
E | 42 | 0.1% |
K | 22 | < 0.1% |
Other values (2) | 4 | < 0.1% |
Decimal Number
Value | Count | Frequency (%) |
4 | 8786 | |
1 | 4388 | |
8 | 2107 | 12.2% |
2 | 1184 | 6.9% |
3 | 748 | 4.3% |
5 | 23 | 0.1% |
6 | 18 | 0.1% |
7 | 11 | 0.1% |
9 | 6 | < 0.1% |
0 | 4 | < 0.1% |
Space Separator
Value | Count | Frequency (%) |
5 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 57984 | |
Common | 17280 | 23.0% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
F | 31158 | |
S | 12345 | 21.3% |
R | 9969 | 17.2% |
P | 2948 | 5.1% |
D | 651 | 1.1% |
C | 565 | 1.0% |
G | 230 | 0.4% |
L | 50 | 0.1% |
E | 42 | 0.1% |
K | 22 | < 0.1% |
Other values (2) | 4 | < 0.1% |
Common
Value | Count | Frequency (%) |
4 | 8786 | |
1 | 4388 | |
8 | 2107 | 12.2% |
2 | 1184 | 6.9% |
3 | 748 | 4.3% |
5 | 23 | 0.1% |
6 | 18 | 0.1% |
7 | 11 | 0.1% |
9 | 6 | < 0.1% |
5 | < 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 75264 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
F | 31158 | |
S | 12345 | 16.4% |
R | 9969 | 13.2% |
4 | 8786 | 11.7% |
1 | 4388 | 5.8% |
P | 2948 | 3.9% |
8 | 2107 | 2.8% |
2 | 1184 | 1.6% |
3 | 748 | 1.0% |
D | 651 | 0.9% |
Other values (13) | 980 | 1.3% |
Year
Real number (ℝ)
MISSING
 
Distinct | 52 |
---|---|
Distinct (%) | 0.1% |
Missing | 850 |
Missing (%) | 2.1% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2018.838 |
Minimum | 1932 |
---|---|
Maximum | 2024 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 320.5 KiB |
Quantile statistics
Minimum | 1932 |
---|---|
5-th percentile | 2011 |
Q1 | 2017 |
median | 2020 |
Q3 | 2022 |
95-th percentile | 2024 |
Maximum | 2024 |
Range | 92 |
Interquartile range (IQR) | 5 |
Descriptive statistics
Standard deviation | 4.7376643 |
---|---|
Coefficient of variation (CV) | 0.0023467283 |
Kurtosis | 38.62521 |
Mean | 2018.838 |
Median Absolute Deviation (MAD) | 2 |
Skewness | -3.5455842 |
Sum | 81060385 |
Variance | 22.445463 |
Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%) |
2022 | 5504 | |
2023 | 5231 | |
2021 | 4425 | |
2019 | 4275 | |
2018 | 3765 | |
2017 | 3029 | |
2024 | 2808 | |
2020 | 2187 | 5.3% |
2016 | 2127 | 5.2% |
2015 | 2078 | 5.1% |
Other values (42) | 4723 |
Value | Count | Frequency (%) |
1932 | 3 | |
1934 | 2 | |
1935 | 2 | |
1936 | 3 | |
1937 | 1 | < 0.1% |
1950 | 2 | |
1951 | 3 | |
1954 | 1 | < 0.1% |
1955 | 1 | < 0.1% |
1973 | 1 | < 0.1% |
Value | Count | Frequency (%) |
2024 | 2808 | |
2023 | 5231 | |
2022 | 5504 | |
2021 | 4425 | |
2020 | 2187 | 5.3% |
2019 | 4275 | |
2018 | 3765 | |
2017 | 3029 | |
2016 | 2127 | 5.2% |
2015 | 2078 | 5.1% |
Stage | W/L | Year | |
---|---|---|---|
Stage | 1.000 | 0.224 | 0.094 |
W/L | 0.224 | 1.000 | 0.077 |
Year | 0.094 | 0.077 | 1.000 |
A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
URL Tag | Name | Opponent | W/L | Method | Competition | Weight | Stage | Year | |
---|---|---|---|---|---|---|---|---|---|
0 | aarae-alexander | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
1 | aaron-johnson | aaron-johnson | Quentin Rosensweig | L | Inside heel hook | Kakuto 5 | ABS | SPF | 2015.0 |
2 | aaron-johnson | aaron-johnson | Neiman Gracie | L | RNC | NoGi Pan Ams | 94KG | SF | 2015.0 |
3 | aaron-johnson | aaron-johnson | Richie MartinezRichie Martinez | L | Heel hook | Kakuto Challenge | ABS | SF | 2015.0 |
4 | aaron-johnson | aaron-johnson | Leo NogueiraLeo Nogueira | L | Points | Atlanta W. Open | 94KG | SF | 2016.0 |
5 | aaron-johnson | aaron-johnson | Romulo AzevedoRomulo Azevedo | L | NaN | UAEJJF NYC Pro | 94KG | SF | 2016.0 |
6 | aaron-johnson | aaron-johnson | Abraham MarteAbraham Marte | L | Cross choke | UAEJJF NYC Pro | HWABS | 4F | 2016.0 |
7 | aaron-johnson | aaron-johnson | Andre GalvaoAndre Galvao | L | Choke | Pan American | ABS | R2 | 2016.0 |
8 | aaron-johnson | aaron-johnson | Joao Soares | L | Triangle | Boston Spring O. | 100KG | F | 2016.0 |
9 | aaron-johnson | aaron-johnson | Bernardo FariaBernardo Faria | L | Triangle armbar | World Champ. | ABS | R2 | 2016.0 |
URL Tag | Name | Opponent | W/L | Method | Competition | Weight | Stage | Year | |
---|---|---|---|---|---|---|---|---|---|
40992 | vinicius-garcia | vinicius-garcia | Pedro Palhares | W | NaN | Nashville Fall Open | ABS | F | 2018.0 |
40993 | vinicius-garcia | vinicius-garcia | Stanley Rosa | W | Points | American Nats | 88KG | 4F | 2019.0 |
40994 | vinicius-garcia | vinicius-garcia | Juan Cleber | W | Canto choke | American Nats | ABS | 4F | 2019.0 |
40995 | vinicius-garcia | vinicius-garcia | Jean Cartagena | W | NaN | Austin SMO | ABS | 4F | 2019.0 |
40996 | vinicius-garcia | vinicius-garcia | Andre Reis | W | NaN | Austin SMO | ABS | SF | 2019.0 |
40997 | vinicius-garcia | vinicius-garcia | Cody Heller | W | NaN | Atlanta SM Open | ABS | 4F | 2019.0 |
40998 | vinicius-garcia | vinicius-garcia | Daniel Olivier | W | Canto choke | New Orleans Open | 88KG | SF | 2020.0 |
40999 | vinicius-garcia | vinicius-garcia | Joshua Murdock | W | Points | New Orleans Open | ABS | SF | 2020.0 |
41000 | vinicius-garcia | vinicius-garcia | Kyle Raemisch | W | Mounted X choke | F2W 153 | 85KG | SPF | 2020.0 |
41001 | vinicius-garcia | vinicius-garcia | Kevin Vieira | W | Hashimoto choke | Pan American | 82KG | 8F | 2020.0 |
Most frequently occurring
URL Tag | Name | Opponent | W/L | Method | Competition | Weight | Stage | Year | # duplicates | |
---|---|---|---|---|---|---|---|---|---|---|
0 | adam-wardzinski | adam-wardzinski | Arya EsfandmazArya Esfandmaz | D | --- | Polaris Squads 2 | ABS | RR | 2020.0 | 2 |
1 | aj-agazarm | aj-agazarm | Choi Choi | W | Pts: 32x0 | American Nats | 76KG | 4F | 2013.0 | 2 |
2 | alex-munis | alex-munis | Gabriel CostaGabriel Costa | D | --- | Fenajitsu | 88KG | SPF | 2021.0 | 2 |
3 | andre-almeida | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 2 |
4 | arya-esfandmaz | arya-esfandmaz | Adam WardzinskiAdam Wardzinski | D | --- | Polaris Squads 2 | ABS | RR | 2020.0 | 2 |
5 | brenda-larissa | brenda-larissa | Alexa Yanes | W | Pts: 1x0 | Grand Slam MSK | 55KG | RR | 2021.0 | 2 |
6 | bruno-fernandes | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 2 |
7 | claudio-calasans | claudio-calasans | Lucas GualbertoLucas Gualberto | L | Pts: 2x0 | Fenajitsu | 82KG | SPF | 2021.0 | 2 |
8 | claudio-calasans | claudio-calasans | Sonoda Kendy | W | Armbar | Asian Open | 88KG | SF | 2014.0 | 2 |
9 | eldar-rafigaev | eldar-rafigaev | Max Arnold | W | Footlock | Croatia Pro | ABS | RR | 2018.0 | 2 |