Table 11 Top 10 genes mentioned by each country

From: Integrating text mining, data mining, and network analysis for identifying genetic breast cancer trends

| Country name | # of abstracts | Gene name, count [% of abstracts] | Country name | # of abstracts | Gene name, count [% of abstracts] |
| --- | --- | --- | --- | --- | --- |
| United States | 33,373 | ESR1 5429 [16.27 %] | Germany | 4148 | ERBB2 620 [14.95 %] |
| | | ERBB2 4271 [12.8 %] | | | ESR1 517 [12.46 %] |
| | | EGF 2199 [6.59 %] | | | PGR 245 [5.91 %] |
| | | PGR 1887 [5.65 %] | | | EGF 239 [5.76 %] |
| | | BRCA1 1845 [5.53 %] | | | CDKN2A 218 [5.26 %] |
| | | CDKN2A 1809 [5.42 %] | | | SLC20A2 191 [4.6 %] |
| | | SLC20A2 1418 [4.25 %] | | | BRCA1 140 [3.38 %] |
| | | TKT 1297 [3.89 %] | | | CYP19A1 120 [2.89 %] |
| | | ACAD9 1143 [3.42 %] | | | KRT75 120 [2.89 %] |
| | | CYP19A1 1073 [3.22 %] | | | TKT 116 [2.8 %] |
| United Kingdom | 6041 | ESR1 1249 [20.68 %] | France | 3642 | ESR1 569 [15.62 %] |
| | | ERBB2 674 [11.16 %] | | | ERBB2 486 [13.34 %] |
| | | CYP19A1 425 [7.04 %] | | | PGR 294 [8.07 %] |
| | | EGF 408 [6.75 %] | | | CDKN2A 224 [6.15 %] |
| | | BRCA1 395 [6.54 %] | | | BRCA1 222 [6.1 %] |
| | | CDKN2A 325 [5.38 %] | | | EGF 173 [4.75 %] |
| | | PGR 311 [5.15 %] | | | SLC20A2 165 [4.53 %] |
| | | BRCA2 256 [4.24 %] | | | TKT 131 [3.6 %] |
| | | SLC20A2 227 [3.76 %] | | | CYP19A1 120 [3.29 %] |
| | | INS 188 [3.11 %] | | | CTSD 114 [3.13 %] |
| China | 6553 | ERBB2 799 [12.19 %] | Canada | 3573 | ESR1 515 [14.41 %] |
| | | ESR1 764 [11.66 %] | | | ERBB2 433 [12.12 %] |
| | | CDKN2A 431 [6.58 %] | | | BRCA1 304 [8.51 %] |
| | | PGR 385 [5.88 %] | | | EGF 205 [5.74 %] |
| | | EGF 378 [5.77 %] | | | BRCA2 203 [5.68 %] |
| | | ACAD9 336 [5.13 %] | | | PGR 188 [5.26 %] |
| | | MYLIP 327 [4.99 %] | | | CDKN2A 186 [5.21 %] |
| | | BCL2 312 [4.76 %] | | | INS 146 [4.09 %] |
| | | ABCB1 209 [3.19 %] | | | TKT 137 [3.83 %] |
| | | CASP3 203 [3.1 %] | | | SLC20A2 136 [3.81 %] |
| Japan | 5299 | ESR1 918 [17.32 %] | The Netherlands | 1844 | ESR1 267 [14.48 %] |
| | | ERBB2 806 [15.21 %] | | | BRCA1 218 [11.82 %] |
| | | PGR 456 [8.61 %] | | | ERBB2 181 [9.82 %] |
| | | EGF 394 [7.44 %] | | | BRCA2 115 [6.24 %] |
| | | CDKN2A 340 [6.42 %] | | | PGR 115 [6.24 %] |
| | | CYP19A1 210 [3.96 %] | | | EGF 97 [5.26 %] |
| | | SLC20A2 159 [3 %] | | | CDKN2A 90 [4.88 %] |
| | | CEACAM3 151 [2.85 %] | | | SLC20A2 82 [4.45 %] |
| | | BCL2L14 129 [2.43 %] | | | ABCB1 81 [4.39 %] |
| | | ABCB1 129 [2.43 %] | | | BCL2L14 69 [3.74 %] |
| Italy | 4621 | ERBB2 808 [17.49 %] | Australia | 1715 | ESR1 260 [15.16 %] |
| | | ESR1 727 [15.73 %] | | | ERBB2 166 [9.68 %] |
| | | PGR 404 [8.74 %] | | | PGR 123 [7.17 %] |
| | | EGF 298 [6.45 %] | | | BRCA1 120 [7 %] |
| | | CDKN2A 297 [6.43 %] | | | EGF 94 [5.48 %] |
| | | SLC20A2 238 [5.15 %] | | | SLC20A2 85 [4.96 %] |
| | | BRCA1 197 [4.26 %] | | | BRCA2 73 [4.26 %] |
| | | INS 171 [3.7 %] | | | INS 72 [4.2 %] |
| | | TKT 159 [3.44 %] | | | ARL11 71 [4.14 %] |
| | | CYP19A1 156 [3.38 %] | | | CDKN2A 68 [3.97 %] |