Config
helical.models.geneformer.GeneformerConfig
Configuration class to use the Geneformer Model.
Parameters:
| Name | Type | Description | Default |
|---|---|---|---|
model_name
|
Literal[
|
"gf-6L-10M-i2048", "gf-12L-38M-i4096", "gf-12L-38M-i4096-CLcancer", "gf-12L-40M-i2048", "gf-12L-40M-i2048-CZI-CellxGene", "gf-12L-104M-i4096", "gf-12L-104M-i4096-CLcancer", "gf-20L-151M-i4096", "gf-18L-316M-i4096", ], default="gf-12L-38M-i4096" The name of the model. |
'gf-12L-38M-i4096'
|
batch_size
|
int
|
The batch size |
= 24
|
emb_layer
|
int
|
The embedding layer |
= -1
|
emb_mode
|
Literal['cls', 'cell', 'gene']
|
The embedding mode to use. "cls" is only available for Geneformer v2 models, returning the embeddings of the cls token. For cell level embeddings, a mean across all embeddings excluding the cls token is returned. For gene level embeddings, each gene token embedding is returned along with the corresponding ensembl ID. |
"cell"
|
device
|
str
|
The device to use. Accepts any string torch.device accepts, e.g. "cpu", "cuda", "cuda:0". |
"cpu"
|
nproc
|
int
|
Number of processes to use for data processing. |
1
|
output_attentions
|
bool
|
Whether to return attention weights from get_embeddings. Must be set at construction time: True forces eager attention (required for attention output; flash_attention_2 and sdpa do not support it), False uses flash_attention_2 when available, else sdpa. Note: eager attention materialises the full O(seq²) matrix and may OOM on long sequences or large batches. |
False
|
custom_attr_name_dict
|
dict
|
A dictionary that contains the names of the custom attributes to be added to the dataset. The keys of the dictionary are the names of the custom attributes, and the values are the names of the columns in adata.obs. For example, if you want to add a custom attribute called "cell_type" to the dataset, you would pass custom_attr_name_dict = {"cell_type": "cell_type"}. If you do not want to add any custom attributes, you can leave this parameter as None. |
None
|
Returns:
| Type | Description |
|---|---|
GeneformerConfig
|
The Geneformer configuration object |
Source code in helical/models/geneformer/geneformer_config.py
10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 213 214 215 216 217 218 219 220 221 222 223 224 | |