heads¶
heads
提供了一系列的模型预测头,用于处理不同的任务。
heads
接受 ModelOutupt
、dict
或 tuple
作为输入。
它会自动查找预测所需的模型输出并相应地处理。
一些预测头可能需要额外的信息,例如 attention_mask
或 input_ids
,例如 ContactPredictionHead
。
这些额外的参数可以作为参数/关键字参数传入。
请注意,heads
使用与 Transformers 相同的 ModelOutupt
约定。
如果模型输出是一个 tuple
,我们将第一个元素视为 pooler_output
,第二个元素视为 last_hidden_state
,最后一个元素视为 attention_map
。
用户有责任确保模型输出格式正确。
如果模型输出是一个 ModelOutupt
或一个 dict
,heads
将从模型输出中查找 HeadConfig.output_name
。
你可以在 HeadConfig
中指定 output_name
,以确保 heads
可以正确定位所需的张量。
multimolecule.module.heads.config
¶
HeadConfig
¶
Bases: BaseHeadConfig
Configuration class for a prediction head.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
|
Number of labels to use in the last layer added to the model, typically for a classification task. Head should look for |
required | |
|
Problem type for Head should look for |
required | |
|
Dimensionality of the encoder layers and the pooler layer. Head should look for |
required | |
|
The dropout ratio for the hidden states. |
required | |
|
The transform operation applied to hidden states. |
required | |
|
The activation function of transform applied to hidden states. |
required | |
|
Whether to apply bias to the final prediction layer. |
required | |
|
The activation function of the final prediction output. |
required | |
|
The epsilon used by the layer normalization layers. |
required | |
|
The name of the tensor required in model outputs. If is |
required | |
|
The type of the head in the model. This is used by [ |
required |
Source code in multimolecule/module/heads/config.py
MaskedLMHeadConfig
¶
Bases: BaseHeadConfig
Configuration class for a Masked Language Modeling head.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
|
Dimensionality of the encoder layers and the pooler layer. Head should look for |
required | |
|
The dropout ratio for the hidden states. |
required | |
|
The transform operation applied to hidden states. |
required | |
|
The activation function of transform applied to hidden states. |
required | |
|
Whether to apply bias to the final prediction layer. |
required | |
|
The activation function of the final prediction output. |
required | |
|
The epsilon used by the layer normalization layers. |
required | |
|
The name of the tensor required in model outputs. If is |
required |
Source code in multimolecule/module/heads/config.py
multimolecule.module.heads.sequence
¶
SequencePredictionHead
¶
Bases: PredictionHead
Head for tasks in sequence-level.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
|
PreTrainedConfig
|
The configuration object for the model. |
required |
|
HeadConfig | None
|
The configuration object for the head.
If None, will use configuration from the |
None
|
Source code in multimolecule/module/heads/sequence.py
output_name
class-attribute
instance-attribute
¶
output_name: str = 'pooler_output'
The default output to use for the head.
forward
¶
forward(outputs: ModelOutput | Tuple[Tensor, ...], labels: Tensor | None = None, output_name: str | None = None, **kwargs) -> HeadOutput
Forward pass of the SequencePredictionHead.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
|
ModelOutput | Tuple[Tensor, ...]
|
The outputs of the model. |
required |
|
Tensor | None
|
The labels for the head. |
None
|
|
str | None
|
The name of the output to use.
Defaults to |
None
|
Source code in multimolecule/module/heads/sequence.py
multimolecule.module.heads.token
¶
TokenPredictionHead
¶
Bases: PredictionHead
Head for tasks in token-level.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
|
PreTrainedConfig
|
The configuration object for the model. |
required |
|
HeadConfig | None
|
The configuration object for the head.
If None, will use configuration from the |
None
|
Source code in multimolecule/module/heads/token.py
output_name
class-attribute
instance-attribute
¶
output_name: str = 'last_hidden_state'
The default output to use for the head.
forward
¶
forward(outputs: ModelOutput | Tuple[Tensor, ...], attention_mask: Tensor | None = None, input_ids: NestedTensor | Tensor | None = None, labels: Tensor | None = None, output_name: str | None = None, **kwargs) -> HeadOutput
Forward pass of the TokenPredictionHead.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
|
ModelOutput | Tuple[Tensor, ...]
|
The outputs of the model. |
required |
|
Tensor | None
|
The attention mask for the inputs. |
None
|
|
NestedTensor | Tensor | None
|
The input ids for the inputs. |
None
|
|
Tensor | None
|
The labels for the head. |
None
|
|
str | None
|
The name of the output to use.
Defaults to |
None
|
Source code in multimolecule/module/heads/token.py
TokenKMerHead
¶
Bases: PredictionHead
Head for tasks in token-level with kmer inputs.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
|
PreTrainedConfig
|
The configuration object for the model. |
required |
|
HeadConfig | None
|
The configuration object for the head.
If None, will use configuration from the |
None
|
Source code in multimolecule/module/heads/token.py
output_name
class-attribute
instance-attribute
¶
output_name: str = 'last_hidden_state'
The default output to use for the head.
forward
¶
forward(outputs: ModelOutput | Tuple[Tensor, ...], attention_mask: Tensor | None = None, input_ids: NestedTensor | Tensor | None = None, labels: Tensor | None = None, output_name: str | None = None, **kwargs) -> HeadOutput
Forward pass of the TokenKMerHead.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
|
ModelOutput | Tuple[Tensor, ...]
|
The outputs of the model. |
required |
|
Tensor | None
|
The attention mask for the inputs. |
None
|
|
NestedTensor | Tensor | None
|
The input ids for the inputs. |
None
|
|
Tensor | None
|
The labels for the head. |
None
|
|
str | None
|
The name of the output to use.
Defaults to |
None
|
Source code in multimolecule/module/heads/token.py
multimolecule.module.heads.contact
¶
ContactPredictionHead
¶
Bases: PredictionHead
Head for tasks in contact-level.
Performs symmetrization, and average product correct.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
|
PreTrainedConfig
|
The configuration object for the model. |
required |
|
HeadConfig | None
|
The configuration object for the head.
If None, will use configuration from the |
None
|
Source code in multimolecule/module/heads/contact.py
Python | |
---|---|
87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 |
|
output_name
class-attribute
instance-attribute
¶
output_name: str = 'attentions'
The default output to use for the head.
forward
¶
forward(outputs: ModelOutput | Mapping | Tuple[Tensor, ...], attention_mask: Tensor | None = None, input_ids: NestedTensor | Tensor | None = None, labels: Tensor | None = None, output_name: str | None = None, **kwargs) -> HeadOutput
Forward pass of the ContactPredictionHead.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
|
ModelOutput | Mapping | Tuple[Tensor, ...]
|
The outputs of the model. |
required |
|
Tensor | None
|
The attention mask for the inputs. |
None
|
|
NestedTensor | Tensor | None
|
The input ids for the inputs. |
None
|
|
Tensor | None
|
The labels for the head. |
None
|
|
str | None
|
The name of the output to use.
Defaults to |
None
|
Source code in multimolecule/module/heads/contact.py
ContactLogitsHead
¶
Bases: PredictionHead
Head for tasks in contact-level.
Performs symmetrization, and average product correct.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
|
PreTrainedConfig
|
The configuration object for the model. |
required |
|
HeadConfig | None
|
The configuration object for the head.
If None, will use configuration from the |
None
|
Source code in multimolecule/module/heads/contact.py
output_name
class-attribute
instance-attribute
¶
output_name: str = 'last_hidden_state'
The default output to use for the head.
forward
¶
forward(outputs: ModelOutput | Mapping | Tuple[Tensor, ...], attention_mask: Tensor | None = None, input_ids: NestedTensor | Tensor | None = None, labels: Tensor | None = None, output_name: str | None = None, **kwargs) -> HeadOutput
Forward pass of the ContactPredictionHead.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
|
ModelOutput | Mapping | Tuple[Tensor, ...]
|
The outputs of the model. |
required |
|
Tensor | None
|
The attention mask for the inputs. |
None
|
|
NestedTensor | Tensor | None
|
The input ids for the inputs. |
None
|
|
Tensor | None
|
The labels for the head. |
None
|
|
str | None
|
The name of the output to use.
Defaults to |
None
|
Source code in multimolecule/module/heads/contact.py
symmetrize
¶
average_product_correct
¶
Perform average product correct, used for contact prediction.
Source code in multimolecule/module/heads/contact.py
Python | |
---|---|
multimolecule.module.heads.pretrain
¶
MaskedLMHead
¶
Bases: Module
Head for masked language modeling.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
|
PreTrainedConfig
|
The configuration object for the model. |
required |
|
MaskedLMHeadConfig | None
|
The configuration object for the head.
If None, will use configuration from the |
None
|
Source code in multimolecule/module/heads/pretrain.py
output_name
class-attribute
instance-attribute
¶
output_name: str = 'last_hidden_state'
The default output to use for the head.
forward
¶
forward(outputs: ModelOutput | Tuple[Tensor, ...], labels: Tensor | None = None, output_name: str | None = None) -> HeadOutput
Forward pass of the MaskedLMHead.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
|
ModelOutput | Tuple[Tensor, ...]
|
The outputs of the model. |
required |
|
Tensor | None
|
The labels for the head. |
None
|
|
str | None
|
The name of the output to use.
Defaults to |
None
|
Source code in multimolecule/module/heads/pretrain.py
multimolecule.module.heads.generic
¶
PredictionHead
¶
Bases: Module
Head for all-level of tasks.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
|
PreTrainedConfig
|
The configuration object for the model. |
required |
|
HeadConfig | None
|
The configuration object for the head.
If None, will use configuration from the |
None
|
Source code in multimolecule/module/heads/generic.py
Python | |
---|---|
42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 |
|
forward
¶
forward(embeddings: Tensor, labels: Tensor | None, **kwargs) -> HeadOutput
Forward pass of the PredictionHead.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
|
Tensor
|
The embeddings to be passed through the head. |
required |
|
Tensor | None
|
The labels for the head. |
required |