torch_openreml.covariance.EquicorrelationMatrix¶
- class torch_openreml.covariance.EquicorrelationMatrix(n, param_names=None, trans=None, no_grad_index=None)[source]¶
Bases:
MatrixEquicorrelation matrix with a single shared correlation parameter.
\[\symbf{V} = (1 - \rho)\symbf{I}_n + \rho\symbf{J}_n\]where \(\symbf{I}_n\) is the identity matrix and \(\symbf{J}_n\) is the matrix of ones. All diagonal entries equal one and all off-diagonal entries equal \(\rho\).
For \(\symbf{V}\) to be positive definite, the correlation parameter must satisfy \(\rho > -1/(n-1)\). The default transform enforces this by mapping an unconstrained scalar through a sigmoid scaled to \((-1/(n-1),\, 1)\).
Unlike
CompoundSymmetricMatrix, this matrix has no variance parameter.Initialize an equicorrelation matrix of size
n x n.- Parameters:
n (int) – Matrix dimension.
param_names (list of str, optional) – Name for the correlation parameter. Defaults to
["rho"].trans (list of Transform, optional) – Transform applied to the parameter. Defaults to a sigmoid scaled to \((-1/(n-1),\, 1)\).
no_grad_index (list of int, optional) – Indices of parameters to exclude from gradient computation.
Example:
import torch from torch_openreml.covariance import EquicorrelationMatrix mat = EquicorrelationMatrix(3) params = torch.tensor([0.0]) print(mat(params)) print(mat.grad(params))
tensor([[1.0000, 0.2500, 0.2500], [0.2500, 1.0000, 0.2500], [0.2500, 0.2500, 1.0000]]) (tensor([[[0.0000, 0.3750, 0.3750], [0.3750, 0.0000, 0.3750], [0.3750, 0.3750, 0.0000]]]), ['rho'])Methods
__call__(params)Construct the matrix from a flat parameter tensor.
auto_grad(params)Compute the Jacobian of
build()with respect to trainable parameters using automatic differentiation.check_params(params)Validate a parameter tensor and return its device and dtype.
from_param_dict(param_dict)Extract parameter tensors from a dictionary into a flat 1D tensor.
get_intermediates(params)Retrieve cached intermediate computation results if still valid.
grad(params)Compute the Jacobian of
__call__()with respect to trainable parameters.manual_grad(params)Compute the Jacobian of
__call__()with respect to trainable parameters using a closed-form analytic expression.map_theta_to_dv(theta)An interface compatible with
torch_openreml.REMLthat maps parameters to the matrix Jacobian.map_theta_to_v(theta)An interface compatible with
torch_openreml.REMLthat maps parameters to a matrix.reset_intermediates()Clear the intermediate computation cache.
set_intermediates(params, intermediates)Cache intermediate computation results keyed by parameter hash.
set_no_grad([index, param_name])Set the indices of parameters to exclude from gradient computation.
to_param_dict(params)Convert a flat parameter tensor to a parameter dictionary.
trans_grad(params)Compute the element-wise derivative of the parameter transforms.
trans_params(params)Apply parameter transforms to a flat parameter tensor.
Attributes
no_grad_indexIndices of parameters excluded from gradient computation.
num_paramsTotal number of parameters.
param_namesOrdered parameter names.
repr_dictKey-value pairs used to build the string representation.
shapeOutput matrix shape.
transParameter transforms.
- __call__(params)[source]¶
Construct the matrix from a flat parameter tensor.
Must be implemented by subclasses. Implementations should convert
paramsviafrom_param_dict()orto_param_dict(), then callcheck_params()to validate andtrans_params()to apply transforms before any computation.- Parameters:
params (torch.Tensor or dict) – Flat 1D parameter tensor or parameter dictionary.
- Returns:
Constructed matrix of shape
shape.- Return type:
torch.Tensor
- manual_grad(params)[source]¶
Compute the Jacobian of
__call__()with respect to trainable parameters using a closed-form analytic expression.- Parameters:
params (torch.Tensor or dict) – Flat 1D parameter tensor or parameter dictionary.
- Returns:
(grad, grad_names), wheregradis a 3D tensor of shape(num_params - len(no_grad_index), *shape)andgrad_namesis a list of the corresponding parameter names. Returns(None, [])if all parameters are excluded from gradient computation.- Return type:
tuple