#include <nnet-simple-component.h>
Public Member Functions | |
virtual int32 | InputDim () const |
Returns input-dimension of this component. More... | |
virtual int32 | OutputDim () const |
Returns output-dimension of this component. More... | |
BaseFloat | OrthonormalConstraint () const |
virtual std::string | Info () const |
Returns some text-form information about this component, for diagnostics. More... | |
virtual void | InitFromConfig (ConfigLine *cfl) |
Initialize, from a ConfigLine object. More... | |
AffineComponent () | |
virtual std::string | Type () const |
Returns a string such as "SigmoidComponent", describing the type of the object. More... | |
virtual int32 | Properties () const |
Return bitmask of the component's properties. More... | |
virtual void * | Propagate (const ComponentPrecomputedIndexes *indexes, const CuMatrixBase< BaseFloat > &in, CuMatrixBase< BaseFloat > *out) const |
Propagate function. More... | |
virtual void | Backprop (const std::string &debug_info, const ComponentPrecomputedIndexes *indexes, const CuMatrixBase< BaseFloat > &in_value, const CuMatrixBase< BaseFloat > &, const CuMatrixBase< BaseFloat > &out_deriv, void *memo, Component *to_update, CuMatrixBase< BaseFloat > *in_deriv) const |
Backprop function; depending on which of the arguments 'to_update' and 'in_deriv' are non-NULL, this can compute input-data derivatives and/or perform model update. More... | |
virtual void | Read (std::istream &is, bool binary) |
Read function (used after we know the type of the Component); accepts input that is missing the token that describes the component type, in case it has already been consumed. More... | |
virtual void | Write (std::ostream &os, bool binary) const |
Write component to stream. More... | |
virtual Component * | Copy () const |
Copies component (deep copy). More... | |
virtual void | Scale (BaseFloat scale) |
This virtual function when called on – an UpdatableComponent scales the parameters by "scale" when called by an UpdatableComponent. More... | |
virtual void | Add (BaseFloat alpha, const Component &other) |
This virtual function when called by – an UpdatableComponent adds the parameters of another updatable component, times some constant, to the current parameters. More... | |
virtual void | PerturbParams (BaseFloat stddev) |
This function is to be used in testing. More... | |
virtual BaseFloat | DotProduct (const UpdatableComponent &other) const |
Computes dot-product between parameters of two instances of a Component. More... | |
virtual int32 | NumParameters () const |
The following new virtual function returns the total dimension of the parameters in this class. More... | |
virtual void | Vectorize (VectorBase< BaseFloat > *params) const |
Turns the parameters into vector form. More... | |
virtual void | UnVectorize (const VectorBase< BaseFloat > ¶ms) |
Converts the parameters from vector form. More... | |
virtual void | SetParams (const CuVectorBase< BaseFloat > &bias, const CuMatrixBase< BaseFloat > &linear) |
const CuVector< BaseFloat > & | BiasParams () const |
CuVector< BaseFloat > & | BiasParams () |
const CuMatrix< BaseFloat > & | LinearParams () const |
CuMatrix< BaseFloat > & | LinearParams () |
AffineComponent (const AffineComponent &other) | |
AffineComponent (const CuMatrixBase< BaseFloat > &linear_params, const CuVectorBase< BaseFloat > &bias_params, BaseFloat learning_rate) | |
virtual void | Resize (int32 input_dim, int32 output_dim) |
void | Init (int32 input_dim, int32 output_dim, BaseFloat param_stddev, BaseFloat bias_stddev) |
Public Member Functions inherited from UpdatableComponent | |
UpdatableComponent (const UpdatableComponent &other) | |
UpdatableComponent () | |
virtual | ~UpdatableComponent () |
virtual void | SetUnderlyingLearningRate (BaseFloat lrate) |
Sets the learning rate of gradient descent- gets multiplied by learning_rate_factor_. More... | |
virtual void | SetActualLearningRate (BaseFloat lrate) |
Sets the learning rate directly, bypassing learning_rate_factor_. More... | |
virtual void | SetAsGradient () |
Sets is_gradient_ to true and sets learning_rate_ to 1, ignoring learning_rate_factor_. More... | |
virtual BaseFloat | LearningRateFactor () |
virtual void | SetLearningRateFactor (BaseFloat lrate_factor) |
void | SetUpdatableConfigs (const UpdatableComponent &other) |
virtual void | FreezeNaturalGradient (bool freeze) |
freezes/unfreezes NaturalGradient updates, if applicable (to be overriden by components that use Natural Gradient). More... | |
BaseFloat | LearningRate () const |
Gets the learning rate to be used in gradient descent. More... | |
BaseFloat | MaxChange () const |
Returns the per-component max-change value, which is interpreted as the maximum change (in l2 norm) in parameters that is allowed per minibatch for this component. More... | |
void | SetMaxChange (BaseFloat max_change) |
BaseFloat | L2Regularization () const |
Returns the l2 regularization constant, which may be set in any updatable component (usually from the config file). More... | |
void | SetL2Regularization (BaseFloat a) |
Public Member Functions inherited from Component | |
virtual void | StoreStats (const CuMatrixBase< BaseFloat > &in_value, const CuMatrixBase< BaseFloat > &out_value, void *memo) |
This function may store stats on average activation values, and for some component types, the average value of the derivative of the nonlinearity. More... | |
virtual void | ZeroStats () |
Components that provide an implementation of StoreStats should also provide an implementation of ZeroStats(), to set those stats to zero. More... | |
virtual void | GetInputIndexes (const MiscComputationInfo &misc_info, const Index &output_index, std::vector< Index > *desired_indexes) const |
This function only does something interesting for non-simple Components. More... | |
virtual bool | IsComputable (const MiscComputationInfo &misc_info, const Index &output_index, const IndexSet &input_index_set, std::vector< Index > *used_inputs) const |
This function only does something interesting for non-simple Components, and it exists to make it possible to manage optionally-required inputs. More... | |
virtual void | ReorderIndexes (std::vector< Index > *input_indexes, std::vector< Index > *output_indexes) const |
This function only does something interesting for non-simple Components. More... | |
virtual ComponentPrecomputedIndexes * | PrecomputeIndexes (const MiscComputationInfo &misc_info, const std::vector< Index > &input_indexes, const std::vector< Index > &output_indexes, bool need_backprop) const |
This function must return NULL for simple Components. More... | |
virtual void | DeleteMemo (void *memo) const |
This virtual function only needs to be overwritten by Components that return a non-NULL memo from their Propagate() function. More... | |
virtual void | ConsolidateMemory () |
This virtual function relates to memory management, and avoiding fragmentation. More... | |
Component () | |
virtual | ~Component () |
Protected Member Functions | |
void | Init (std::string matrix_filename) |
virtual void | Update (const std::string &debug_info, const CuMatrixBase< BaseFloat > &in_value, const CuMatrixBase< BaseFloat > &out_deriv) |
virtual void | UpdateSimple (const CuMatrixBase< BaseFloat > &in_value, const CuMatrixBase< BaseFloat > &out_deriv) |
const AffineComponent & | operator= (const AffineComponent &other) |
Protected Member Functions inherited from UpdatableComponent | |
void | InitLearningRatesFromConfig (ConfigLine *cfl) |
std::string | ReadUpdatableCommon (std::istream &is, bool binary) |
void | WriteUpdatableCommon (std::ostream &is, bool binary) const |
Protected Attributes | |
CuMatrix< BaseFloat > | linear_params_ |
CuVector< BaseFloat > | bias_params_ |
BaseFloat | orthonormal_constraint_ |
Protected Attributes inherited from UpdatableComponent | |
BaseFloat | learning_rate_ |
learning rate (typically 0.0..0.01) More... | |
BaseFloat | learning_rate_factor_ |
learning rate factor (normally 1.0, but can be set to another < value so that when < you call SetLearningRate(), that value will be scaled by this factor. More... | |
BaseFloat | l2_regularize_ |
L2 regularization constant. More... | |
bool | is_gradient_ |
True if this component is to be treated as a gradient rather than as parameters. More... | |
BaseFloat | max_change_ |
configuration value for imposing max-change More... | |
Friends | |
class | NaturalGradientAffineComponent |
Additional Inherited Members | |
Static Public Member Functions inherited from Component | |
static Component * | ReadNew (std::istream &is, bool binary) |
Read component from stream (works out its type). Dies on error. More... | |
static Component * | NewComponentOfType (const std::string &type) |
Returns a new Component of the given type e.g. More... | |
Definition at line 406 of file nnet-simple-component.h.
|
inline |
Definition at line 416 of file nnet-simple-component.h.
Referenced by AffineComponent::Copy().
|
explicit |
Definition at line 1125 of file nnet-simple-component.cc.
AffineComponent | ( | const CuMatrixBase< BaseFloat > & | linear_params, |
const CuVectorBase< BaseFloat > & | bias_params, | ||
BaseFloat | learning_rate | ||
) |
Definition at line 1131 of file nnet-simple-component.cc.
References CuVectorBase< Real >::Dim(), KALDI_ASSERT, CuMatrixBase< Real >::NumRows(), and UpdatableComponent::SetUnderlyingLearningRate().
This virtual function when called by – an UpdatableComponent adds the parameters of another updatable component, times some constant, to the current parameters.
– a NonlinearComponent (or another component that stores stats, like BatchNormComponent)– it relates to adding stats. Otherwise it will normally do nothing.
Reimplemented from Component.
Reimplemented in NaturalGradientAffineComponent.
Definition at line 1117 of file nnet-simple-component.cc.
References AffineComponent::bias_params_, KALDI_ASSERT, and AffineComponent::linear_params_.
|
virtual |
Backprop function; depending on which of the arguments 'to_update' and 'in_deriv' are non-NULL, this can compute input-data derivatives and/or perform model update.
[in] | debug_info | The component name, to be printed out in any warning messages. |
[in] | indexes | A pointer to some information output by this class's PrecomputeIndexes function (will be NULL for simple components, i.e. those that don't do things like splicing). |
[in] | in_value | The matrix that was given as input to the Propagate function. Will be ignored (and may be empty) if Properties()&kBackpropNeedsInput == 0. |
[in] | out_value | The matrix that was output from the Propagate function. Will be ignored (and may be empty) if Properties()&kBackpropNeedsOutput == 0 |
[in] | out_deriv | The derivative at the output of this component. |
[in] | memo | This will normally be NULL, but for component types that set the flag kUsesMemo, this will be the return value of the Propagate() function that corresponds to this Backprop() function. Ownership of any pointers is not transferred to the Backprop function; DeleteMemo() will be called to delete it. |
[out] | to_update | If model update is desired, the Component to be updated, else NULL. Does not have to be identical to this. If supplied, you can assume that to_update->Properties() & kUpdatableComponent is nonzero. |
[out] | in_deriv | The derivative at the input of this component, if needed (else NULL). If Properties()&kBackpropInPlace, may be the same matrix as out_deriv. If Properties()&kBackpropAdds, this is added to by the Backprop routine, else it is set. The component code chooses which mode to work in, based on convenience. |
Implements Component.
Definition at line 1260 of file nnet-simple-component.cc.
References CuMatrixBase< Real >::AddMatMat(), UpdatableComponent::is_gradient_, kaldi::kNoTrans, AffineComponent::linear_params_, NVTX_RANGE, AffineComponent::Update(), and AffineComponent::UpdateSimple().
Definition at line 455 of file nnet-simple-component.h.
Referenced by ModelCollapser::CollapseComponentsAffine(), ModelCollapser::CollapseComponentsScale(), SvdApplier::DecomposeComponent(), ModelCollapser::GetDiagonallyPreModifiedComponentIndex(), and kaldi::nnet3::ReduceRankOfComponents().
Definition at line 456 of file nnet-simple-component.h.
|
virtual |
Copies component (deep copy).
Implements Component.
Reimplemented in NaturalGradientAffineComponent.
Definition at line 1173 of file nnet-simple-component.cc.
References AffineComponent::AffineComponent().
Referenced by ModelCollapser::CollapseComponentsScale().
|
virtual |
Computes dot-product between parameters of two instances of a Component.
Can be used for computing parameter-norm of an UpdatableComponent.
Implements UpdatableComponent.
Definition at line 1178 of file nnet-simple-component.cc.
References AffineComponent::bias_params_, kaldi::kTrans, AffineComponent::linear_params_, kaldi::TraceMatMat(), and kaldi::VecVec().
|
virtual |
Returns some text-form information about this component, for diagnostics.
Starts with the type of the component. E.g. "SigmoidComponent dim=900", although most components will have much more info.
Reimplemented from UpdatableComponent.
Reimplemented in NaturalGradientAffineComponent.
Definition at line 1159 of file nnet-simple-component.cc.
References AffineComponent::bias_params_, kaldi::GetVerboseLevel(), UpdatableComponent::Info(), AffineComponent::linear_params_, AffineComponent::orthonormal_constraint_, and kaldi::nnet3::PrintParameterStats().
Referenced by NaturalGradientAffineComponent::Info().
Definition at line 1185 of file nnet-simple-component.cc.
References AffineComponent::bias_params_, KALDI_ASSERT, and AffineComponent::linear_params_.
Referenced by ModelCollapser::CollapseComponentsAffine(), and AffineComponent::InitFromConfig().
|
protected |
Definition at line 1196 of file nnet-simple-component.cc.
References AffineComponent::bias_params_, KALDI_ASSERT, AffineComponent::linear_params_, CuMatrixBase< Real >::NumCols(), CuMatrixBase< Real >::NumRows(), CuMatrixBase< Real >::Range(), and kaldi::ReadKaldiObject().
|
virtual |
Initialize, from a ConfigLine object.
[in] | cfl | A ConfigLine containing any parameters that are needed for initialization. For example: "dim=100 param-stddev=0.1" |
Implements Component.
Reimplemented in NaturalGradientAffineComponent.
Definition at line 1207 of file nnet-simple-component.cc.
References ConfigLine::GetValue(), ConfigLine::HasUnusedValues(), AffineComponent::Init(), UpdatableComponent::InitLearningRatesFromConfig(), AffineComponent::InputDim(), KALDI_ASSERT, KALDI_ERR, AffineComponent::orthonormal_constraint_, AffineComponent::OutputDim(), ConfigLine::UnusedValues(), and ConfigLine::WholeLine().
|
inlinevirtual |
Returns input-dimension of this component.
Implements Component.
Definition at line 408 of file nnet-simple-component.h.
Referenced by ModelCollapser::CollapseComponentsAffine(), SvdApplier::DecomposeComponent(), SvdApplier::DecomposeComponents(), AffineComponent::InitFromConfig(), NaturalGradientAffineComponent::InitFromConfig(), LinearComponent::InitFromConfig(), AffineComponent::NumParameters(), kaldi::nnet3::ReduceRankOfComponents(), AffineComponent::UnVectorize(), and AffineComponent::Vectorize().
Definition at line 457 of file nnet-simple-component.h.
Referenced by ModelCollapser::CollapseComponentsAffine(), ModelCollapser::CollapseComponentsScale(), kaldi::nnet3::ConstrainOrthonormal(), SvdApplier::DecomposeComponent(), ModelCollapser::GetDiagonallyPreModifiedComponentIndex(), and kaldi::nnet3::ReduceRankOfComponents().
Definition at line 458 of file nnet-simple-component.h.
References PnormComponent::Init().
|
virtual |
The following new virtual function returns the total dimension of the parameters in this class.
Reimplemented from UpdatableComponent.
Definition at line 1323 of file nnet-simple-component.cc.
References AffineComponent::InputDim(), and AffineComponent::OutputDim().
Referenced by AffineComponent::UnVectorize(), and AffineComponent::Vectorize().
|
protected |
|
inline |
Definition at line 411 of file nnet-simple-component.h.
References Component::Info(), and PnormComponent::InitFromConfig().
Referenced by kaldi::nnet3::ConstrainOrthonormal().
|
inlinevirtual |
Returns output-dimension of this component.
Implements Component.
Definition at line 409 of file nnet-simple-component.h.
Referenced by ModelCollapser::CollapseComponentsAffine(), ModelCollapser::CollapseComponentsScale(), SvdApplier::DecomposeComponent(), SvdApplier::DecomposeComponents(), AffineComponent::InitFromConfig(), NaturalGradientAffineComponent::InitFromConfig(), LinearComponent::InitFromConfig(), AffineComponent::NumParameters(), kaldi::nnet3::ReduceRankOfComponents(), AffineComponent::UnVectorize(), and AffineComponent::Vectorize().
|
virtual |
This function is to be used in testing.
It adds unit noise times "stddev" to the parameters of the component.
Implements UpdatableComponent.
Definition at line 1149 of file nnet-simple-component.cc.
References AffineComponent::bias_params_, AffineComponent::linear_params_, CuVectorBase< Real >::SetRandn(), and CuMatrixBase< Real >::SetRandn().
|
virtual |
Propagate function.
[in] | indexes | A pointer to some information output by this class's PrecomputeIndexes function (will be NULL for simple components, i.e. those that don't do things like splicing). |
[in] | in | The input to this component. Num-columns == InputDim(). |
[out] | out | The output of this component. Num-columns == OutputDim(). Note: output of this component will be added to the initial value of "out" if Properties()&kPropagateAdds != 0; otherwise the output will be set and the initial value ignored. Each Component chooses whether it is more convenient implementation-wise to add or set, and the calling code has to deal with it. |
Implements Component.
Definition at line 1242 of file nnet-simple-component.cc.
References CuMatrixBase< Real >::AddMatMat(), AffineComponent::bias_params_, CuMatrixBase< Real >::CopyRowsFromVec(), kaldi::kNoTrans, kaldi::kTrans, and AffineComponent::linear_params_.
|
inlinevirtual |
Return bitmask of the component's properties.
These properties depend only on the component's type. See enum ComponentProperties.
Implements Component.
Definition at line 418 of file nnet-simple-component.h.
References Component::Add(), PnormComponent::Backprop(), PnormComponent::Copy(), kaldi::nnet3::DotProduct(), kaldi::nnet3::kBackpropAdds, kaldi::nnet3::kBackpropNeedsInput, kaldi::nnet3::kSimpleComponent, kaldi::nnet3::kUpdatableComponent, kaldi::nnet3::NumParameters(), kaldi::nnet3::PerturbParams(), PnormComponent::Propagate(), PnormComponent::Read(), Component::Scale(), and PnormComponent::Write().
|
virtual |
Read function (used after we know the type of the Component); accepts input that is missing the token that describes the component type, in case it has already been consumed.
Implements Component.
Reimplemented in NaturalGradientAffineComponent.
Definition at line 1289 of file nnet-simple-component.cc.
References AffineComponent::bias_params_, kaldi::nnet3::ExpectToken(), UpdatableComponent::is_gradient_, AffineComponent::linear_params_, AffineComponent::orthonormal_constraint_, kaldi::PeekToken(), kaldi::ReadBasicType(), and UpdatableComponent::ReadUpdatableCommon().
Definition at line 1111 of file nnet-simple-component.cc.
References KALDI_ASSERT.
|
virtual |
This virtual function when called on – an UpdatableComponent scales the parameters by "scale" when called by an UpdatableComponent.
– a Nonlinear component (or another component that stores stats, like BatchNormComponent)– it relates to scaling activation stats, not parameters. Otherwise it will normally do nothing.
Reimplemented from Component.
Reimplemented in NaturalGradientAffineComponent.
Definition at line 1100 of file nnet-simple-component.cc.
Referenced by ModelCollapser::GetScaledComponentIndex().
|
virtual |
Definition at line 1142 of file nnet-simple-component.cc.
References AffineComponent::bias_params_, KALDI_ASSERT, and AffineComponent::linear_params_.
Referenced by ModelCollapser::CollapseComponentsAffine(), ModelCollapser::CollapseComponentsScale(), and kaldi::nnet3::ReduceRankOfComponents().
|
inlinevirtual |
Returns a string such as "SigmoidComponent", describing the type of the object.
Implements Component.
Reimplemented in NaturalGradientAffineComponent.
Definition at line 417 of file nnet-simple-component.h.
|
virtual |
Converts the parameters from vector form.
Reimplemented from UpdatableComponent.
Definition at line 1332 of file nnet-simple-component.cc.
References AffineComponent::bias_params_, VectorBase< Real >::Dim(), AffineComponent::InputDim(), KALDI_ASSERT, AffineComponent::linear_params_, AffineComponent::NumParameters(), AffineComponent::OutputDim(), and VectorBase< Real >::Range().
|
inlineprotectedvirtual |
Reimplemented in NaturalGradientAffineComponent.
Definition at line 476 of file nnet-simple-component.h.
Referenced by AffineComponent::Backprop().
|
protectedvirtual |
Definition at line 1253 of file nnet-simple-component.cc.
References AffineComponent::bias_params_, kaldi::kNoTrans, kaldi::kTrans, UpdatableComponent::learning_rate_, and AffineComponent::linear_params_.
Referenced by AffineComponent::Backprop().
|
virtual |
Turns the parameters into vector form.
We put the vector form on the CPU, because in the kinds of situations where we do this, we'll tend to use too much memory for the GPU.
Reimplemented from UpdatableComponent.
Definition at line 1326 of file nnet-simple-component.cc.
References AffineComponent::bias_params_, VectorBase< Real >::Dim(), AffineComponent::InputDim(), KALDI_ASSERT, AffineComponent::linear_params_, AffineComponent::NumParameters(), AffineComponent::OutputDim(), and VectorBase< Real >::Range().
|
virtual |
Write component to stream.
Implements Component.
Reimplemented in NaturalGradientAffineComponent.
Definition at line 1310 of file nnet-simple-component.cc.
References AffineComponent::bias_params_, AffineComponent::linear_params_, AffineComponent::orthonormal_constraint_, kaldi::WriteBasicType(), kaldi::WriteToken(), and UpdatableComponent::WriteUpdatableCommon().
|
friend |
Definition at line 473 of file nnet-simple-component.h.
Definition at line 490 of file nnet-simple-component.h.
Referenced by AffineComponent::Add(), NaturalGradientAffineComponent::Add(), AffineComponent::DotProduct(), AffineComponent::Info(), AffineComponent::Init(), NaturalGradientAffineComponent::InitFromConfig(), AffineComponent::PerturbParams(), AffineComponent::Propagate(), AffineComponent::Read(), NaturalGradientAffineComponent::Scale(), AffineComponent::SetParams(), AffineComponent::UnVectorize(), NaturalGradientAffineComponent::Update(), AffineComponent::UpdateSimple(), AffineComponent::Vectorize(), AffineComponent::Write(), and NaturalGradientAffineComponent::Write().
Definition at line 489 of file nnet-simple-component.h.
Referenced by AffineComponent::Add(), NaturalGradientAffineComponent::Add(), AffineComponent::Backprop(), AffineComponent::DotProduct(), AffineComponent::Info(), AffineComponent::Init(), NaturalGradientAffineComponent::InitFromConfig(), AffineComponent::PerturbParams(), AffineComponent::Propagate(), AffineComponent::Read(), NaturalGradientAffineComponent::Scale(), AffineComponent::SetParams(), AffineComponent::UnVectorize(), NaturalGradientAffineComponent::Update(), AffineComponent::UpdateSimple(), AffineComponent::Vectorize(), AffineComponent::Write(), and NaturalGradientAffineComponent::Write().
|
protected |
Definition at line 493 of file nnet-simple-component.h.
Referenced by AffineComponent::Info(), LinearComponent::Info(), AffineComponent::InitFromConfig(), NaturalGradientAffineComponent::InitFromConfig(), LinearComponent::InitFromConfig(), AffineComponent::Read(), LinearComponent::Read(), AffineComponent::Write(), NaturalGradientAffineComponent::Write(), and LinearComponent::Write().