kaldi::nnet3 Namespace Reference

Namespaces

 attention
 
 computation_graph
 
 time_height_convolution
 
 utterance_splitting
 This namespace contains things needed for the implementation of the function NnetBatchComputer::SplitUtteranceIntoTasks().
 

Classes

struct  Access
 
class  AffineComponent
 
class  AmNnetSimple
 
struct  Analyzer
 This struct exists to set up various pieces of analysis; it helps avoid the repetition of code where we compute all these things in sequence. More...
 
class  BackpropTruncationComponent
 
class  BackpropTruncationComponentPrecomputedIndexes
 
class  BatchedXvectorComputer
 
struct  BatchedXvectorComputerOptions
 
class  BatchNormComponent
 
class  BinarySumDescriptor
 BinarySumDescriptor can represent either A + B, or (A if defined, else B). More...
 
class  BlockAffineComponent
 This class implements an affine transform using a block diagonal matrix, i.e., one whose weight matrix is all zeros except for blocks on the diagonal. More...
 
class  BlockFactorizedTdnnComponent
 BlockFactorizedTdnnComponent is a modified form of TdnnComponent (from which it inherits) that is inspired by quaternion-based neural networks, but is more general and trainable; the idea is that blocks of parameters are linear functions of a smaller number of parameters, where the linear function itself is trainable. More...
 
class  CachingOptimizingCompiler
 This class enables you to do the compilation and optimization in one call, and also ensures that if the ComputationRequest is identical to the previous one, the compilation process is not repeated. More...
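 A minimal sketch of the intended call pattern, assuming 'nnet' and 'request' already exist (the exact return type of Compile() varies between Kaldi versions; recent ones return a shared_ptr):

    NnetOptimizeOptions optimize_opts;
    CachingOptimizingCompilerOptions compiler_opts;
    CachingOptimizingCompiler compiler(nnet, optimize_opts, compiler_opts);
    // The result is cached: a second call with an identical request
    // returns the cached computation instead of recompiling.
    std::shared_ptr<const NnetComputation> computation =
        compiler.Compile(request);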
 
struct  CachingOptimizingCompilerOptions
 
class  ChainExampleMerger
 This class is responsible for arranging examples in groups that have the same structure (i.e. More...
 
struct  ChainObjectiveInfo
 
struct  CheckComputationOptions
 
struct  ChunkInfo
 
struct  ChunkTimeInfo
 struct ChunkTimeInfo is used by class UtteranceSplitter to output information about how we split an utterance into chunks. More...
 
struct  CindexHasher
 
class  CindexSet
 An abstract representation of a set of Cindexes. More...
 
struct  CindexVectorHasher
 
class  ClipGradientComponent
 
struct  CollapseModelConfig
 Config class for the CollapseModel function. More...
 
struct  CommandAttributes
 
struct  CommandPairComparator
 
struct  ComparePair
 
class  Compiler
 This class creates an initial version of the NnetComputation, without any optimization or sharing of matrices. More...
 
struct  CompilerOptions
 
class  Component
 Abstract base-class for neural-net components. More...
 
class  ComponentPrecomputedIndexes
 
class  CompositeComponent
 CompositeComponent is a component representing a sequence of [simple] components. More...
 
class  ComputationAnalysis
 This class performs various kinds of specific analysis on top of what class Analyzer gives you immediately. More...
 
class  ComputationCache
 Class ComputationCache is used inside class CachingOptimizingCompiler to cache previously computed computations. More...
 
class  ComputationChecker
 
class  ComputationExpander
 
struct  ComputationGraph
 The first step in compilation is to turn the ComputationRequest into a ComputationGraph, where for each Cindex we have a list of other Cindexes that it depends on. More...
 
class  ComputationGraphBuilder
 This class is responsible for the first stage of compilation, building the ComputationGraph that corresponds to a ComputationRequest. More...
 
class  ComputationLoopedOptimizer
 
class  ComputationRenumberer
 
struct  ComputationRequest
 
struct  ComputationRequestHasher
 
struct  ComputationRequestPtrEqual
 
class  ComputationStepsComputer
 This class arranges the cindex_ids of the computation into a sequence of lists called "steps", which will correspond roughly to the commands in the compiled computation. More...
 
class  ComputationVariables
 This class relates the matrices and sub-matrices in the computation to imaginary "variables", such that we can think of the operations as operating on sets of individual variables, and we can then do analysis that lets us do optimization. More...
 
class  ConstantComponent
 
class  ConstantFunctionComponent
 
class  ConstantSumDescriptor
 This is an alternative base-case of SumDescriptor (an alternative to SimpleSumDescriptor) which represents a constant term, e.g. More...
 
class  ConvolutionComponent
 WARNING: this component is deprecated in favor of TimeHeightConvolutionComponent, and will be deleted. More...
 
class  DecodableAmNnetLoopedOnline
 
class  DecodableAmNnetSimple
 
class  DecodableAmNnetSimpleLooped
 
class  DecodableAmNnetSimpleParallel
 
class  DecodableNnetLoopedOnline
 
class  DecodableNnetLoopedOnlineBase
 
class  DecodableNnetSimple
 
class  DecodableNnetSimpleLooped
 
class  DecodableNnetSimpleLoopedInfo
 When you instantiate class DecodableNnetSimpleLooped, you should give it a const reference to this class, that has been previously initialized. More...
 
class  DerivativeTimeLimiter
 
class  Descriptor
 
class  DiscriminativeExampleMerger
 This class is responsible for arranging examples in groups that have the same structure (i.e. More...
 
struct  DiscriminativeObjectiveFunctionInfo
 
class  DistributeComponent
 This Component takes a larger input-dim than output-dim, where the input-dim must be a multiple of the output-dim, and distributes different blocks of the input dimension to different 'x' values. More...
 
class  DistributeComponentPrecomputedIndexes
 
class  DropoutComponent
 
class  DropoutMaskComponent
 
class  ElementwiseProductComponent
 
struct  ExampleGenerationConfig
 
class  ExampleMerger
 This class is responsible for arranging examples in groups that have the same structure (i.e. More...
 
class  ExampleMergingConfig
 
class  ExampleMergingStats
 This class is responsible for storing, and displaying in log messages, statistics about how examples of different sizes (cf. More...
 
class  FixedAffineComponent
 FixedAffineComponent is an affine transform that is supplied at network initialization time and is not trainable. More...
 
class  FixedBiasComponent
 FixedBiasComponent applies a fixed per-element bias; it's similar to the AddShift component in the nnet1 setup (and only needed for nnet1 model conversion). More...
 
class  FixedScaleComponent
 FixedScaleComponent applies a fixed per-element scale; it's similar to the Rescale component in the nnet1 setup (and only needed for nnet1 model conversion). More...
 
class  ForwardingDescriptor
 A ForwardingDescriptor describes how we copy data from another NetworkNode, or from multiple other NetworkNodes, possibly with a scalar weight. More...
 
struct  GeneralDescriptor
 This class is only used when parsing Descriptors. More...
 
class  GeneralDropoutComponent
 GeneralDropoutComponent implements dropout, including a continuous variant where the thing we multiply is not just zero or one, but may be a continuous value. More...
 
class  GeneralDropoutComponentPrecomputedIndexes
 
struct  ImageAugmentationConfig
 
struct  Index
 struct Index is intended to represent the various indexes by which we number the rows of the matrices that the Components process: mainly 'n', the index of the member of the minibatch, 't', used for the frame index in speech recognition, and 'x', which is a catch-all extra index which we might use in convolutional setups or for other reasons. More...
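 As a hedged illustration of the (n, t, x) convention above, this sketch builds Indexes for a minibatch of two sequences of three frames each, and pairs one with a network-node index to form a Cindex (the three-argument constructor is assumed to take the fields in the order n, t, x):

    std::vector<Index> indexes;
    for (int32 n = 0; n < 2; n++)           // member of the minibatch
      for (int32 t = 0; t < 3; t++)         // frame index
        indexes.push_back(Index(n, t, 0));  // x unused here
    // A Cindex attaches a network-node index to an Index:
    Cindex cindex(3 /* node index */, Index(0, 0, 0));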
 
struct  IndexHasher
 
struct  IndexLessNxt
 
class  IndexSet
 An abstract representation of a set of Indexes. More...
 
struct  IndexVectorHasher
 
struct  IoSpecification
 
struct  IoSpecificationHasher
 
class  LinearComponent
 
class  LogSoftmaxComponent
 
class  LstmNonlinearityComponent
 
struct  MatrixAccesses
 
class  MatrixExtender
 
struct  MaxChangeStats
 
class  MaxpoolingComponent
 
class  MemoryCompressionOptimizer
 This class is used in the function OptimizeMemoryCompression(), once we determine that there is some potential to do memory compression for this computation. More...
 
struct  MiscComputationInfo
 
class  ModelCollapser
 
class  ModelUpdateConsolidator
 This class is responsible for consolidating the model-update part of backprop commands, for components in (e.g.) recurrent networks that need to have many separate backprop commands, into more efficient single commands operating on consolidated data in larger matrices. More...
 
class  NaturalGradientAffineComponent
 
class  NaturalGradientPerElementScaleComponent
 NaturalGradientPerElementScaleComponent is like PerElementScaleComponent but it uses a natural gradient update for the per-element scales. More...
 
class  NaturalGradientRepeatedAffineComponent
 
struct  NetworkNode
 NetworkNode is used to represent three types of thing: either an input of the network (which pretty much just states the dimension of the input vector); a Component (e.g. More...
 
class  Nnet
 
class  NnetBatchComputer
 This class does neural net inference in a way that is optimized for GPU use: it combines chunks of multiple utterances into minibatches for more efficient computation. More...
 
struct  NnetBatchComputerOptions
 
class  NnetBatchDecoder
 Decoder object that uses multiple CPU threads for the graph search, plus a GPU for the neural net inference (that's done by a separate NnetBatchComputer object). More...
 
class  NnetBatchInference
 This class implements a simplified interface to class NnetBatchComputer, which is suitable for programs like 'nnet3-compute' where you want to support fast GPU-based inference on a sequence of utterances, and get them back from the object in the same order. More...
 
class  NnetChainComputeProb
 This class is for computing objective-function values in a nnet3+chain setup, for diagnostics. More...
 
struct  NnetChainExample
 NnetChainExample is like NnetExample, but specialized for lattice-free (chain) training. More...
 
struct  NnetChainExampleStructureCompare
 This comparator object compares just the structural aspects of the NnetChainExample without looking at the value of the features. More...
 
struct  NnetChainExampleStructureHasher
 This hashing object hashes just the structural aspects of the NnetChainExample without looking at the value of the features. More...
 
struct  NnetChainSupervision
 
class  NnetChainTrainer
 This class is for single-threaded training of neural nets using the 'chain' model. More...
 
struct  NnetChainTrainingOptions
 
struct  NnetComputation
 
struct  NnetComputationPrintInserter
 
struct  NnetComputeOptions
 
class  NnetComputeProb
 This class is for computing cross-entropy and accuracy values in a neural network, for diagnostics. More...
 
struct  NnetComputeProbOptions
 
class  NnetComputer
 class NnetComputer is responsible for executing the computation described in the "computation" object. More...
 
class  NnetComputerFromEg
 
class  NnetDiscriminativeComputeObjf
 This class is for computing objective-function values in nnet3 discriminative training, for diagnostics. More...
 
struct  NnetDiscriminativeExample
 NnetDiscriminativeExample is like NnetExample, but specialized for sequence training. More...
 
struct  NnetDiscriminativeExampleStructureCompare
 This comparator object compares just the structural aspects of the NnetDiscriminativeExample without looking at the value of the features. More...
 
struct  NnetDiscriminativeExampleStructureHasher
 This hashing object hashes just the structural aspects of the NnetDiscriminativeExample without looking at the value of the features. More...
 
struct  NnetDiscriminativeOptions
 
struct  NnetDiscriminativeSupervision
 
class  NnetDiscriminativeTrainer
 This class is for single-threaded discriminative training of neural nets. More...
 
struct  NnetExample
 NnetExample is the input data and corresponding label (or labels) for one or more frames of input, used for standard cross-entropy training of neural nets (and possibly for other objective functions). More...
 
struct  NnetExampleStructureCompare
 This comparator object compares just the structural aspects of the NnetExample without looking at the value of the features. More...
 
struct  NnetExampleStructureHasher
 This hashing object hashes just the structural aspects of the NnetExample without looking at the value of the features. More...
 
struct  NnetGenerationOptions
 
struct  NnetInferenceTask
 class NnetInferenceTask represents a chunk of an utterance that is requested to be computed. More...
 
struct  NnetIo
 
struct  NnetIoStructureCompare
 This comparison object compares just the structural aspects of the NnetIo object (name, indexes, feature dimension) without looking at the value of features. More...
 
struct  NnetIoStructureHasher
 This hashing object hashes just the structural aspects of the NnetIo object (name, indexes, feature dimension) without looking at the value of features. More...
 
class  NnetLdaStatsAccumulator
 
struct  NnetOptimizeOptions
 
struct  NnetSimpleComputationOptions
 
struct  NnetSimpleLoopedComputationOptions
 
class  NnetTrainer
 This class is for single-threaded training of neural nets using standard objective functions such as cross-entropy (implemented with logsoftmax nonlinearity and a linear objective function) and quadratic loss. More...
 
struct  NnetTrainerOptions
 
class  NonlinearComponent
 
class  NoOpComponent
 NoOpComponent just duplicates its input. More...
 
class  NormalizeComponent
 
struct  ObjectiveFunctionInfo
 
class  OffsetForwardingDescriptor
 Offsets in 't' and 'x' values of other ForwardingDescriptors. More...
 
class  OnlineNaturalGradient
 Keywords for search: natural gradient, naturalgradient, NG-SGD. More...
 
class  OnlineNaturalGradientSimple
 
class  OptionalSumDescriptor
 This is the case of class SumDescriptor, in which we contain just one term, and that term is optional (an IfDefined() expression). More...
 
struct  PairIsEqualComparator
 
struct  PerDimObjectiveInfo
 
class  PerElementOffsetComponent
 
class  PerElementScaleComponent
 PerElementScaleComponent scales each dimension of its input with a separate trainable scale; it's like a linear component with a diagonal matrix. More...
 
class  PermuteComponent
 PermuteComponent changes the order of the columns (i.e. More...
 
class  PnormComponent
 
class  RandomComponent
 
class  RectifiedLinearComponent
 
class  RepeatedAffineComponent
 
class  ReplaceIndexForwardingDescriptor
 This ForwardingDescriptor modifies the indexes (n, t, x) by replacing one of them (normally t) with a constant value and keeping the rest. More...
 
class  RestrictedAttentionComponent
 RestrictedAttentionComponent implements an attention model with restricted temporal context. More...
 
class  RoundingForwardingDescriptor
 For use in clockwork RNNs and the like, this forwarding-descriptor rounds the time-index t down to the closest t' <= t that is an exact multiple of t_modulus_. More...
 
class  RowOpsSplitter
 
class  ScaleAndOffsetComponent
 
class  SigmoidComponent
 
class  SimpleForwardingDescriptor
 SimpleForwardingDescriptor is the base-case of ForwardingDescriptor, consisting of a source node in the graph with a given scalar weight (which will in the normal case be 1.0). More...
 
struct  SimpleObjectiveInfo
 
class  SimpleSumDescriptor
 This is the normal base-case of SumDescriptor which just wraps a ForwardingDescriptor. More...
 
class  SoftmaxComponent
 
class  SpecAugmentTimeMaskComponent
 SpecAugmentTimeMaskComponent implements the time part of SpecAugment. More...
 
class  SpecAugmentTimeMaskComponentPrecomputedIndexes
 
class  StatisticsExtractionComponent
 
class  StatisticsExtractionComponentPrecomputedIndexes
 
class  StatisticsPoolingComponent
 
class  StatisticsPoolingComponentPrecomputedIndexes
 
class  SumBlockComponent
 SumBlockComponent sums over blocks of its input: for instance, if you create one with the config "input-dim=400 output-dim=100", its output will be the sum over the 4 100-dimensional blocks of the input. More...
 
class  SumDescriptor
 This is an abstract base-class. More...
 
class  SumGroupComponent
 SumGroupComponent is used to sum up groups of posteriors. More...
 
class  SvdApplier
 
class  SwitchingForwardingDescriptor
 Chooses from different inputs based on the time index modulo (the number of ForwardingDescriptors given as inputs). More...
 
class  TanhComponent
 
struct  TarjanNode
 
class  TdnnComponent
 TdnnComponent is a more memory-efficient alternative to manually splicing several frames of input and then using a NaturalGradientAffineComponent or a LinearComponent. More...
 
class  TimeHeightConvolutionComponent
 TimeHeightConvolutionComponent implements 2-dimensional convolution where one of the dimensions of convolution (which traditionally would be called the width axis) is identified with time (i.e. More...
 
class  UpdatableComponent
 Class UpdatableComponent is a Component which has trainable parameters; it extends the interface of Component. More...
 
class  UtteranceSplitter
 
class  VariableMergingOptimizer
 This class is responsible for merging matrices, although you probably want to access it via the function VariableMergingOptimization(). More...
 

Typedefs

typedef TableWriter< KaldiObjectHolder< NnetChainExample > > NnetChainExampleWriter
 
typedef SequentialTableReader< KaldiObjectHolder< NnetChainExample > > SequentialNnetChainExampleReader
 
typedef RandomAccessTableReader< KaldiObjectHolder< NnetChainExample > > RandomAccessNnetChainExampleReader
 
typedef std::pair< int32, Index > Cindex
 
typedef TableWriter< KaldiObjectHolder< NnetDiscriminativeExample > > NnetDiscriminativeExampleWriter
 
typedef SequentialTableReader< KaldiObjectHolder< NnetDiscriminativeExample > > SequentialNnetDiscriminativeExampleReader
 
typedef RandomAccessTableReader< KaldiObjectHolder< NnetDiscriminativeExample > > RandomAccessNnetDiscriminativeExampleReader
 
typedef TableWriter< KaldiObjectHolder< NnetExample > > NnetExampleWriter
 
typedef SequentialTableReader< KaldiObjectHolder< NnetExample > > SequentialNnetExampleReader
 
typedef RandomAccessTableReader< KaldiObjectHolder< NnetExample > > RandomAccessNnetExampleReader
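 These typedefs instantiate the generic Kaldi Table code for example I/O; a hedged sketch of the usual read/write loop, in the style of tools like nnet3-copy-egs (the rspecifier/wspecifier strings are placeholders):

    SequentialNnetExampleReader example_reader("ark:train.egs");
    NnetExampleWriter example_writer("ark:copied.egs");
    // Iterate over the archive, copying each example unchanged.
    for (; !example_reader.Done(); example_reader.Next())
      example_writer.Write(example_reader.Key(), example_reader.Value());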
 

Enumerations

enum  AccessType { kReadAccess, kWriteAccess, kReadWriteAccess }
 
enum  ComponentProperties {
  kSimpleComponent = 0x001, kUpdatableComponent = 0x002, kPropagateInPlace = 0x004, kPropagateAdds = 0x008,
  kReordersIndexes = 0x010, kBackpropAdds = 0x020, kBackpropNeedsInput = 0x040, kBackpropNeedsOutput = 0x080,
  kBackpropInPlace = 0x100, kStoresStats = 0x200, kInputContiguous = 0x400, kOutputContiguous = 0x800,
  kUsesMemo = 0x1000, kRandomComponent = 0x2000
}
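 These values form a bitmask returned by the virtual function Component::Properties(); a hedged sketch of testing individual flags, assuming 'component' is a const Component*:

    int32 props = component->Properties();
    bool updatable   = (props & kUpdatableComponent) != 0;
    bool needs_input = (props & kBackpropNeedsInput) != 0;
    if (updatable && !needs_input) {
      // e.g. the input activations need not be kept around for backprop.
    }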
 
enum  CommandType {
  kAllocMatrix, kDeallocMatrix, kSwapMatrix, kSetConst,
  kPropagate, kBackprop, kBackpropNoModelUpdate, kMatrixCopy,
  kMatrixAdd, kCopyRows, kAddRows, kCopyRowsMulti,
  kCopyToRowsMulti, kAddRowsMulti, kAddToRowsMulti, kAddRowRanges,
  kCompressMatrix, kDecompressMatrix, kAcceptInput, kProvideOutput,
  kNoOperation, kNoOperationPermanent, kNoOperationMarker, kNoOperationLabel,
  kGotoLabel
}
 CommandType is an enum that describes the category of the command used in the NnetComputation. More...
 
enum  ObjectiveType { kLinear, kQuadratic }
 This enum is for a kind of annotation we associate with output nodes of the network; it's for the convenience of calling code so that if the objective is one of a few standard types, we can compute it directly and know how to interpret the supervision labels. More...
 
enum  NodeType {
  kInput, kDescriptor, kComponent, kDimRange,
  kNone
}
 
enum  FillMode { kNearest, kReflect }
 

Functions

void UnitTestPreconditionDirectionsOnline ()
 
std::string PrintCommand (int32 num_commands, int32 command)
 
void UnitTestNnetAnalyze ()
 
static void IndexesMultiToSubmatrixIndexes (const std::vector< std::pair< int32, int32 > > &indexes_multi, std::vector< int32 > *submatrix_indexes)
 Given a vector of pairs from computation.indexes_multi_indexes containing pairs (submatrix-index, row-index), this function outputs to "submatrix_indexes" all (unique) submatrix indexes that appear. More...
 
void ComputeCommandAttributes (const Nnet &nnet, const NnetComputation &computation, const ComputationVariables &vars, std::vector< CommandAttributes > *attributes)
 
void ComputeVariableAccesses (const ComputationVariables &variables, const std::vector< CommandAttributes > &command_attributes, std::vector< std::vector< Access > > *variable_accesses)
 After the command-level attributes have been computed, this function organizes them per variable (see class ComputationVariables for how a variable is defined; it is part of a matrix). More...
 
void ComputeMatrixAccesses (const Nnet &nnet, const NnetComputation &computation, const ComputationVariables &variables, const std::vector< CommandAttributes > &command_attributes, std::vector< MatrixAccesses > *matrix_accesses)
 This function organizes information in the CommandAttributes in a way that is convenient to access per matrix. More...
 
static void CheckComputationOnline (const Nnet &nnet, NnetComputation computation, bool check_rewrite)
 
void CheckComputation (const Nnet &nnet, const NnetComputation &computation, bool check_rewrite=false)
 This is a convenience interface for class ComputationChecker. More...
 
void ComputeMatrixToSubmatrix (const NnetComputation &computation, std::vector< std::vector< int32 > > *mat_to_submat)
 This function computes a vector 'mat_to_submat', indexed by matrix index, such that (*mat_to_submat)[m] is a list of all the submatrix indexes that refer to matrix m. More...
 
void PrintMatrixAccesses (std::ostream &os, const std::vector< MatrixAccesses > &matrix_accesses)
 This function is to be used in debugging; it produces human-readable output. More...
 
void PrintCommandAttributes (std::ostream &os, const std::vector< CommandAttributes > &attributes)
 This function is to be used in debugging; it produces human-readable output. More...
 
void GetCommandsOfType (const NnetComputation &computation, CommandType t, std::vector< int32 > *command_indexes)
 This utility function works out, from a computation, the command-indexes of the commands of the given type. More...
 
int64 GetMaxMemoryUse (const NnetComputation &computation)
 
int32 MaxMemoryUsage (const NnetComputation &computation)
 Returns the total memory, in bytes, used by the computation (just the temporary memory, not counting the memory used by the nnet itself). More...
 
void MergeTaskOutput (const std::vector< NnetInferenceTask > &tasks, Matrix< BaseFloat > *output)
 Merges together the 'output_cpu' (if the 'output_to_cpu' members are true) or the 'output' members of 'tasks' into a single CPU matrix 'output'. More...
 
void MergeTaskOutput (const std::vector< NnetInferenceTask > &tasks, CuMatrix< BaseFloat > *output)
 
static bool HasXentOutputs (const Nnet &nnet)
 
void RecomputeStats (const std::vector< NnetChainExample > &egs, const chain::ChainTrainingOptions &chain_config, const fst::StdVectorFst &den_fst, Nnet *nnet)
 This function zeros the stored component-level stats in the nnet using ZeroComponentStats(), then recomputes them with the supplied egs. More...
 
static void MergeSupervision (const std::vector< const NnetChainSupervision *> &inputs, NnetChainSupervision *output)
 
void MergeChainExamples (bool compress, std::vector< NnetChainExample > *input, NnetChainExample *output)
 This function merges a list of NnetChainExample objects into a single one; it is intended to be used when forming minibatches for neural net training. More...
 
void GetChainComputationRequest (const Nnet &nnet, const NnetChainExample &eg, bool need_model_derivative, bool store_component_stats, bool use_xent_regularization, bool use_xent_derivative, ComputationRequest *computation_request)
 This function takes a NnetChainExample and produces a ComputationRequest. More...
 
void ShiftChainExampleTimes (int32 frame_shift, const std::vector< std::string > &exclude_names, NnetChainExample *eg)
 Shifts the time-index t of everything in the input of "eg" by adding "frame_shift" to all "t" values, but excluding those with names listed in "exclude_names", e.g. More...
 
int32 GetNnetChainExampleSize (const NnetChainExample &a)
 
int32 GetChainNnetExampleSize (const NnetChainExample &a)
 This function returns the 'size' of a chain example as defined for purposes of merging egs, which is defined as the largest number of Indexes in any of the inputs or outputs of the example. More...
 
int32 YzxVectorIndex (int32 x, int32 y, int32 z, int32 input_x_dim, int32 input_y_dim, int32 input_z_dim)
 
int32 ZyxVectorIndex (int32 x, int32 y, int32 z, int32 input_x_dim, int32 input_y_dim, int32 input_z_dim)
 
void RearrangeIndexes (const std::vector< std::vector< int32 > > &in, std::vector< std::vector< int32 > > *out)
 
void UnitTestIndexIo ()
 
void UnitTestCindexIo ()
 
static void WriteIndexVectorElementBinary (std::ostream &os, const std::vector< Index > &vec, int32 i)
 
static void ReadIndexVectorElementBinary (std::istream &is, int32 i, std::vector< Index > *vec)
 
void WriteIndexVector (std::ostream &os, bool binary, const std::vector< Index > &vec)
 
void ReadIndexVector (std::istream &is, bool binary, std::vector< Index > *vec)
 
static void WriteCindexVectorElementBinary (std::ostream &os, const std::vector< Cindex > &vec, int32 i)
 
static void ReadCindexVectorElementBinary (std::istream &is, int32 i, std::vector< Cindex > *vec)
 
void WriteCindexVector (std::ostream &os, bool binary, const std::vector< Cindex > &vec)
 
void ReadCindexVector (std::istream &is, bool binary, std::vector< Cindex > *vec)
 
std::ostream & operator<< (std::ostream &ostream, const Index &index)
 
std::ostream & operator<< (std::ostream &ostream, const Cindex &cindex)
 
void PrintCindex (std::ostream &os, const Cindex &cindex, const std::vector< std::string > &node_names)
 
void PrintIndexes (std::ostream &ostream, const std::vector< Index > &indexes)
 this will only be used for pretty-printing. More...
 
void PrintCindexes (std::ostream &ostream, const std::vector< Cindex > &cindexes, const std::vector< std::string > &node_names)
 this will only be used for pretty-printing. More...
 
void PrintIntegerVector (std::ostream &os, const std::vector< int32 > &ints)
 
void AppendCindexes (int32 node, const std::vector< Index > &indexes, std::vector< Cindex > *out)
 Appends to 'out' the pairs (node, indexes[0]), (node, indexes[1]), ... More...
 
void ModifyNnetIvectorPeriod (int32 ivector_period, Nnet *nnet)
 This function modifies the descriptors in the neural network to change the periodicity with which it expects to read an iVector at its input. More...
 
int32 GetChunkSize (const Nnet &nnet, int32 frame_subsampling_factor, int32 advised_chunk_size)
 
template<class I >
 I Mod (I m, I n)
 Mod(m, n), defined for integers m and n where n > 0, returns the modulus m % n, defined as the integer 0 <= i < n such that i and m are congruent modulo n; for instance, Mod(13, 10) = 3. More...
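 One implementation consistent with the stated contract, shown here only as a sketch (the actual definition lives in the Kaldi headers); note that C++'s built-in % can return negative values for negative m, e.g. -1 % 10 == -1, whereas Mod(-1, 10) == 9:

    template<class I> I Mod(I m, I n) {
      I ans = m % n;          // may be negative if m is negative...
      if (ans < 0) ans += n;  // ...so shift into the range [0, n).
      return ans;
    }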
 
static void CreateComputationRequestInternal (int32 begin_input_t, int32 end_input_t, int32 begin_output_t, int32 end_output_t, int32 num_sequences, int32 frame_subsampling_factor, const std::set< int32 > &ivector_times, ComputationRequest *request)
 
void CreateLoopedComputationRequest (const Nnet &nnet, int32 chunk_size, int32 frame_subsampling_factor, int32 ivector_period, int32 left_context_begin, int32 right_context, int32 num_sequences, ComputationRequest *request1, ComputationRequest *request2, ComputationRequest *request3)
 This function creates computation requests suitable for giving to CompileLooped(). More...
 
void AddTimeOffsetToComputationRequest (int32 t_offset, ComputationRequest *request)
 
static bool ExtrapolateComputationRequest (const ComputationRequest &request1, const ComputationRequest &request2, ComputationRequest *request3)
 
static bool CompileLoopedInternal (const Nnet &nnet, NnetOptimizeOptions optimize_opts, const ComputationRequest &request1, const ComputationRequest &request2, const ComputationRequest &request3, int32 num_requests, NnetComputation *computation)
 
void CompileLooped (const Nnet &nnet, const NnetOptimizeOptions &optimize_opts, const ComputationRequest &request1, const ComputationRequest &request2, const ComputationRequest &request3, NnetComputation *computation)
 CompileLooped() provides an internal interface for 'looped' computation. More...
 
void CreateLoopedComputationRequestSimple (const Nnet &nnet, int32 chunk_size, int32 frame_subsampling_factor, int32 ivector_period, int32 extra_left_context_begin, int32 extra_right_context, int32 num_sequences, ComputationRequest *request1, ComputationRequest *request2, ComputationRequest *request3)
 This function is deprecated. More...
 
void UnitTestNnetCompile ()
 
void UnitTestNnetCompileMulti ()
 
void UnitTestNnetCompileLooped ()
 
void PrintVectorVectorPair (std::vector< std::vector< std::pair< int32, int32 > > > vec_vec_pair)
 
void UnitTestSplitLocationsBackward (bool verbose)
 
void UnitTestHasContiguousProperty ()
 
void UnitTestEnsureContiguousProperty ()
 
void UnitTestSplitLocations (bool verbose)
 
void GetSubmatCounts (const std::vector< std::vector< std::pair< int32, int32 > > > &submat_lists, std::unordered_map< int32, int32 > *submat_counts, std::vector< int32 > *submats_with_large_counts)
 Gets counts of submatrices (the 1st members of pairs) in submat_lists. More...
 
void SeparateSubmatsWithLargeCounts (const std::vector< int32 > &submats_to_separate, const std::vector< std::vector< std::pair< int32, int32 > > > &submat_lists, std::vector< std::vector< std::pair< int32, int32 > > > *reduced_submat_lists, std::vector< std::vector< std::pair< int32, int32 > > > *split_lists)
 This function, used in SplitLocations(), is used to make separate 'split lists' for certain high-count submatrix indexes, specified by the user in 'submats_to_separate'. More...
 
void SplitLocations (const std::vector< std::vector< std::pair< int32, int32 > > > &submat_lists, std::vector< std::vector< std::pair< int32, int32 > > > *split_lists)
 The input to this function is a vector (indexed by matrix-row-index) of lists of pairs (submat_index, row_index), and this function splits it up into a list of vectors of pairs, where those vectors are indexed by matrix-row-index. More...
 
bool ConvertToIndexes (const std::vector< std::pair< int32, int32 > > &location_vector, int32 *first_value, std::vector< int32 > *second_values)
 If it is the case for some i >= 0 that all the .first elements of "location_vector" are either i or -1, then output i to first_value and the .second elements into "second_values", and return true. More...
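 A hedged worked example of that contract (the exact value written to "second_values" for the (-1, -1) pair is an assumption; -1 is the natural choice):

    // All .first values are either 2 or -1, so the call succeeds:
    std::vector<std::pair<int32, int32> > locations =
        { {2, 5}, {-1, -1}, {2, 7} };
    int32 first_value;                 // expected: 2
    std::vector<int32> second_values;  // expected: {5, -1, 7}
    bool ok = ConvertToIndexes(locations, &first_value, &second_values);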
 
void EnsureContiguousProperty (const std::vector< int32 > &indexes, std::vector< std::vector< int32 > > *indexes_out)
 This function takes a vector of indexes and splits it up into as many separate vectors (each of the same size as the input) as needed to ensure that the 'contiguous property' holds. More...
 
void SplitPairList (std::vector< std::pair< int32, int32 > > &list, std::vector< std::vector< std::pair< int32, int32 > > > *split_lists)
 This function splits a vector of pairs into a list of vectors of pairs. More...
 
void SplitLocationsBackward (const std::vector< std::vector< std::pair< int32, int32 > > > &submat_lists, std::vector< std::vector< std::pair< int32, int32 > > > *split_lists)
 This function has the same interface as SplitLocations(); however, it ensures certain additional properties of the output "split_lists", which are necessary because of the way it is used in backprop code. More...
 
bool HasContiguousProperty (const std::vector< int32 > &indexes, std::vector< std::pair< int32, int32 > > *reverse_indexes)
 This function returns true if for each integer i != -1, all the indexes j at which indexes[j] == i are consecutive with no gaps (more formally: if j1 < j2 < j3 and indexes[j1] != -1 and indexes[j1] == indexes[j3], then indexes[j1] == indexes[j2]). More...
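 A hedged illustration (the exact contents written to "reverse_indexes", presumably the range of positions at which each value occurs, should be checked against the implementation):

    std::vector<int32> good = {0, 0, 1, 1, -1};  // each value contiguous.
    std::vector<int32> bad  = {0, 1, 0};         // the 0's are interrupted.
    std::vector<std::pair<int32, int32> > reverse_indexes;
    bool ok1 = HasContiguousProperty(good, &reverse_indexes);  // true
    bool ok2 = HasContiguousProperty(bad, &reverse_indexes);   // false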
 
void GetNxList (const std::vector< Index > &indexes, std::vector< std::pair< int32, int32 > > *pairs)
 This function outputs a unique, lexicographically sorted list of the pairs of (n, x) values that are encountered in the provided list of Indexes. More...
 
void GetTList (const std::vector< Index > &indexes, std::vector< int32 > *t_values)
 This function outputs a sorted, unique list of the 't' values that are encountered in the provided list of Indexes. If 't' values equal to kNoTime are encountered, they are ignored and are not output. More...
 
static void ResetSeed (int32 rand_seed, const Component &c)
 
bool CheckStringsApproxEqual (const std::string &a, const std::string &b, int32 tolerance=3)
 
void TestNnetComponentIo (Component *c)
 
void TestNnetComponentCopy (Component *c)
 
void TestNnetComponentAddScale (Component *c)
 
void TestNnetComponentVectorizeUnVectorize (Component *c)
 
void TestNnetComponentUpdatable (Component *c)
 
ComponentPrecomputedIndexes * GetPrecomputedIndexes (const Component &c, int32 num_rows)
 
void TestSimpleComponentPropagateProperties (const Component &c)
 
bool TestSimpleComponentDataDerivative (const Component &c, BaseFloat perturb_delta)
 
bool TestSimpleComponentModelDerivative (const Component &c, BaseFloat perturb_delta, bool test_derivative)
 
void UnitTestNnetComponent ()
 
std::ostream & operator<< (std::ostream &os, const ComputationGraphBuilder::ComputableInfo &info)
 This is to be used in logging only. More...
 
void ComputeComputationGraph (const Nnet &nnet, const ComputationRequest &request, ComputationGraph *graph)
 
static int32 SumVectorSizes (const std::vector< std::vector< int32 > > &vec)
 
static int32 SumVectorSizes (const std::vector< std::vector< std::vector< int32 > > > &vec)
 
static void ComputeComputationPhasesForEpoch (const Nnet &nnet, const ComputationGraph &graph, const std::vector< int32 > &this_epoch, const std::vector< std::vector< int32 > > &dependencies_subset, const std::vector< std::vector< int32 > > &depend_on_subset, bool epoch_is_trivial, std::vector< int32 > *phase_indexes, std::vector< std::vector< int32 > > *phases)
 
void ComputeComputationPhases (const Nnet &nnet, const ComputationGraph &computation_graph, std::vector< std::vector< std::vector< int32 > > > *phases_per_segment)
 This function divides a computation into 'phases', where a 'phase' is a collection of cindexes which can (as far as the computation graph is concerned) all be computed at the same time, and depend only on cindexes previously computed in earlier phases. More...
 
static void GetIndexesStrings (const Nnet &nnet, const NnetComputation &computation, std::vector< std::string > *indexes_strings)
 
static void GetIndexesMultiStrings (const Nnet &nnet, const NnetComputation &computation, std::vector< std::string > *indexes_multi_strings)
 
static void PrintCommand (std::ostream &os_out, const Nnet &nnet, const NnetComputation &computation, int32 command_index, const std::vector< std::string > &submatrix_strings, const std::vector< std::string > &indexes_strings, const std::vector< std::string > &indexes_multi_strings)
 
static void PrintComputationPreamble (std::ostream &os, const NnetComputation &c, const Nnet &nnet, const std::vector< std::string > &submatrix_strings, const std::vector< std::string > &indexes_strings, const std::vector< std::string > &indexes_multi_strings)
 
void UnitTestNnetComputationIo (NnetComputation *computation)
 
void UnitTestComputationRequestIo (ComputationRequest *request)
 
void TestNnetDecodable (Nnet *nnet)
 
void UnitTestNnetCompute ()
 
void ComputeMinAndMaxTimes (const std::vector< Index > &indexes, int32 *min_t, int32 *max_t)
 
void SetDerivTimesOptions (const ComputationRequest &request, NnetOptimizeOptions *opt_config)
 
void UnitTestNnetModelDerivatives ()
 
void UnitTestNnetInputDerivatives ()
 
ForwardingDescriptor * GenRandForwardingDescriptor (int32 num_nodes)
 
SumDescriptor * GenRandSumDescriptor (int32 num_nodes)
 
void GenRandDescriptor (int32 num_nodes, Descriptor *desc)
 
void UnitTestDescriptorIo ()
 
void UnitTestGeneralDescriptor ()
 
std::string NormalizeTextDescriptor (const std::vector< std::string > &node_names, const std::string &desc_str)
 
void UnitTestGeneralDescriptorSpecial ()
 
static std::string ParsingContext (const std::string *token_ptr)
 
static void ExpectToken (const std::string &token, const std::string &what_we_are_parsing, const std::string **next_token)
 
static int32 ReadIntegerToken (const std::string &what_we_are_parsing, const std::string **next_token)
 
void ComputeAccuracy (const GeneralMatrix &supervision, const CuMatrixBase< BaseFloat > &nnet_output, BaseFloat *tot_weight, BaseFloat *tot_accuracy, VectorBase< BaseFloat > *tot_weight_vec=NULL, VectorBase< BaseFloat > *tot_accuracy_vec=NULL)
 This function computes the frame accuracy for this minibatch. More...
 
void MergeSupervision (const std::vector< const NnetDiscriminativeSupervision *> &inputs, NnetDiscriminativeSupervision *output)
 
void MergeDiscriminativeExamples (bool compress, std::vector< NnetDiscriminativeExample > *input, NnetDiscriminativeExample *output)
 
void GetDiscriminativeComputationRequest (const Nnet &nnet, const NnetDiscriminativeExample &eg, bool need_model_derivative, bool store_component_stats, bool use_xent_regularization, bool use_xent_derivative, ComputationRequest *computation_request)
 This function takes a NnetDiscriminativeExample and produces a ComputationRequest. More...
 
void ShiftDiscriminativeExampleTimes (int32 frame_shift, const std::vector< std::string > &exclude_names, NnetDiscriminativeExample *eg)
 Shifts the time-index t of everything in the input of "eg" by adding "frame_shift" to all "t" values, but excluding those with names listed in "exclude_names", e.g. More...
 
int32 GetNnetDiscriminativeExampleSize (const NnetDiscriminativeExample &a)
 
void MergeDiscriminativeExamples (std::vector< NnetDiscriminativeExample > *input, bool compress, NnetDiscriminativeExample *output)
 Merges the given vector of examples (which must be non-empty) into a single output example. More...
 
int32 GetDiscriminativeNnetExampleSize (const NnetDiscriminativeExample &a)
 This function returns the 'size' of a discriminative example as defined for purposes of merging egs, which is defined as the largest number of Indexes in any of the inputs or outputs of the example. More...
 
void UnitTestNnetExample ()
 
void UnitTestNnetMergeExamples ()
 
static void GetIoNames (const std::vector< NnetExample > &src, std::vector< std::string > *names_vec)
 
static void GetIoSizes (const std::vector< NnetExample > &src, const std::vector< std::string > &names, std::vector< int32 > *sizes)
 
static void MergeIo (const std::vector< NnetExample > &src, const std::vector< std::string > &names, const std::vector< int32 > &sizes, bool compress, NnetExample *merged_eg)
 
void MergeExamples (const std::vector< NnetExample > &src, bool compress, NnetExample *dest)
 Merge a set of input examples into a single example (typically the size of "src" will be the minibatch size). More...
 
void ShiftExampleTimes (int32 t_offset, const std::vector< std::string > &exclude_names, NnetExample *eg)
 Shifts the time-index t of everything in the "eg" by adding "t_offset" to all "t" values. More...
 
void GetComputationRequest (const Nnet &nnet, const NnetExample &eg, bool need_model_derivative, bool store_component_stats, ComputationRequest *computation_request)
 This function takes a NnetExample (which should already have been frame-selected, if desired, and merged into a minibatch) and produces a ComputationRequest. More...
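 Combined with the compiler and computer classes above, this is the usual path for running a merged example through a network; a hedged end-to-end sketch, assuming 'nnet', a CachingOptimizingCompiler 'compiler', and a merged example 'eg' already exist (method names follow recent Kaldi versions and are worth verifying against the headers):

    ComputationRequest request;
    GetComputationRequest(nnet, eg, false /* need_model_derivative */,
                          false /* store_component_stats */, &request);
    std::shared_ptr<const NnetComputation> computation =
        compiler.Compile(request);
    NnetComputeOptions compute_opts;
    NnetComputer computer(compute_opts, *computation, nnet,
                          NULL /* nnet_to_update */);
    computer.AcceptInputs(nnet, eg.io);  // feed the example's inputs
    computer.Run();
    const CuMatrixBase<BaseFloat> &output = computer.GetOutput("output");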
 
void WriteVectorAsChar (std::ostream &os, bool binary, const VectorBase< BaseFloat > &vec)
 
void ReadVectorAsChar (std::istream &is, bool binary, Vector< BaseFloat > *vec)
 
void RoundUpNumFrames (int32 frame_subsampling_factor, int32 *num_frames, int32 *num_frames_overlap)
 
int32 GetNnetExampleSize (const NnetExample &a)
 This function returns the 'size' of a nnet-example as defined for purposes of merging egs, which is defined as the largest number of Indexes in any of the inputs or outputs of the example. More...
 
static void CopyPairVector (const CuArray< Int32Pair > &in, std::vector< std::pair< int32, int32 > > *out)
 
static void CopyPairVector (const std::vector< std::pair< int32, int32 > > &in, CuArray< Int32Pair > *out)
 
bool AssertGraphEqual (const std::vector< std::vector< int32 > > &graph1, const std::vector< std::vector< int32 > > &graph2)
 
bool AssertVectorEqual (const std::vector< int32 > &vec1, const std::vector< int32 > &vec2)
 
void BuildTestGraph (std::vector< std::vector< int32 > > *graph)
 
void BuildTestGraphTranspose (std::vector< std::vector< int32 > > *graph)
 
void BuildTestSccs (std::vector< std::vector< int32 > > *sccs)
 
void BuildTestSccGraph (std::vector< std::vector< int32 > > *scc_graph)
 
void BuildTestTopSortOrder (std::vector< int32 > *node_to_order)
 
void UnitTestComputeGraphTranspose ()
 
void UnitTestFindSccs ()
 
void UnitTestMakeSccGraph ()
 
void UnitTestComputeTopSortOrder ()
 
void UnitTestComputeTopSortOrder2 ()
 
void NnetToDirectedGraph (const Nnet &nnet, std::vector< std::vector< int32 > > *graph)
 This function takes an nnet and turns it into a directed graph on nodes. More...
 
void ComputeGraphTranspose (const std::vector< std::vector< int32 > > &graph, std::vector< std::vector< int32 > > *graph_transpose)
 Outputs a graph in which the order of arcs is reversed. More...
 
void TarjanSccRecursive (int32 node, const std::vector< std::vector< int32 > > &graph, int32 *global_index, std::vector< TarjanNode > *tarjan_nodes, std::vector< int32 > *tarjan_stack, std::vector< std::vector< int32 > > *sccs)
 
void FindSccsTarjan (const std::vector< std::vector< int32 > > &graph, std::vector< std::vector< int32 > > *sccs)
 
void FindSccs (const std::vector< std::vector< int32 > > &graph, std::vector< std::vector< int32 > > *sccs)
 Given a directed graph (where each std::vector<int32> is a list of destination-nodes of arcs coming from the current node), partition it into strongly connected components (i.e. More...
 
void MakeSccGraph (const std::vector< std::vector< int32 > > &graph, const std::vector< std::vector< int32 > > &sccs, std::vector< std::vector< int32 > > *scc_graph)
 Given a list of sccs of a graph (e.g. More...
 
void ComputeTopSortOrderRecursive (int32 node, const std::vector< std::vector< int32 > > &graph, std::vector< bool > *cycle_detector, std::vector< bool > *is_visited, std::vector< int32 > *reversed_orders)
 
void ComputeTopSortOrder (const std::vector< std::vector< int32 > > &graph, std::vector< int32 > *node_to_order)
 Given an acyclic graph (where each std::vector<int32> is a list of destination-nodes of arcs coming from the current node), compute a topological ordering of the graph nodes. More...
 
std::string PrintGraphToString (const std::vector< std::vector< int32 > > &graph)
 Prints a graph to a string in a pretty way for human readability, e.g. More...
 
void ComputeNnetComputationEpochs (const Nnet &nnet, std::vector< int32 > *node_to_epoch)
 This function computes the order in which we need to compute each node in the graph, where each node-index n maps to an epoch-index t = 0, 1, ... More...
 
bool GraphHasCycles (const std::vector< std::vector< int32 > > &graph)
 This function returns 'true' if the graph represented in 'graph' contains cycles (including cycles where a single node has an arc to itself). More...
 
void UnitTestNnetIo ()
 
static bool UnitTestNnetOptimizeWithOptions (int32 srand_seed, NnetOptimizeOptions opt_config, CachingOptimizingCompilerOptions compiler_config)
 
static void UnitTestNnetOptimizeInternal (int32 srand_seed)
 
static void UnitTestNnetOptimize ()
 
void IdentifySubmatrixArgs (NnetComputation::Command *command, std::vector< int32 * > *submatrix_args)
 This function outputs to "submatrix_args" the addresses of a subset of arguments arg1 through arg6 in "command", that correspond to the indexes of submatrices. More...
 
void IdentifySubmatrixArgs (std::vector< NnetComputation::Command > *commands, std::vector< int32 * > *submatrix_args)
 This function outputs to "submatrix_args" the addresses of the args (arguments arg1 through arg6) in the vector "commands", that correspond to the indexes of submatrices. More...
 
void IdentifyMatrixArgsInComputation (NnetComputation *computation, std::vector< int32 *> *matrix_args)
 
void IdentifyIndexesMultiArgs (std::vector< NnetComputation::Command > *commands, std::vector< int32 * > *indexes_multi_args)
 Identifies in the vector of commands, arguments that correspond to indexes into the computation's indexes_multi array, and outputs a list of pointers to those arguments to 'indexes_multi_args'. More...
 
void IdentifyIndexesRangesArgs (std::vector< NnetComputation::Command > *commands, std::vector< int32 * > *indexes_ranges_args)
 Identifies in the vector of commands, arguments that correspond to indexes into the computation's 'indexes_ranges' array, and outputs a list of pointers to those arguments to 'indexes_ranges_args'. More...
 
void IdentifyIndexesArgs (std::vector< NnetComputation::Command > *commands, std::vector< int32 * > *indexes_args)
 Identifies in the vector of commands, arguments that correspond to indexes into the computation's 'indexes' array, and outputs a list of pointers to those arguments to 'indexes_args'. More...
 
void IdentifySubmatrixArgsInComputation (NnetComputation *computation, std::vector< int32 * > *submatrix_args)
 This function outputs to "submatrix_args" the addresses of integers in 'computation' that correspond to submatrices. More...
 
void RenumberComputation (NnetComputation *computation)
 This function detects submatrices and matrices that are never used (e.g. More...
 
static bool IsNoop (const NnetComputation::Command &command)
 
void RemoveNoOps (NnetComputation *computation)
 Removes commands of type kNoOperation in the computation. More...
 
static NnetComputation::SubMatrixInfo GetSubMatrixOfSubMatrix (const NnetComputation &computation, int32 submat_a, int32 submat_b)
 This static function returns a SubMatrixInfo corresponding to replacing the matrix-index in a's "matrix_index" with, essentially, sub-matrix b. More...
 
void ExtendMatrices (NnetComputation *computation)
 This is not really an optimization in itself but it can make things easier for class VariableMergingOptimizer (usually called by its wrapper VariableMergingOptimization()). More...
 
void ConsolidateModelUpdate (const Nnet &nnet, NnetComputation *computation)
 This optimization consolidates the model-update part of backprop commands, for components in (e.g.) recurrent networks that need to have many separate backprop commands, into more efficient single commands operating on consolidated data in larger matrices. More...
 
void LimitDerivativeTimes (const Nnet &nnet, int32 min_deriv_time, int32 max_deriv_time, NnetComputation *computation)
 
static bool IndexesHaveSpecialStructure (const std::vector< int32 > &indexes, int32 *first_nonnegative_pos, int32 *first_nonnegative_value, int32 *num_nonnegative_indexes)
 
bool ReplaceRowWithMatrixOps (NnetComputation *computation)
 This function detects cases where commands of type kCopyRows, kAddRows or kAddToRows can be converted to commands of type kMatrixCopy or kMatrixAdd, and converts them (this may involve adding submatrices). More...
 
static void FindNumLeadingAndTrailingNegatives (const std::vector< int32 > &vec, int32 *num_leading_negatives, int32 *num_trailing_negatives)
 
static bool SnipSingleRowOp (NnetComputation *computation, int32 command_index)
 
static void FindNumLeadingAndTrailingNegatives (const std::vector< std::pair< int32, int32 > > &vec, int32 *num_leading_negatives, int32 *num_trailing_negatives)
 
static bool SnipMultiRowOp (NnetComputation *computation, int32 command_index)
 
static void FindNumLeadingAndTrailingIdenticals (const std::vector< std::pair< int32, int32 > > &vec, int32 *num_leading_identicals, int32 *num_trailing_identicals)
 
static bool SnipRangesRowOp (NnetComputation *computation, int32 command_index)
 
bool SnipRowOps (NnetComputation *computation)
 This function detects cases where commands of type kCopyRows, kAddRows, kAddRowsMulti, kAddToRowsMulti, kCopyRowsMulti, kCopyToRowsMulti or kAddRowRanges use indexes that start or end with -1's or equivalents, and replace them with similar commands that act on a sub-matrix of the matrices they are currently acting on. More...
 
bool SplitRowOps (NnetComputation *computation)
 This function detects cases where commands of type kAddRowsMulti, kAddToRowsMulti, kCopyRowsMulti, kCopyToRowsMulti use indexes that correspond to at most two submatrices, in two distinct ranges without gaps filled by -1's, and could be converted to at most two commands of type kMatrixAdd, kMatrixCopy, kAddRows or kCopyRows. More...
 
static int32 FindNStride (const std::vector< Index > &indexes, bool full_check)
 
static int32 FindNStride (const std::vector< Cindex > &cindexes, bool full_check)
 
static void ConvertNumNValues (int32 n_stride, int32 old_N, int32 new_N, const std::vector< Index > &indexes_in, std::vector< Index > *indexes_out)
 
void ExpandComputation (const Nnet &nnet, const MiscComputationInfo &misc_info, const NnetComputation &computation, bool need_debug_info, int32 num_n_values, NnetComputation *expanded_computation)
 This function is used in 'shortcut' compilation to expand a computation that has been compiled for exactly 2 'n' values, to one that is suitable for some num_n_values > 2. More...
 
static bool IoSpecificationIsDecomposable (const IoSpecification &io_spec, IoSpecification *mini_io_spec, int32 *num_n_values_out)
 
bool RequestIsDecomposable (const ComputationRequest &request, ComputationRequest *mini_request, int32 *num_n_values)
 This function, used in 'shortcut' compilation where we first compile a smaller computation with the same structure but only 2 distinct 'n' values, works out whether a computation is 'decomposable'; if so, it returns true and outputs the 'mini_request' with the same structure, and the number of 'n' values. More...
 
void OptimizeLoopedComputation (const Nnet &nnet, NnetComputation *computation)
 This function tries to optimize the computation 'computation' for a 'looped' computation. More...
 
void FixGotoLabel (NnetComputation *computation)
 This function ensures that the arg1 of a final command of type kGotoLabel is the same as the command with type kNoOperationLabel. More...
 
bool MatrixIsUnused (const Analyzer &analyzer, const NnetComputation &computation, int32 m)
 This function returns true if matrix 1 <= m < computation->matrices.size() is unused, defined as: it is not an input or an output, and is not accessed other than via commands of type kAllocMatrix, kDeallocMatrix, and kSetConst. More...
 
void RemoveCommandsForUnusedMatrix (const Analyzer &analyzer, int32 m, NnetComputation *computation)
 This function removes from 'computation' the commands accessing matrix 'm', which is assumed to be unused according to the MatrixIsUnused() function above. More...
 
void InsertCommands (std::vector< std::pair< int32, NnetComputation::Command > > *commands, NnetComputation *computation)
 Inserts commands into the computation at the requested places. More...
 
void OptimizeMemoryCompression (const Nnet &nnet, int32 memory_compression_level, NnetComputation *computation)
 Performs optimization to reduce memory usage where possible, making use of the kCompressMatrix and kDecompressMatrix commands. More...
 
int32 MaxOutputTimeInRequest (const ComputationRequest &request)
 
void MoveSizingCommands (const Nnet &nnet, NnetComputation *computation)
 This optimization moves commands that allocate and zero matrices to as late as possible, and moves commands that deallocate matrices to as early as possible. More...
 
void RemoveUnnecessaryZeroing (const Nnet &nnet, NnetComputation *computation)
 This optimization function removes, where possible, commands of type kSetConst. More...
 
static void ComputeCommandPairs (const std::pair< std::vector< int32 >, std::vector< int32 > > &lists, std::vector< std::pair< int32, int32 > > *pairs)
 
void RemoveUnnecessaryAllocation (const Nnet &nnet, NnetComputation *computation)
 This optimization detects cases where we deallocate a matrix, and then later allocate another matrix of the same size; and replaces them with commands of type kAllocFromOther or kAllocFromOtherZeroed. More...
 
void VariableMergingOptimization (const NnetOptimizeOptions &config, const Nnet &nnet, NnetComputation *computation)
 This wraps class VariableMergingOptimizer in a simplified interface. More...
 
void ConvertAdditionToAssignment (const Nnet &nnet, NnetComputation *computation)
 This converts addition operations (things with Add in their names) to copy operations (things with Copy in their names). More...
 
void Optimize (const NnetOptimizeOptions &config, const Nnet &nnet, int32 max_output_time_in_request, NnetComputation *computation)
 This is the top-level function for optimizing a computation. More...
 
static void SplitComputationIntoSegments (const NnetComputation &computation, std::vector< std::pair< int32, int32 > > *segments)
 Split the computation up into segments bounded by kNoOperationMarker. More...
 
void ConsolidateIoOperations (const Nnet &nnet, NnetComputation *computation)
 This optimization puts the input operations (kAcceptInput) and output operations (kProvideOutput) at the very beginning or end of segments of computation, respectively. More...
 
void LimitDerivativeTimes (const Nnet &nnet, const ComputationRequest &request, const NnetOptimizeOptions &opts, NnetComputation *computation)
 This optimization, which has no effect unless you set --min-deriv-time or --max-deriv-time, modifies the backprop operations for efficiency based on the assumption that derivatives for any Cindex with t < min_deriv_time or t > max_deriv_time are zero. More...
 
void UnitTestDescriptorTokenize ()
 
void UnitTestSummarizeVector ()
 
void UnitTestNameMatchesPattern ()
 
bool DescriptorTokenize (const std::string &input, std::vector< std::string > *tokens)
 This function tokenizes input when parsing Descriptor configuration values. More...
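 A hedged example of what the tokenization might produce (the exact splitting rules live in the implementation):

    std::vector<std::string> tokens;
    bool ok = DescriptorTokenize("Append(Offset(input, -1), input)", &tokens);
    // On success, 'tokens' would be expected to contain items like
    // "Append", "(", "Offset", "(", "input", ",", "-1", ")", ... in order.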
 
std::string ErrorContext (std::istream &is)
 Return a string used in error messages. More...
 
std::string ErrorContext (const std::string &str)
 
static void PrintFloatSuccinctly (std::ostream &os, BaseFloat f)
 
std::string SummarizeVector (const VectorBase< float > &vec)
 Returns a string that summarizes a vector fairly succinctly, for printing stats in info lines. More...
 
std::string SummarizeVector (const VectorBase< double > &vec)
 
std::string SummarizeVector (const CuVectorBase< BaseFloat > &cu_vec)
 
void PrintParameterStats (std::ostringstream &os, const std::string &name, const CuVectorBase< BaseFloat > &params, bool include_mean=false)
 Print to 'os' some information about the mean and standard deviation of some parameters, used in Info() functions in nnet-simple-component.cc. More...
 
void PrintParameterStats (std::ostringstream &os, const std::string &name, const CuMatrix< BaseFloat > &params, bool include_mean=false, bool include_row_norms=false, bool include_column_norms=false, bool include_singular_values=false)
 Print to 'os' some information about the mean and standard deviation of some parameters, used in Info() functions in nnet-simple-component.cc. More...
 
void ParseConfigLines (const std::vector< std::string > &lines, std::vector< ConfigLine > *config_lines)
 
bool NameMatchesPattern (const char *name, const char *pattern)
 
void GenerateConfigSequenceSimplest (const NnetGenerationOptions &opts, std::vector< std::string > *configs)
 
void GenerateConfigSequenceSimpleContext (const NnetGenerationOptions &opts, std::vector< std::string > *configs)
 
void GenerateConfigSequenceSimple (const NnetGenerationOptions &opts, std::vector< std::string > *configs)
 
void GenerateConfigSequenceStatistics (const NnetGenerationOptions &opts, std::vector< std::string > *configs)
 
void GenerateConfigSequenceRnn (const NnetGenerationOptions &opts, std::vector< std::string > *configs)
 
void GenerateConfigSequenceRnnClockwork (const NnetGenerationOptions &opts, std::vector< std::string > *configs)
 
void GenerateConfigSequenceLstm (const NnetGenerationOptions &opts, std::vector< std::string > *configs)
 
void GenerateConfigSequenceLstmWithTruncation (const NnetGenerationOptions &opts, std::vector< std::string > *configs)
 
void GenerateConfigSequenceLstmType2 (const NnetGenerationOptions &opts, std::vector< std::string > *configs)
 
void GenerateConfigSequenceCnn (const NnetGenerationOptions &opts, std::vector< std::string > *configs)
 
void GenerateConfigSequenceCnnNew (const NnetGenerationOptions &opts, std::vector< std::string > *configs)
 
void GenerateConfigSequenceRestrictedAttention (const NnetGenerationOptions &opts, std::vector< std::string > *configs)
 
void GenerateConfigSequenceDistribute (const NnetGenerationOptions &opts, std::vector< std::string > *configs)
 
void GenerateConfigSequenceCompositeBlock (const NnetGenerationOptions &opts, std::vector< std::string > *configs)
 Generate a config string with a composite component composed only of block affine, repeated affine, and natural gradient repeated affine components. More...
 
void GenerateConfigSequence (const NnetGenerationOptions &opts, std::vector< std::string > *configs)
 Generates a sequence of one or more config files, output as strings, where the first in the sequence is the initial nnet, and the remaining ones may do things like add layers. More...
 
void ComputeExampleComputationRequestSimple (const Nnet &nnet, ComputationRequest *request, std::vector< Matrix< BaseFloat > > *inputs)
 This function computes an example computation request, for testing purposes. More...
 
static void GenerateRandomComponentConfig (std::string *component_type, std::string *config)
 
Component * GenerateRandomSimpleComponent ()
 Generates a random simple component for testing. More...
 
bool NnetParametersAreIdentical (const Nnet &nnet1, const Nnet &nnet2, BaseFloat threshold)
 Used for testing that the updatable parameters in two networks are the same. More...
 
void GenerateSimpleNnetTrainingExample (int32 num_supervised_frames, int32 left_context, int32 right_context, int32 input_dim, int32 output_dim, int32 ivector_dim, NnetExample *example)
 Low-level function that generates an nnet training example. More...
 
bool ExampleApproxEqual (const NnetExample &eg1, const NnetExample &eg2, BaseFloat delta)
 Returns true if the examples are approximately equal (only intended to be used in testing). More...
 
void ComputeObjectiveFunction (const GeneralMatrix &supervision, ObjectiveType objective_type, const std::string &output_name, bool supply_deriv, NnetComputer *computer, BaseFloat *tot_weight, BaseFloat *tot_objf)
 This function computes the objective function, and if supply_deriv = true, supplies its derivative to the NnetComputation object. More...
 
void UnitTestNnetContext ()
 
void UnitTestConvertRepeatedToBlockAffine ()
 
void UnitTestConvertRepeatedToBlockAffineComposite ()
 
int32 NumOutputNodes (const Nnet &nnet)
 Returns the number of output nodes of this nnet. More...
 
int32 NumInputNodes (const Nnet &nnet)
 Returns the number of input nodes of this nnet. More...
 
bool IsSimpleNnet (const Nnet &nnet)
 This function returns true if the nnet has the following properties: It has an output called "output" (other outputs are allowed but may be ignored). More...
 
void EvaluateComputationRequest (const Nnet &nnet, const ComputationRequest &request, std::vector< std::vector< bool > > *is_computable)
 Given an nnet and a computation request, this function works out which requested outputs in the computation request are computable; it outputs this information as a vector "is_computable" indexed by the same indexes as request.outputs. More...
 
static bool ComputeSimpleNnetContextForShift (const Nnet &nnet, int32 input_start, int32 window_size, int32 *left_context, int32 *right_context)
 
void ComputeSimpleNnetContext (const Nnet &nnet, int32 *left_context, int32 *right_context)
 ComputeSimpleNnetContext computes the left-context and right-context of a nnet. More...
 
void PerturbParams (BaseFloat stddev, Nnet *nnet)
 Calls PerturbParams (with the given stddev) on all updatable components of the nnet. More...
 
void ComponentDotProducts (const Nnet &nnet1, const Nnet &nnet2, VectorBase< BaseFloat > *dot_prod)
 Returns dot products between two networks of the same structure (calls the DotProduct functions of the Updatable components and fills in the output vector). More...
 
std::string PrintVectorPerUpdatableComponent (const Nnet &nnet, const VectorBase< BaseFloat > &vec)
 This function is for printing, to a string, a vector with one element per updatable component of the nnet (e.g. More...
 
BaseFloat DotProduct (const Nnet &nnet1, const Nnet &nnet2)
 Returns dot product between two networks of the same structure (calls the DotProduct functions of the Updatable components and sums up the return values). More...
 
void ZeroComponentStats (Nnet *nnet)
 Zeroes the component stats in all nonlinear components in the nnet. More...
 
void SetLearningRate (BaseFloat learning_rate, Nnet *nnet)
 Sets the underlying learning rate for all the components in the nnet to this value. More...
 
void SetNnetAsGradient (Nnet *nnet)
 Sets nnet as gradient by setting is_gradient_ to true and learning_rate_ to 1 for each UpdatableComponent in nnet. More...
 
void SetRequireDirectInput (bool b, Nnet *nnet)
 Calls the corresponding function in any component of type StatisticsPoolingComponent; used as a way to compute the 'real' left-right context of networks including StatisticsPoolingComponent, which will give you the minimum chunk size they can consume. More...
 
void ScaleNnet (BaseFloat scale, Nnet *nnet)
 Scales the nnet parameters and stats by this scale. More...
 
void AddNnetComponents (const Nnet &src, const Vector< BaseFloat > &alphas, BaseFloat scale, Nnet *dest)
 Does *dest += alpha * src for updatable components (affects nnet parameters), and *dest += scale * src for other components (affects stored stats). More...
 
void AddNnet (const Nnet &src, BaseFloat alpha, Nnet *dest)
 Does *dest += alpha * src (affects nnet parameters and stored stats). More...
 
int32 NumParameters (const Nnet &src)
 Returns the total of the number of parameters in the updatable components of the nnet. More...
 
void VectorizeNnet (const Nnet &src, VectorBase< BaseFloat > *params)
 Copies the nnet parameters to *params, whose dimension must be equal to NumParameters(src). More...
 
void UnVectorizeNnet (const VectorBase< BaseFloat > &params, Nnet *dest)
 Copies the parameters from params to *dest. More...
 
int32 NumUpdatableComponents (const Nnet &dest)
 Returns the number of updatable components in the nnet. More...
 
void FreezeNaturalGradient (bool freeze, Nnet *nnet)
 Controls if natural gradient will be updated. More...
 
void ConvertRepeatedToBlockAffine (CompositeComponent *c_component)
 
void ConvertRepeatedToBlockAffine (Nnet *nnet)
 Convert all components of type RepeatedAffineComponent or NaturalGradientRepeatedAffineComponent to BlockAffineComponent in nnet. More...
 
std::string NnetInfo (const Nnet &nnet)
 This function returns various info about the neural net. More...
 
void SetDropoutProportion (BaseFloat dropout_proportion, Nnet *nnet)
 This function sets the dropout proportion in all dropout components to dropout_proportion value. More...
 
bool HasBatchnorm (const Nnet &nnet)
 Returns true if nnet has at least one component of type BatchNormComponent. More...
 
void ScaleBatchnormStats (BaseFloat batchnorm_stats_scale, Nnet *nnet)
 This function scales the batchnorm stats of any batchnorm components (components of type BatchNormComponent) in 'nnet' by the scale 'batchnorm_stats_scale'. More...
 
void RecomputeStats (const std::vector< NnetExample > &egs, Nnet *nnet)
 This function zeros the stored component-level stats in the nnet using ZeroComponentStats(), then recomputes them with the supplied egs. More...
 
void SetBatchnormTestMode (bool test_mode, Nnet *nnet)
 This function affects only components of type BatchNormComponent. More...
 
void SetDropoutTestMode (bool test_mode, Nnet *nnet)
 This function affects components of child-classes of RandomComponent. More...
 
void ResetGenerators (Nnet *nnet)
 This function calls 'ResetGenerator()' on all components in 'nnet' that inherit from class RandomComponent. More...
 
void FindOrphanComponents (const Nnet &nnet, std::vector< int32 > *components)
 This function finds a list of components that are never used, and outputs the integer component indexes (you can use these to index nnet.GetComponentNames() to get their names). More...
 
void FindOrphanNodes (const Nnet &nnet, std::vector< int32 > *nodes)
 This function finds a list of nodes that are never used to compute any output, and outputs the integer node indexes (you can use these to index nnet.GetNodeNames() to get their names). More...
 
void ConstrainOrthonormalInternal (BaseFloat scale, CuMatrixBase< BaseFloat > *M)
 
void ConstrainOrthonormal (Nnet *nnet)
 This function, to be called after processing every minibatch, is responsible for enforcing the orthogonality constraint for any components of type LinearComponent or inheriting from AffineComponent that have the "orthonormal_constraint" value set. More...
 
void ConsolidateMemory (Nnet *nnet)
 This just calls ConsolidateMemory() on all the components of the nnet. More...
 
void ReduceRankOfComponents (const std::string component_name_pattern, int32 rank, Nnet *nnet)
 
void ReadEditConfig (std::istream &config_file, Nnet *nnet)
 ReadEditConfig() reads a file with a similar-looking format to the config file read by Nnet::ReadConfig(), but this consists of a sequence of operations to perform on an existing network, mostly modifying components. More...
 
bool NnetIsRecurrent (const Nnet &nnet)
 Returns true if 'nnet' has some kind of recurrency. More...
 
void CollapseModel (const CollapseModelConfig &config, Nnet *nnet)
 This function modifies the neural net for efficiency, in a way that is suitable to be done at test time. More...
 
bool UpdateNnetWithMaxChange (const Nnet &delta_nnet, BaseFloat max_param_change, BaseFloat max_change_scale, BaseFloat scale, Nnet *nnet, std::vector< int32 > *num_max_change_per_component_applied, int32 *num_max_change_global_applied)
 This function does the operation '*nnet += scale * delta_nnet', while respecting any max-parameter-change (max-param-change) specified in the updatable components, and also the global max-param-change specified as 'max_param_change'. More...
 
int32 GetNumNvalues (const std::vector< NnetIo > &io_vec, bool exhaustive)
 This utility function can be used to obtain the number of distinct 'n' values in a training example. More...
 
void ApplyL2Regularization (const Nnet &nnet, BaseFloat l2_regularize_scale, Nnet *delta_nnet)
 This function is used as part of the regular training workflow, prior to UpdateNnetWithMaxChange(). More...
 
bool UpdateNnetWithMaxChange (const Nnet &delta_nnet, BaseFloat max_param_change, BaseFloat max_change_scale, BaseFloat scale, Nnet *nnet, MaxChangeStats *stats)
 
BaseFloat KlDivergence (const Vector< BaseFloat > &p, const Vector< BaseFloat > &q)
 
void PrintPriorDiagnostics (const Vector< BaseFloat > &old_priors, const Vector< BaseFloat > &new_priors)
 
void SetPriors (const TransitionModel &tmodel, const Vector< double > &transition_accs, double prior_floor, AmNnetSimple *am_nnet)
 
double ComputeObjf (bool batchnorm_test_mode, bool dropout_test_mode, const std::vector< NnetExample > &egs, const Nnet &nnet, NnetComputeProb *prob_computer)
 
void UpdateNnetMovingAverage (int32 num_models, const Nnet &nnet, Nnet *moving_average_nnet)
 
void RenameOutputs (const std::string &new_name, NnetExample *eg)
 
void ScaleSupervisionWeight (BaseFloat weight, NnetExample *eg)
 
int32 GetCount (double expected_count)
 
bool ContainsSingleExample (const NnetExample &eg, int32 *min_input_t, int32 *max_input_t, int32 *min_output_t, int32 *max_output_t)
 Returns true if the "eg" contains just a single example, meaning that all the "n" values in the indexes are zero, and the example has NnetIo members named both "input" and "output". More...
 
void FilterExample (const NnetExample &eg, int32 min_input_t, int32 max_input_t, int32 min_output_t, int32 max_output_t, NnetExample *eg_out)
 This function filters the indexes (and associated feature rows) in a NnetExample, removing any index/row in an NnetIo named "input" with t < min_input_t or t > max_input_t and any index/row in an NnetIo named "output" with t < min_output_t or t > max_output_t. More...
 
bool SelectFromExample (const NnetExample &eg, std::string frame_str, int32 left_context, int32 right_context, int32 frame_shift, NnetExample *eg_out)
 This function is responsible for possibly selecting one frame from multiple supervised frames, and reducing the left and right context as specified. More...
 
static bool ProcessFile (const discriminative::SplitDiscriminativeSupervisionOptions &config, const TransitionModel &tmodel, const MatrixBase< BaseFloat > &feats, const MatrixBase< BaseFloat > *ivector_feats, int32 ivector_period, const discriminative::DiscriminativeSupervision &supervision, const std::string &utt_id, bool compress, UtteranceSplitter *utt_splitter, NnetDiscriminativeExampleWriter *example_writer)
 
void ApplyAffineTransform (MatrixBase< BaseFloat > &transform, int32 num_channels, MatrixBase< BaseFloat > *image, FillMode fill_mode)
 This function applies a geometric transformation 'transform' to the image. More...
 
void PerturbImage (const ImageAugmentationConfig &config, MatrixBase< BaseFloat > *image)
 This function randomly modifies (perturbs) the image by applying different geometric transformations according to the options in 'config'. More...
 
void PerturbImageInNnetExample (const ImageAugmentationConfig &config, NnetExample *eg)
 This function does image perturbation as directed by 'config'. The example 'eg' is expected to contain a NnetIo member with the name 'input', representing an image. More...
 
static bool ProcessFile (const GeneralMatrix &feats, const MatrixBase< BaseFloat > *ivector_feats, int32 ivector_period, const MatrixBase< BaseFloat > &targets, const std::string &utt_id, bool compress, int32 num_targets, int32 length_tolerance, UtteranceSplitter *utt_splitter, NnetExampleWriter *example_writer)
 
static bool ProcessFile (const GeneralMatrix &feats, const MatrixBase< BaseFloat > *ivector_feats, int32 ivector_period, const Posterior &pdf_post, const std::string &utt_id, bool compress, int32 num_pdfs, int32 length_tolerance, UtteranceSplitter *utt_splitter, NnetExampleWriter *example_writer)
 
int32 NumOutputIndexes (const NnetExample &eg)
 
void DivideIntoPieces (int32 a, int32 b, std::vector< int32 > *pieces)
 This function divides the number 'a' into 'b' pieces, such that the sum of the pieces equals 'a' and no two pieces differ by more than 1. More...
 
static void RunNnetComputation (const MatrixBase< BaseFloat > &features, const Nnet &nnet, CachingOptimizingCompiler *compiler, Vector< BaseFloat > *xvector)
 
static void ProcessRangeFile (const std::string &range_rxfilename, unordered_map< std::string, std::vector< ChunkInfo *> > *utt_to_chunks)
 
static void WriteExamples (const MatrixBase< BaseFloat > &feats, const std::vector< ChunkInfo *> &chunks, const std::string &utt, bool compress, int32 num_pdfs, int32 *num_egs_written, std::vector< NnetExampleWriter *> *example_writers)
 

Variables

static bool computation_checker_warned_unused_input = false
 Checks that we never use variables before they are allocated or after they are deallocated, and some other checks that can be done from the MatrixAccesses. More...
 
const int kNoTime = std::numeric_limits<int32>::min()
 

Typedef Documentation

◆ Cindex

typedef std::pair<int32, Index> Cindex

Definition at line 115 of file nnet-common.h.

◆ NnetChainExampleWriter

◆ NnetDiscriminativeExampleWriter

◆ NnetExampleWriter

◆ RandomAccessNnetChainExampleReader

◆ RandomAccessNnetDiscriminativeExampleReader

◆ RandomAccessNnetExampleReader

◆ SequentialNnetChainExampleReader

◆ SequentialNnetDiscriminativeExampleReader

◆ SequentialNnetExampleReader

Enumeration Type Documentation

◆ AccessType

enum AccessType
Enumerator
kReadAccess 
kWriteAccess 
kReadWriteAccess 

Definition at line 75 of file nnet-analyze.h.

◆ CommandType

CommandType is an enum that describes the category of the command used in the NnetComputation.

We declare it outside that class because it's so frequently used and we got tired of typing NnetComputation:: everywhere. We document the commands here. Note: for operations that naturally need to operate on entire matrices (i.e. allocation commands and input and output commands), we use the submatrix indexes of them, which turns out to be more convenient for optimization; but these submatrix indexes must refer to the whole of a matrix.

  • kAllocMatrix: Allocate a matrix (its values will be undefined). arg1 = submatrix index, which must refer to a whole matrix.
  • kDeallocMatrix: Deallocate a matrix. arg1 = submatrix index.
  • kSwapMatrix: initialize matrix with submatrix index arg1 using memory from matrix with submatrix index arg2 (using shallow swap). Both submatrices must refer to whole matrices. The expectation is that prior to the swap, arg1 was empty and arg2 was nonempty, but the execution code does not enforce this.
  • kSetConst: set all elements of submatrix index 'arg1' to the value 'alpha'.
  • kPropagate: Forward computation of neural net, see Component::Propagate()
    • arg1 is the component-index in the neural net
    • arg2 is index into ComponentPrecomputedIndexes (0 if NULL; always 0 for simple Components)
    • arg3 is sub-matrix index of input
    • arg4 is sub-matrix index of output
    • arg5 is the index of the memo saved from Propagate()'s return value, or 0 if it saves no memo.
    • arg6 is 1 if we need to call StoreStats() after the Propagate, or 0 if we don't. We used to have a separate command for storing the stats, but that has been removed.
  • kBackprop: Do the back-propagation operation, see Component::Backprop()
    • arg1 is index of component in neural net
    • arg2 is index into ComponentPrecomputedIndexes (0 if NULL; always 0 for simple Components)
    • arg3 is submatrix-index of input value (input to Propagate()); 0 if unused
    • arg4 is submatrix-index of output value (output of Propagate()); 0 if unused
    • arg5 is submatrix-index of output derivative
    • arg6 is submatrix-index of input derivative; 0 if unused.
    • arg7 is the index of the memo which is generated from the corresponding Propagate() function if the flag kUsesMemo is set; 0 if unused.
  • kBackpropNoModelUpdate: as kBackprop, but does not set the 'to_update' argument to the Backprop call, even if the model is updatable, so it skips the model-update phase of backprop.
  • kMatrixCopy: Copy (alpha times contents of sub-matrix arg2) to sub-matrix arg1, currently implemented as copy then scale. Note: to implement scaling a matrix, you can use kMatrixCopy with arg1 == arg2 and it won't do any redundant copying.
  • kMatrixAdd: Add (alpha times contents of sub-matrix arg2) to sub-matrix arg1
  • kCopyRows: call CopyRows() on sub-matrix arg1 with sub-matrix arg2 and indexes[arg3] as arguments, then if alpha != 1.0, scale sub-matrix arg1 by alpha.
  • kAddRows: call AddRows() on sub-matrix arg1 with alpha, sub-matrix arg2 and indexes[arg3] as arguments.
  • kAddRowsMulti, kAddToRowsMulti, kCopyRowsMulti, kCopyToRowsMulti: Call the corresponding function in class CuMatrix (actually the names do not have 'Multi' in them, but they are the ones that accept a vector of 'Real*').
    • arg1 is sub-matrix index of *this matrix in operation
    • arg2 is index into "indexes_multi", of which each pair is (sub-matrix index, row index) (or (-1,-1) for NULL marker), which is turned into a vector of BaseFloat* (pointers to matrix rows) before being given as the argument to the function. In the 'Add' functions 'alpha' is provided as an argument; for the 'Copy' functions, we scale the destination by 'alpha' after the copy, if alpha != 1.0. (However, for implementation reasons, kCopyToRowsMulti does not currently support alpha != 1.0 and will crash, so we avoid generating this code).
  • kAddRowRanges: call AddRowRanges() on sub-matrix arg1, with arg2 as source sub-matrix, and indexes given by indexes_ranges[arg3]. We use the "alpha" as if AddRowRanges() accepted that argument, even though it doesn't (we fake it using other calls, if alpha != 1.0).
  • kCompressMatrix: Compresses the matrix which should be referred to by submatrix-index arg1. arg2 is a number that determines the compression type (it's converted from the enum CuCompressedMatrixType; 1=int8, 2=uint8, 3=int16, 4=uint16), and alpha determines the 'range' parameter (c.f. NewCuCompressedMatrix()). arg3 will be converted to the 'truncate' argument to the class CuCompressedMatrix; it should be false (0) if you know that the input is limited to the allowed range, and true (1) if the input may exceed that range (see docs for CuCompressedMatrix).
  • kDecompressMatrix: Decompresses the matrix which is referred to by submatrix-index arg1 (it should previously have been compressed).
  • kAcceptInput: accepts a matrix of input from the user, which may be either features, or derivatives w.r.t. the output. arg1 is the submatrix index of a whole matrix that the input goes to, and arg2 is the index of the network node associated with it (e.g. the node of "input" or "ivector"), for purposes of double checking.
  • kProvideOutput: outputs a matrix to the user: either a network output, or a matrix of derivatives w.r.t. an input. arg1 is the submatrix index of the output (which we expect to be a whole matrix), arg2 is the index of the network node associated with it (e.g. the node for "output").
  • kNoOperation: does nothing, and will be removed by optimization code (sometimes useful during optimization)
  • kNoOperationPermanent: like kNoOperation, but won't be removed by optimization code. This is used to ensure that for 'trivial' computations, which just copy the input to the output, the block of commands for the forward or backward propagation is nonempty (to avoid confusing the computation code).
  • kNoOperationMarker: does nothing, but used to mark end of a block of commands (like forward commands).
  • kNoOperationLabel: does nothing, but is the destination for the kGotoLabel command.
  • kGotoLabel: jumps to the kNoOperationLabel command. arg1 must be set to the location of that command. Since there are no conditionals, the kGotoLabel command should be the last command, as remaining commands will be unreachable.
Enumerator
kAllocMatrix 
kDeallocMatrix 
kSwapMatrix 
kSetConst 
kPropagate 
kBackprop 
kBackpropNoModelUpdate 
kMatrixCopy 
kMatrixAdd 
kCopyRows 
kAddRows 
kCopyRowsMulti 
kCopyToRowsMulti 
kAddRowsMulti 
kAddToRowsMulti 
kAddRowRanges 
kCompressMatrix 
kDecompressMatrix 
kAcceptInput 
kProvideOutput 
kNoOperation 
kNoOperationPermanent 
kNoOperationMarker 
kNoOperationLabel 
kGotoLabel 

Definition at line 288 of file nnet-computation.h.

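To make the command layout concrete, here is a minimal sketch (not part of Kaldi) of walking the command stream of a compiled computation; it assumes 'computation' came from the compiler and relies only on the 'commands' vector and the argument conventions documented above.

#include <iostream>
#include "nnet3/nnet-computation.h"

using namespace kaldi;
using namespace kaldi::nnet3;

// Sketch: print a one-line summary of the boundary commands of a computation.
void SummarizeBoundaryCommands(const NnetComputation &computation) {
  for (size_t i = 0; i < computation.commands.size(); i++) {
    const NnetComputation::Command &c = computation.commands[i];
    switch (c.command_type) {
      case kAcceptInput:    // arg1 = whole-matrix submatrix, arg2 = node index
        std::cout << "accept input -> submatrix " << c.arg1 << "\n";
        break;
      case kProvideOutput:  // arg1 = whole-matrix submatrix, arg2 = node index
        std::cout << "provide output <- submatrix " << c.arg1 << "\n";
        break;
      case kGotoLabel:      // if present, must be the last command ('looped' computation)
        std::cout << "goto label at command " << c.arg1 << "\n";
        break;
      default:
        break;              // kPropagate, kBackprop, etc.: omitted in this sketch
    }
  }
}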

◆ ComponentProperties

Enumerator
kSimpleComponent 
kUpdatableComponent 
kPropagateInPlace 
kPropagateAdds 
kReordersIndexes 
kBackpropAdds 
kBackpropNeedsInput 
kBackpropNeedsOutput 
kBackpropInPlace 
kStoresStats 
kInputContiguous 
kOutputContiguous 
kUsesMemo 
kRandomComponent 

Definition at line 37 of file nnet-component-itf.h.

{
  kSimpleComponent = 0x001,  // true if number of rows of input equals number of rows
                             // of output and this component doesn't care about the indexes
                             // (i.e. it maps each row of input to each row of output without
                             // regard to the index values). Will normally be true.
  kUpdatableComponent = 0x002,  // true if the component has parameters that can
                                // be updated. Components that return this flag
                                // must be dynamic_castable to type
                                // UpdatableComponent (but components of type
                                // UpdatableComponent do not have to return this
                                // flag, e.g. if this instance is not really
                                // updatable).
  kPropagateInPlace = 0x004,  // true if we can do the propagate operation in-place
                              // (input and output matrices are the same).
                              // Note: if doing backprop, you'd also need to check
                              // that the kBackpropNeedsInput property is not true.
  kPropagateAdds = 0x008,  // true if the Propagate function adds to, rather
                           // than setting, its output, for non-in-place
                           // propagation. The Component chooses whether to add
                           // or set, and the calling code has to accommodate it.
  kReordersIndexes = 0x010,  // true if the ReorderIndexes function might reorder
                             // the indexes (otherwise we can skip calling it).
                             // Must not be set for simple components.
  kBackpropAdds = 0x020,  // true if the Backprop function adds to, rather than
                          // setting, the "in_deriv" output for non-in-place
                          // backprop. The Component chooses whether to add or
                          // set, and the calling code has to accommodate it.
  kBackpropNeedsInput = 0x040,  // true if backprop operation needs access to
                                // forward-pass input.
  kBackpropNeedsOutput = 0x080,  // true if backprop operation needs access to
                                 // forward-pass output (e.g. true for Sigmoid).
  kBackpropInPlace = 0x100,  // true if we can do the backprop operation in-place
                             // (input and output matrices may be the same).
  kStoresStats = 0x200,  // true if the StoreStats operation stores
                         // statistics e.g. on average node activations and
                         // derivatives of the nonlinearity (as it does for
                         // Tanh, Sigmoid, ReLU and Softmax).
  kInputContiguous = 0x400,  // true if the component requires its input data (and
                             // input derivatives) to have Stride() == NumCols().
  kOutputContiguous = 0x800,  // true if the component requires its output data (and
                              // output derivatives) to have Stride() == NumCols().
  kUsesMemo = 0x1000,  // true if the component returns a void* pointer from its
                       // Propagate() function that needs to be passed into the
                       // corresponding Backprop function.
  kRandomComponent = 0x2000  // true if the component has some kind of
                             // randomness, like DropoutComponent (these should
                             // inherit from class RandomComponent).
};
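
A sketch of how these flags are meant to be queried (illustrative, not library code; 'comp' is assumed to be a Component* obtained elsewhere, e.g. from Nnet::GetComponent()):

int32 props = comp->Properties();
if (props & kUpdatableComponent) {
  // kUpdatableComponent guarantees this dynamic_cast will succeed.
  const UpdatableComponent *uc = dynamic_cast<const UpdatableComponent*>(comp);
  BaseFloat lrate = uc->LearningRate();  // per-component learning rate
  // ... use lrate ...
}
// Per the comment on kPropagateInPlace above, in-place propagation during
// training also requires that the input is not needed for backprop:
bool can_propagate_in_place = (props & kPropagateInPlace) != 0 &&
                              (props & kBackpropNeedsInput) == 0;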

◆ FillMode

enum FillMode
Enumerator
kNearest 
kReflect 

Definition at line 31 of file nnet3-egs-augment-image.cc.

◆ NodeType

enum NodeType
Enumerator
kInput 
kDescriptor 
kComponent 
kDimRange 
kNone 

Definition at line 55 of file nnet-nnet.h.

◆ ObjectiveType

This enum is for a kind of annotation we associate with output nodes of the network; it's for the convenience of calling code so that if the objective is one of a few standard types, we can compute it directly and know how to interpret the supervision labels.

However, the core of the framework never makes use of the objective types, other than making them available to calling code which then supplies the derivatives.

  • Objective type kLinear is intended for neural nets where the final component is a LogSoftmaxComponent, so the log-prob (negative cross-entropy) objective is just a linear function of the input.
  • Objective type kQuadratic is used to mean the objective function f(x, y) = -0.5 (x-y).(x-y), which is to be maximized, as in the kLinear case.
Enumerator
kLinear 
kQuadratic 

Definition at line 52 of file nnet-nnet.h.
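
For concreteness, a minimal sketch (plain C++, not Kaldi code) of the kQuadratic objective above for a single row, together with its derivative with respect to the network output x:

#include <cstddef>
#include <vector>

// f(x, y) = -0.5 * (x-y).(x-y); the derivative w.r.t. x[i] is (y[i] - x[i]).
double QuadraticObjf(const std::vector<double> &x,  // network output row
                     const std::vector<double> &y,  // supervision row
                     std::vector<double> *deriv) {
  double objf = 0.0;
  deriv->resize(x.size());
  for (size_t i = 0; i < x.size(); i++) {
    double d = x[i] - y[i];
    objf -= 0.5 * d * d;
    (*deriv)[i] = y[i] - x[i];
  }
  return objf;  // to be maximized; equals 0.0 exactly when x == y
}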

Function Documentation

◆ AddNnet()

void AddNnet(const Nnet &src, BaseFloat alpha, Nnet *dest)

Does *dest += alpha * src (affects nnet parameters and stored stats).

Definition at line 349 of file nnet-utils.cc.

References Component::Add(), Nnet::GetComponent(), KALDI_ERR, and Nnet::NumComponents().

Referenced by main(), kaldi::ReadModels(), NnetDiscriminativeTrainer::Train(), and UpdateNnetMovingAverage().

{
  if (src.NumComponents() != dest->NumComponents())
    KALDI_ERR << "Trying to add incompatible nnets.";
  for (int32 c = 0; c < src.NumComponents(); c++) {
    const Component *src_comp = src.GetComponent(c);
    Component *dest_comp = dest->GetComponent(c);
    dest_comp->Add(alpha, *src_comp);
  }
}

◆ AddNnetComponents()

void AddNnetComponents(const Nnet &src, const Vector<BaseFloat> &alphas, BaseFloat scale, Nnet *dest)

Does *dest += alpha * src for updatable components (affects nnet parameters), and *dest += scale * src for other components (affects stored stats).

Here, alphas is a vector of size equal to the number of updatable components

Definition at line 322 of file nnet-utils.cc.

References Component::Add(), VectorBase< Real >::Dim(), Nnet::GetComponent(), rnnlm::i, KALDI_ASSERT, KALDI_ERR, kUpdatableComponent, Nnet::NumComponents(), and Component::Properties().

Referenced by UpdateNnetWithMaxChange().

{
  if (src.NumComponents() != dest->NumComponents())
    KALDI_ERR << "Trying to add incompatible nnets.";
  int32 i = 0;
  for (int32 c = 0; c < src.NumComponents(); c++) {
    const Component *src_comp = src.GetComponent(c);
    Component *dest_comp = dest->GetComponent(c);
    if (src_comp->Properties() & kUpdatableComponent) {
      // For now all updatable components inherit from class UpdatableComponent.
      // If that changes in future, we will change this code.
      const UpdatableComponent *src_uc =
          dynamic_cast<const UpdatableComponent*>(src_comp);
      UpdatableComponent *dest_uc =
          dynamic_cast<UpdatableComponent*>(dest_comp);
      if (src_uc == NULL || dest_uc == NULL)
        KALDI_ERR << "Updatable component does not inherit from class "
            "UpdatableComponent; change this code.";
      KALDI_ASSERT(i < alphas.Dim());
      dest_uc->Add(alphas(i++), *src_uc);
    } else {  // add stored stats
      dest_comp->Add(scale, *src_comp);
    }
  }
  KALDI_ASSERT(i == alphas.Dim());
}
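
A sketch of how this is typically driven from max-change code (illustrative values; see UpdateNnetWithMaxChange(), which is the actual caller):

// One alpha per *updatable* component; 'scale' applies to the stats of the rest.
Vector<BaseFloat> alphas(NumUpdatableComponents(delta_nnet));
alphas.Set(1.0);  // entries would be reduced where a component's max-change bites
AddNnetComponents(delta_nnet, alphas, 1.0, &nnet);  // nnet += delta_nnet, per-component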

◆ AddTimeOffsetToComputationRequest()

void kaldi::nnet3::AddTimeOffsetToComputationRequest(int32 t_offset, ComputationRequest *request)

Definition at line 235 of file nnet-compile-looped.cc.

References rnnlm::i, ComputationRequest::inputs, rnnlm::j, and ComputationRequest::outputs.

Referenced by ExtrapolateComputationRequest().

{
  for (size_t i = 0; i < request->inputs.size(); i++) {
    size_t size = request->inputs[i].indexes.size();
    for (size_t j = 0; j < size; j++)
      request->inputs[i].indexes[j].t += t_offset;
  }
  for (size_t i = 0; i < request->outputs.size(); i++) {
    size_t size = request->outputs[i].indexes.size();
    for (size_t j = 0; j < size; j++)
      request->outputs[i].indexes[j].t += t_offset;
  }
}

◆ AppendCindexes()

void AppendCindexes(int32 node, const std::vector<Index> &indexes, std::vector<Cindex> *out)

Appends to 'out' the pairs (node, indexes[0]), (node, indexes[1]), ...

Definition at line 1384 of file nnet-compile.cc.

References rnnlm::i.

Referenced by Compiler::OutputDebugInfo().

{
  size_t indexes_size = indexes.size();
  if (indexes_size > out->size())
    out->reserve(out->size() + indexes_size);
  for (size_t i = 0; i < indexes_size; i++)
    out->push_back(Cindex(node, indexes[i]));
}

◆ ApplyAffineTransform()

void kaldi::nnet3::ApplyAffineTransform(MatrixBase<BaseFloat> &transform, int32 num_channels, MatrixBase<BaseFloat> *image, FillMode fill_mode)

This function applies a geometric transformation 'transform' to the image.

Reference: Digital Image Processing book by Gonzalez and Woods.

Parameters
[in] transform  The 3x3 geometric transformation matrix to apply.
[in] num_channels  Number of channels (i.e. colors) of the image.
[in,out] image  The image matrix to be modified. image->NumRows() is the width (number of x values) in the image; image->NumCols() is the height times number of channels (channel varies the fastest).

Definition at line 110 of file nnet3-egs-augment-image.cc.

References KALDI_ASSERT, kNearest, ImageAugmentationConfig::num_channels, MatrixBase< Real >::NumCols(), and MatrixBase< Real >::NumRows().

Referenced by PerturbImage().

{
  int32 num_rows = image->NumRows(),
      num_cols = image->NumCols(),
      height = num_cols / num_channels,
      width = num_rows;
  KALDI_ASSERT(num_cols % num_channels == 0);
  Matrix<BaseFloat> original_image(*image);
  for (int32 r = 0; r < width; r++) {
    for (int32 c = 0; c < height; c++) {
      // (r_old, c_old) is the coordinate of the pixel in the original image
      // while (r, c) is the coordinate in the new (transformed) image.
      BaseFloat r_old = transform(0, 0) * r +
          transform(0, 1) * c + transform(0, 2);
      BaseFloat c_old = transform(1, 0) * r +
          transform(1, 1) * c + transform(1, 2);
      // We are going to do bilinear interpolation between 4 closest points
      // to the point (r_old, c_old) of the original image. We have:
      // r1 <= r_old <= r2
      // c1 <= c_old <= c2
      int32 r1 = static_cast<int32>(floor(r_old));
      int32 c1 = static_cast<int32>(floor(c_old));
      int32 r2 = r1 + 1;
      int32 c2 = c1 + 1;

      // These weights determine how much each of the 4 points contributes
      // to the final interpolated value:
      BaseFloat weight_11 = (r2 - r_old) * (c2 - c_old),
          weight_12 = (r2 - r_old) * (c_old - c1),
          weight_21 = (r_old - r1) * (c2 - c_old),
          weight_22 = (r_old - r1) * (c_old - c1);
      // Handle edge conditions:
      if (fill_mode == kNearest) {
        if (r1 < 0) {
          r1 = 0;
          if (r2 < 0) r2 = 0;
        }
        if (r2 >= width) {
          r2 = width - 1;
          if (r1 >= width) r1 = width - 1;
        }
        if (c1 < 0) {
          c1 = 0;
          if (c2 < 0) c2 = 0;
        }
        if (c2 >= height) {
          c2 = height - 1;
          if (c1 >= height) c1 = height - 1;
        }
      } else {
        KALDI_ASSERT(fill_mode == kReflect);
        if (r1 < 0) {
          r1 = -r1;
          if (r2 < 0) r2 = -r2;
        }
        if (r2 >= width) {
          r2 = 2 * width - 2 - r2;
          if (r1 >= width) r1 = 2 * width - 2 - r1;
        }
        if (c1 < 0) {
          c1 = -c1;
          if (c2 < 0) c2 = -c2;
        }
        if (c2 >= height) {
          c2 = 2 * height - 2 - c2;
          if (c1 >= height) c1 = 2 * height - 2 - c1;
        }
      }
      for (int32 ch = 0; ch < num_channels; ch++) {
        // find the values at the 4 points
        BaseFloat p11 = original_image(r1, num_channels * c1 + ch),
            p12 = original_image(r1, num_channels * c2 + ch),
            p21 = original_image(r2, num_channels * c1 + ch),
            p22 = original_image(r2, num_channels * c2 + ch);
        (*image)(r, num_channels * c + ch) = weight_11 * p11 + weight_12 * p12 +
            weight_21 * p21 + weight_22 * p22;
      }
    }
  }
}
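
A usage sketch (hypothetical values; 'image' and 'num_channels' are assumed to exist): the matrix maps new coordinates (r, c) to old coordinates (r_old, c_old), so shifting the image content by +1 row means subtracting 1 inside the transform.

Matrix<BaseFloat> transform(3, 3);
transform.SetUnit();     // identity: r_old = r, c_old = c
transform(0, 2) = -1.0;  // r_old = r - 1, i.e. shift content by +1 row
ApplyAffineTransform(transform, num_channels, &image, kNearest);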

◆ ApplyL2Regularization()

void ApplyL2Regularization(const Nnet &nnet, BaseFloat l2_regularize_scale, Nnet *delta_nnet)

This function is used as part of the regular training workflow, prior to UpdateNnetWithMaxChange().

For each updatable component c in the neural net, suppose it has a l2-regularization constant alpha set at the component level (see UpdatableComponent::L2Regularization()), and a learning-rate eta, then this function does (and this is not real code):

delta_nnet->c -= 2.0 * l2_regularize_scale * alpha * eta * nnet.c

The factor of -1.0 comes from the fact that we are maximizing, and we'd add the l2 regularization term (of the form ||params||_2^2, i.e. squared l2 norm) in the objective function with negative sign; the factor of 2.0 comes from the derivative of the squared parameters. The factor 'l2_regularize_scale' is provided to this function, see below for an explanation.

Note: the way we do it is a little bit approximate, due to the interaction with natural gradient. The issue is that the regular gradients are multiplied by the inverse of the approximated, smoothed and factored inverse Fisher matrix, but the l2 gradients are not. This means that what we're optimizing is not exactly the (regular objective plus the L2 term)– we could view it as optimizing (regular objective plus the l2 term times the Fisher matrix)– with the proviso that the Fisher matrix has been scaled in such a way that the amount of parameter change is not affected, so this is not an issue of affecting the overall strength of l2, just an issue of the direction-wise weighting. In effect, the l2 term will be larger, relative to the gradient contribution, in directions where the Fisher matrix is large. This is probably not ideal– but it's hard to judge without experiments. Anyway the l2 effect is small enough, and the Fisher matrix sufficiently smoothed with the identity, that I doubt this makes much of a difference.

Parameters
[in] nnet  The neural net that is being trained; expected to be different from delta_nnet.
[in] l2_regularize_scale  A scale on the l2 regularization. Usually this will be equal to the number of distinct examples (e.g. the number of chunks of speech -- more precisely, the number of distinct 'n' values) in the minibatch, but this is multiplied by a configuration value --l2-regularize-factor passed in from the command line. The reason for making l2 proportional to the number of elements in the minibatch is that we add the parameter gradients over the minibatch (we don't average), so multiplying the l2 factor by the number of elements in the minibatch is necessary to make the amount of l2 vs. gradient contribution stay the same when we vary the minibatch size. The --l2-regularize-factor option is provided so that the calling script can correct for the effects of parallelization via model-averaging (we'd normally set this to 1/num-parallel-jobs).
[out] delta_nnet  The neural net containing the parameter updates; this is a copy of 'nnet' that is used for purposes of momentum and applying max-change values. This is what this code adds to.

Definition at line 2244 of file nnet-utils.cc.

References Component::Add(), Nnet::GetComponent(), KALDI_ASSERT, kUpdatableComponent, UpdatableComponent::L2Regularization(), UpdatableComponent::LearningRate(), Nnet::NumComponents(), and Component::Properties().

Referenced by CollapseModelConfig::CollapseModelConfig(), NnetChainTrainer::TrainInternal(), NnetTrainer::TrainInternal(), NnetChainTrainer::TrainInternalBackstitch(), and NnetTrainer::TrainInternalBackstitch().

{
  if (l2_regularize_scale == 0.0)
    return;
  for (int32 c = 0; c < nnet.NumComponents(); c++) {
    const Component *src_component_in = nnet.GetComponent(c);
    if (src_component_in->Properties() & kUpdatableComponent) {
      const UpdatableComponent *src_component =
          dynamic_cast<const UpdatableComponent*>(src_component_in);
      UpdatableComponent *dest_component =
          dynamic_cast<UpdatableComponent*>(delta_nnet->GetComponent(c));
      // The following code will segfault if they aren't both updatable, which
      // would be a bug in the calling code.
      BaseFloat lrate = dest_component->LearningRate(),
          l2_regularize = dest_component->L2Regularization();
      KALDI_ASSERT(lrate >= 0 && l2_regularize >= 0);
      BaseFloat scale = -2.0 * l2_regularize_scale * lrate * l2_regularize;
      if (scale != 0.0)
        dest_component->Add(scale, *src_component);
    }
  }
}
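
A worked example of the scale computed inside (illustrative, made-up values):

BaseFloat lrate = 0.001,               // eta, the component's learning rate
          l2_regularize = 0.01,        // alpha, the component-level l2 constant
          l2_regularize_scale = 64.0;  // e.g. 64 distinct sequences in the minibatch
BaseFloat scale = -2.0 * l2_regularize_scale * lrate * l2_regularize;
// scale == -0.00128: each minibatch, the component in delta_nnet gets
// -0.00128 times the corresponding parameters of 'nnet' added to it,
// i.e. a slight shrinkage of the parameters toward zero.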

◆ AssertGraphEqual()

bool kaldi::nnet3::AssertGraphEqual(const std::vector<std::vector<int32> > &graph1, const std::vector<std::vector<int32> > &graph2)

Definition at line 26 of file nnet-graph-test.cc.

References rnnlm::i, and rnnlm::j.

Referenced by UnitTestComputeGraphTranspose(), UnitTestFindSccs(), and UnitTestMakeSccGraph().

{
  if (graph1.size() != graph2.size()) { return false; }
  for (int32 i = 0; i < graph1.size(); ++i) {
    if (graph1[i].size() != graph2[i].size()) { return false; }
    for (int32 j = 0; j < graph1[i].size(); ++j) {
      if (graph1[i][j] != graph2[i][j]) { return false; }
    }
  }
  return true;
}

◆ AssertVectorEqual()

bool kaldi::nnet3::AssertVectorEqual(const std::vector<int32> &vec1, const std::vector<int32> &vec2)

Definition at line 38 of file nnet-graph-test.cc.

References rnnlm::i.

Referenced by UnitTestComputeTopSortOrder(), and UnitTestComputeTopSortOrder2().

{
  if (vec1.size() != vec2.size()) { return false; }
  for (int32 i = 0; i < vec1.size(); ++i) {
    if (vec1[i] != vec2[i]) { return false; }
  }
  return true;
}

◆ BuildTestGraph()

void kaldi::nnet3::BuildTestGraph ( std::vector< std::vector< int32 > > *  graph)

Definition at line 47 of file nnet-graph-test.cc.

References KALDI_ASSERT.

Referenced by UnitTestComputeGraphTranspose(), UnitTestFindSccs(), and UnitTestMakeSccGraph().

{
  KALDI_ASSERT(graph != NULL);
  graph->clear();
  graph->resize(8);

  // We create the following graph for testing.
  // 0 --> 4
  // 1 --> 0
  // 2 --> 1 3
  // 3 --> 2
  // 4 --> 1
  // 5 --> 1 4 6
  // 6 --> 5
  // 7 --> 7 3 6
  std::vector<int32> tmp;
  tmp.resize(1); tmp[0] = 4; (*graph)[0] = tmp;
  tmp.resize(1); tmp[0] = 0; (*graph)[1] = tmp;
  tmp.resize(2); tmp[0] = 1; tmp[1] = 3; (*graph)[2] = tmp;
  tmp.resize(1); tmp[0] = 2; (*graph)[3] = tmp;
  tmp.resize(1); tmp[0] = 1; (*graph)[4] = tmp;
  tmp.resize(3); tmp[0] = 1; tmp[1] = 4; tmp[2] = 6; (*graph)[5] = tmp;
  tmp.resize(1); tmp[0] = 5; (*graph)[6] = tmp;
  tmp.resize(3); tmp[0] = 7; tmp[1] = 3; tmp[2] = 6; (*graph)[7] = tmp;
}

◆ BuildTestGraphTranspose()

void kaldi::nnet3::BuildTestGraphTranspose ( std::vector< std::vector< int32 > > *  graph)

Definition at line 72 of file nnet-graph-test.cc.

References KALDI_ASSERT.

Referenced by UnitTestComputeGraphTranspose().

{
  KALDI_ASSERT(graph != NULL);
  graph->clear();
  graph->resize(8);

  // We create the following graph for testing.
  // 0 --> 1
  // 1 --> 2 4 5
  // 2 --> 3
  // 3 --> 2 7
  // 4 --> 0 5
  // 5 --> 6
  // 6 --> 5 7
  // 7 --> 7
  std::vector<int32> tmp;
  tmp.resize(1); tmp[0] = 1; (*graph)[0] = tmp;
  tmp.resize(3); tmp[0] = 2; tmp[1] = 4; tmp[2] = 5; (*graph)[1] = tmp;
  tmp.resize(1); tmp[0] = 3; (*graph)[2] = tmp;
  tmp.resize(2); tmp[0] = 2; tmp[1] = 7; (*graph)[3] = tmp;
  tmp.resize(2); tmp[0] = 0; tmp[1] = 5; (*graph)[4] = tmp;
  tmp.resize(1); tmp[0] = 6; (*graph)[5] = tmp;
  tmp.resize(2); tmp[0] = 5; tmp[1] = 7; (*graph)[6] = tmp;
  tmp.resize(1); tmp[0] = 7; (*graph)[7] = tmp;
}

◆ BuildTestSccGraph()

void kaldi::nnet3::BuildTestSccGraph ( std::vector< std::vector< int32 > > *  scc_graph)

Definition at line 114 of file nnet-graph-test.cc.

References KALDI_ASSERT.

Referenced by UnitTestComputeTopSortOrder(), and UnitTestMakeSccGraph().

{
  KALDI_ASSERT(scc_graph != NULL);
  scc_graph->clear();
  scc_graph->resize(4);

  // We create the following SCC graph for testing.
  // 0 -->
  // 1 --> 0
  // 2 --> 0
  // 3 --> 1 2
  std::vector<int32> tmp;
  tmp.resize(0); (*scc_graph)[0] = tmp;
  tmp.resize(1); tmp[0] = 0; (*scc_graph)[1] = tmp;
  tmp.resize(1); tmp[0] = 0; (*scc_graph)[2] = tmp;
  tmp.resize(2); tmp[0] = 1; tmp[1] = 2; (*scc_graph)[3] = tmp;
}

◆ BuildTestSccs()

void kaldi::nnet3::BuildTestSccs ( std::vector< std::vector< int32 > > *  sccs)

Definition at line 97 of file nnet-graph-test.cc.

References KALDI_ASSERT.

Referenced by UnitTestFindSccs(), and UnitTestMakeSccGraph().

{
  KALDI_ASSERT(sccs != NULL);
  sccs->clear();
  sccs->resize(4);

  // We create the following SCCs for testing.
  // 0 --> 1 4 0
  // 1 --> 3 2
  // 2 --> 6 5
  // 3 --> 7
  std::vector<int32> tmp;
  tmp.resize(3); tmp[0] = 1; tmp[1] = 4; tmp[2] = 0; (*sccs)[0] = tmp;
  tmp.resize(2); tmp[0] = 3; tmp[1] = 2; (*sccs)[1] = tmp;
  tmp.resize(2); tmp[0] = 6; tmp[1] = 5; (*sccs)[2] = tmp;
  tmp.resize(1); tmp[0] = 7; (*sccs)[3] = tmp;
}

◆ BuildTestTopSortOrder()

void kaldi::nnet3::BuildTestTopSortOrder ( std::vector< int32 > *  node_to_order)

Definition at line 131 of file nnet-graph-test.cc.

References KALDI_ASSERT.

Referenced by UnitTestComputeTopSortOrder().

{
  KALDI_ASSERT(node_to_order != NULL);
  node_to_order->clear();
  node_to_order->resize(4);

  // The topological sorting order of the above SCC graph is as follows (from
  // our particular algorithm):
  // 0 --> 3
  // 1 --> 2
  // 2 --> 1
  // 3 --> 0
  (*node_to_order)[0] = 3;
  (*node_to_order)[1] = 2;
  (*node_to_order)[2] = 1;
  (*node_to_order)[3] = 0;
}

◆ CheckComputation()

void CheckComputation(const Nnet &nnet, const NnetComputation &computation, bool check_rewrite = false)

This is a convenience interface for class ComputationChecker.

Call it with check_rewrite = true only if the computation is pre-optimization. If the computation is an 'online' computation, this function treats it specially.

Definition at line 1145 of file nnet-analyze.cc.

References ComputationChecker::Check(), CheckComputationOptions::check_rewrite, CheckComputationOnline(), NnetComputation::commands, KALDI_ERR, kGotoLabel, and NnetComputation::Print().

Referenced by CachingOptimizingCompiler::CompileViaShortcut(), and Optimize().

{
  try {
    if (!computation.commands.empty() &&
        computation.commands.back().command_type == kGotoLabel) {
      // Online computations need to be treated specially.
      CheckComputationOnline(nnet, computation, check_rewrite);
    } else {
      CheckComputationOptions opts;
      opts.check_rewrite = check_rewrite;
      ComputationChecker checker(opts, nnet, computation);
      checker.Check();
    }
  } catch (...) {
    computation.Print(std::cerr, nnet);
    KALDI_ERR << "Computation check failed for computation printed above "
        "(actual error message is above computation)";
  }
}
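A sketch of where this is typically called (the surrounding variables 'request', 'nnet' and 'opts' are assumed to exist): pass check_rewrite = true before optimization, and the default false afterwards.

NnetComputation computation;
Compiler compiler(request, nnet);
compiler.CreateComputation(CompilerOptions(), &computation);
CheckComputation(nnet, computation, true);   // pre-optimization: check_rewrite = true
Optimize(opts, nnet, MaxOutputTimeInRequest(request), &computation);
CheckComputation(nnet, computation, false);  // post-optimization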

◆ CheckComputationOnline()

static void kaldi::nnet3::CheckComputationOnline(const Nnet &nnet, NnetComputation computation, bool check_rewrite)

Definition at line 1118 of file nnet-analyze.cc.

References ComputationChecker::Check(), CheckComputationOptions::check_rewrite, CheckComputationOptions::check_unused_variables, NnetComputation::commands, KALDI_ASSERT, kDeallocMatrix, kGotoLabel, kSwapMatrix, and kaldi::swap().

Referenced by CheckComputation().

{
  int32 num_commands = computation.commands.size();
  KALDI_ASSERT(computation.commands[num_commands-1].command_type == kGotoLabel);
  for (int32 c = num_commands - 2;
       c >= 0 && computation.commands[c].command_type == kSwapMatrix;
       c--) {
    // this command can be interpreted as "initialize matrix referred to by
    // c.arg1 with the matrix referred to by c.arg2".
    // Because this would be interpreted by the analysis code as initializing a
    // matrix that has already been initialized, we turn this into a command
    // that just deallocates the matrix in c.arg2. [note: all these indexes
    // are actually submatrix indexes].
    computation.commands[c].command_type = kDeallocMatrix;
    std::swap(computation.commands[c].arg1, computation.commands[c].arg2);
  }

  CheckComputationOptions opts;
  opts.check_rewrite = check_rewrite;
  opts.check_unused_variables = false;
  // We can always do this check with online computations, since they do not
  // have the RemoveUnnecessaryAllocation() optimization applied.
  ComputationChecker checker(opts, nnet, computation);
  checker.Check();
}

◆ CheckStringsApproxEqual()

bool kaldi::nnet3::CheckStringsApproxEqual(const std::string &a, const std::string &b, int32 tolerance = 3)

Definition at line 39 of file nnet-component-test.cc.

References KALDI_WARN, and kaldi::StringsApproxEqual().

Referenced by TestNnetComponentAddScale(), TestNnetComponentIo(), TestNnetComponentUpdatable(), and TestNnetComponentVectorizeUnVectorize().

{
  if (!StringsApproxEqual(a, b, tolerance)) {
    KALDI_WARN << "Strings differ: " << a
               << "\nvs.\n" << b;
    return false;
  } else {
    return true;
  }
}

◆ CollapseModel()

void CollapseModel(const CollapseModelConfig &config, Nnet *nnet)

This function modifies the neural net for efficiency, in a way that is suitable to be done at test time.

For example, it tries to get rid of dropout, batchnorm and fixed-scale components, and to collapse subsequent affine components if doing so won't hurt speed.

Definition at line 2100 of file nnet-utils.cc.

References ModelCollapser::Collapse().

Referenced by CollapseModelConfig::CollapseModelConfig(), main(), and UnitTestNnetCompute().

{
  ModelCollapser c(config, nnet);
  c.Collapse();
}

◆ CompileLooped()

void CompileLooped(const Nnet &nnet, const NnetOptimizeOptions &optimize_opts, const ComputationRequest &request1, const ComputationRequest &request2, const ComputationRequest &request3, NnetComputation *computation)

CompileLooped() provides an internal interface for 'looped' computation.

It's usable for inference only (not training), meaning that backprop is not supported (for now, at least). CompileLooped() allows you to do the neural net computation for small chunks with increasing 't' values, and naturally cache the intermediate activations (rather than recomputing them every time you see new input data).

This function does both compilation and optimization, so it's like a combination of Compiler::CreateComputation() [nnet-compile.h] and Optimize() [nnet-optimize.h].

You provide 3 computation requests. request1 is the first computation request of an utterance (or other type of segment) that contains any required extra left context in the input. request2 and request3 are the second and third computation request, and must have exactly the same structure, except for a fixed time offset (change in 't' index) between them. This will be extrapolated to an infinite sequence of further requests (request4, request5, etc.). In practice the way it's done is that we extrapolate to a small finite number of requests (like 10), and then attempt to identify a common structure in the computation where, after processing, as an example, the 3rd computation request, the active variables can be identified with those present at, say, the 7th computation request, and we then cut and splice the computation together at these points, like making a tape loop, by adding a goto statement that jumps from the end of the 7th computation request to the end of the 3rd computation request. We also have to identify the variables with each other (merge variables).

That's done in the optimization code.

Definition at line 329 of file nnet-compile-looped.cc.

References CompileLoopedInternal(), Timer::Elapsed(), KALDI_ERR, KALDI_LOG, and KALDI_VLOG.

Referenced by DecodableNnetSimpleLoopedInfo::Init(), and UnitTestNnetCompileLooped().

{
  int32 num_requests1 = 5, factor = 2, max_requests = 100,
      num_requests;

  Timer timer;

  for (num_requests = num_requests1; num_requests <= max_requests;
       num_requests *= factor) {
    if (CompileLoopedInternal(nnet, optimize_opts,
                              request1, request2, request3,
                              num_requests, computation)) {
      KALDI_LOG << "Spent " << timer.Elapsed()
                << " seconds in looped compilation.";
      return;
    } else {
      KALDI_VLOG(2) << "Looped compilation failed with "
                    << num_requests << " requests, trying "
                    << (num_requests * factor);
    }
  }
  KALDI_ERR << "Looped compilation failed with "
            << (num_requests/factor) << " requests, which "
            << "we expect should be enough... something "
            << "went wrong.";
}
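
A sketch of the calling pattern (hypothetical requests; in practice request1 through request3 are built so that request2 and request3 differ only by a fixed time offset):

NnetOptimizeOptions optimize_opts;
NnetComputation computation;
// request1 covers the start of the utterance (with any extra left context);
// request2 and request3 are consecutive chunks with identical structure.
CompileLooped(nnet, optimize_opts, request1, request2, request3, &computation);
// 'computation' now ends in kGotoLabel and can be run for arbitrarily many chunks.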

◆ CompileLoopedInternal()

static bool kaldi::nnet3::CompileLoopedInternal(const Nnet &nnet, NnetOptimizeOptions optimize_opts, const ComputationRequest &request1, const ComputationRequest &request2, const ComputationRequest &request3, int32 num_requests, NnetComputation *computation)

Definition at line 284 of file nnet-compile-looped.cc.

References NnetComputation::commands, Compiler::CreateComputation(), ExtrapolateComputationRequest(), rnnlm::i, KALDI_ASSERT, KALDI_ERR, KALDI_LOG, kGotoLabel, MaxOutputTimeInRequest(), Optimize(), NnetOptimizeOptions::optimize_looped_computation, and ComputationRequest::Print().

Referenced by CompileLooped().

{
  KALDI_ASSERT(num_requests >= 3);
  std::vector<ComputationRequest> extra_requests(num_requests - 3);
  const ComputationRequest *prev_request = &request2;
  const ComputationRequest *cur_request = &request3;
  for (int32 i = 0; i < num_requests - 3; i++) {
    if (!ExtrapolateComputationRequest(*prev_request, *cur_request,
                                       &(extra_requests[i]))) {
      KALDI_LOG << "prev_request is:";
      prev_request->Print(std::cerr);
      KALDI_LOG << "cur_request is:";
      cur_request->Print(std::cerr);
      KALDI_ERR << "Computation requests do not have the right relationship";
    }
    prev_request = cur_request;
    cur_request = &(extra_requests[i]);
  }

  std::vector<const ComputationRequest*> requests;
  requests.push_back(&request1);
  requests.push_back(&request2);
  requests.push_back(&request3);
  for (int32 i = 0; i < num_requests - 3; i++)
    requests.push_back(&(extra_requests[i]));
  Compiler compiler(requests, nnet);
  CompilerOptions compiler_opts;
  compiler.CreateComputation(compiler_opts, computation);
  optimize_opts.optimize_looped_computation = true;

  int32 dont_really_care = MaxOutputTimeInRequest(request3);
  Optimize(optimize_opts, nnet,
           dont_really_care, computation);

  return computation->commands.size() != 0 &&
      computation->commands.back().command_type == kGotoLabel;
}

◆ ComponentDotProducts()

void ComponentDotProducts(const Nnet &nnet1, const Nnet &nnet2, VectorBase<BaseFloat> *dot_prod)

Returns dot products between two networks of the same structure (calls the DotProduct functions of the Updatable components and fills in the output vector).

Definition at line 211 of file nnet-utils.cc.

References VectorBase< Real >::Data(), VectorBase< Real >::Dim(), UpdatableComponent::DotProduct(), Nnet::GetComponent(), KALDI_ASSERT, kUpdatableComponent, Nnet::NumComponents(), and Component::Properties().

Referenced by main().

{
  KALDI_ASSERT(nnet1.NumComponents() == nnet2.NumComponents());
  int32 updatable_c = 0;
  for (int32 c = 0; c < nnet1.NumComponents(); c++) {
    const Component *comp1 = nnet1.GetComponent(c),
        *comp2 = nnet2.GetComponent(c);
    if (comp1->Properties() & kUpdatableComponent) {
      const UpdatableComponent
          *u_comp1 = dynamic_cast<const UpdatableComponent*>(comp1),
          *u_comp2 = dynamic_cast<const UpdatableComponent*>(comp2);
      KALDI_ASSERT(u_comp1 != NULL && u_comp2 != NULL);
      dot_prod->Data()[updatable_c] = u_comp1->DotProduct(*u_comp2);
      updatable_c++;
    }
  }
  KALDI_ASSERT(updatable_c == dot_prod->Dim());
}

◆ ComputeAccuracy()

void ComputeAccuracy ( const GeneralMatrix &  supervision,
const CuMatrixBase< BaseFloat > &  nnet_output,
BaseFloat *  tot_weight,
BaseFloat *  tot_accuracy,
VectorBase< BaseFloat > *  tot_weight_vec = NULL,
VectorBase< BaseFloat > *  tot_accuracy_vec = NULL 
)

This function computes the frame accuracy for this minibatch.

It interprets the supervision information in "supervision" as labels or soft labels; it picks the maximum element in each row and treats that as the label for purposes of computing the accuracy (in situations where you would care about the accuracy, there will normally be just one nonzero label). The hypothesized labels are computed by taking the neural net output (supplied as a CuMatrix), and finding the maximum element in each row. See also the function ComputeObjectiveFunction, declared in nnet-training.h.

Parameters
[in]  supervision  The supervision information (no elements may be negative); only the maximum in each row matters (although we expect that usually there will be just one nonzero element in each row); and the sum of each row is interpreted as a weighting factor (although we expect that this sum will usually be one).
[in]  nnet_output  The neural net output; it must have the same dimensions as the supervision. Only the index of the maximum value in each row matters. Ties will be broken in an unspecified way.
[out]  tot_weight  The sum of the values in the supervision matrix.
[out]  tot_accuracy  The total accuracy, equal to the sum, over all row indexes r such that the maximum column index of row r of supervision and of nnet_output is the same, of the sum of the r'th row of supervision (i.e. the row's weight).
[out]  tot_weight_vec  If non-NULL, we write to this location the per-class counts in the supervision matrix. This is expected to have the same dimension as the corresponding output in the network.
[out]  tot_accuracy_vec  If non-NULL, we write to this location the per-class accuracy. For index j, the value is equal to the sum, over all row indexes r such that the maximum column index of row r of supervision is j and the maximum column index of row r of nnet_output is also j, of the sum of the r'th row of supervision (i.e. the row's weight).

Definition at line 206 of file nnet-diagnostics.cc.

References CuArrayBase< T >::CopyToVec(), VectorBase< Real >::Dim(), CuMatrixBase< Real >::FindRowMaxId(), GeneralMatrix::GetFullMatrix(), GeneralMatrix::GetMatrix(), GeneralMatrix::GetSparseMatrix(), KALDI_ASSERT, KALDI_ERR, kaldi::kCompressedMatrix, kaldi::kFullMatrix, kaldi::kSparseMatrix, SparseVector< Real >::Max(), VectorBase< Real >::Max(), CuMatrixBase< Real >::NumCols(), GeneralMatrix::NumCols(), CuMatrixBase< Real >::NumRows(), GeneralMatrix::NumRows(), SparseMatrix< Real >::Row(), VectorBase< Real >::Set(), SparseVector< Real >::Sum(), VectorBase< Real >::Sum(), and GeneralMatrix::Type().

Referenced by NnetComputeProb::ProcessOutputs().

211  {
212  int32 num_rows = nnet_output.NumRows(),
213  num_cols = nnet_output.NumCols();
214  KALDI_ASSERT(supervision.NumRows() == num_rows &&
215  supervision.NumCols() == num_cols);
216 
217  if (tot_accuracy_vec || tot_weight_vec)
218  KALDI_ASSERT(tot_accuracy_vec && tot_weight_vec &&
219  tot_accuracy_vec->Dim() == num_cols &&
220  tot_weight_vec->Dim() == num_cols);
221  if (tot_accuracy_vec) tot_accuracy_vec->Set(0.0);
222  if (tot_weight_vec) tot_weight_vec->Set(0.0);
223 
224  CuArray<int32> best_index(num_rows);
225  nnet_output.FindRowMaxId(&best_index);
226  std::vector<int32> best_index_cpu;
227  // wasteful copy, but doesn't dominate.
228  best_index.CopyToVec(&best_index_cpu);
229 
230 
231  double tot_weight = 0.0,
232  tot_accuracy = 0.0;
233 
234  // note: we expect that in most cases where this code is called,
235  // supervision.Type() will be kSparseMatrix.
236  switch (supervision.Type()) {
237  case kCompressedMatrix: {
238  Matrix<BaseFloat> mat;
239  supervision.GetMatrix(&mat);
240  for (int32 r = 0; r < num_rows; r++) {
241  SubVector<BaseFloat> vec(mat, r);
242  BaseFloat row_sum = vec.Sum();
243  int32 best_index;
244  vec.Max(&best_index); // discard max value.
245  tot_weight += row_sum;
246  if (tot_weight_vec)
247  (*tot_weight_vec)(best_index) += row_sum;
248  if (best_index == best_index_cpu[r]) {
249  tot_accuracy += row_sum;
250  if (tot_accuracy_vec)
251  (*tot_accuracy_vec)(best_index) += row_sum;
252  }
253  }
254  break;
255  }
256  case kFullMatrix: {
257  const Matrix<BaseFloat> &mat = supervision.GetFullMatrix();
258  for (int32 r = 0; r < num_rows; r++) {
259  SubVector<BaseFloat> vec(mat, r);
260  BaseFloat row_sum = vec.Sum();
261  int32 best_index;
262  vec.Max(&best_index); // discard max value.
263  tot_weight += row_sum;
264  if (tot_weight_vec)
265  (*tot_weight_vec)(best_index) += row_sum;
266  if (best_index == best_index_cpu[r]) {
267  tot_accuracy += row_sum;
268  if (tot_accuracy_vec)
269  (*tot_accuracy_vec)(best_index) += row_sum;
270  }
271  }
272  break;
273  }
274  case kSparseMatrix: {
275  const SparseMatrix<BaseFloat> &smat = supervision.GetSparseMatrix();
276  for (int32 r = 0; r < num_rows; r++) {
277  const SparseVector<BaseFloat> &row = smat.Row(r);
278  BaseFloat row_sum = row.Sum();
279  int32 best_index;
280  row.Max(&best_index);
281  KALDI_ASSERT(best_index < num_cols);
282  tot_weight += row_sum;
283  if (tot_weight_vec)
284  (*tot_weight_vec)(best_index) += row_sum;
285  if (best_index == best_index_cpu[r]) {
286  tot_accuracy += row_sum;
287  if (tot_accuracy_vec)
288  (*tot_accuracy_vec)(best_index) += row_sum;
289  }
290  }
291  break;
292  }
293  default: KALDI_ERR << "Bad general-matrix type.";
294  }
295  *tot_weight_out = tot_weight;
296  *tot_accuracy_out = tot_accuracy;
297 }
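
A usage sketch along the lines of NnetComputeProb::ProcessOutputs() (illustrative; 'supervision' and 'nnet_output' are assumed to be in scope):

  BaseFloat tot_weight, tot_accuracy;
  ComputeAccuracy(supervision, nnet_output, &tot_weight, &tot_accuracy);
  KALDI_LOG << "Frame accuracy: " << (tot_accuracy / tot_weight)
            << " over " << tot_weight << " frames.";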

◆ ComputeCommandAttributes()

void ComputeCommandAttributes ( const Nnet &  nnet,
const NnetComputation &  computation,
const ComputationVariables &  vars,
std::vector< CommandAttributes > *  attributes 
)

Definition at line 284 of file nnet-analyze.cc.

References NnetComputation::Command::arg1, NnetComputation::Command::arg2, NnetComputation::Command::arg3, NnetComputation::Command::arg4, NnetComputation::Command::arg5, NnetComputation::Command::arg6, NnetComputation::Command::command_type, NnetComputation::commands, count, Nnet::GetComponent(), CommandAttributes::has_side_effects, rnnlm::i, NnetComputation::indexes, NnetComputation::indexes_multi, IndexesMultiToSubmatrixIndexes(), kAcceptInput, kAddRowRanges, kAddRows, kAddRowsMulti, kAddToRowsMulti, KALDI_ERR, kAllocMatrix, kBackprop, kBackpropAdds, kBackpropNoModelUpdate, kCompressMatrix, kCopyRows, kCopyRowsMulti, kCopyToRowsMulti, kDeallocMatrix, kDecompressMatrix, kGotoLabel, kMatrixAdd, kMatrixCopy, kNoOperation, kNoOperationLabel, kNoOperationMarker, kNoOperationPermanent, kPropagate, kPropagateAdds, kProvideOutput, kReadAccess, kReadWriteAccess, kSetConst, kSwapMatrix, kUpdatableComponent, kWriteAccess, CommandAttributes::matrices_read, CommandAttributes::matrices_written, Component::Properties(), ComputationVariables::RecordAccessForSubmatrix(), kaldi::SortAndUniq(), CommandAttributes::submatrices_read, CommandAttributes::submatrices_written, CommandAttributes::variables_read, and CommandAttributes::variables_written.

Referenced by NnetComputer::Init(), Analyzer::Init(), and MoveSizingCommands().

288  {
289  int32 num_commands = computation.commands.size();
290  attributes->clear();
291  attributes->resize(num_commands);
292  for (int32 command_index = 0; command_index < num_commands; command_index++) {
293  const NnetComputation::Command &c = computation.commands[command_index];
294  CommandAttributes &attr = (*attributes)[command_index];
295  switch (c.command_type) {
296  case kAllocMatrix:
297  case kDeallocMatrix:
298  case kSwapMatrix:
299  break; // the commands above leave the matrix undefined.
300  case kSetConst:
301  vars.RecordAccessForSubmatrix(c.arg1, kWriteAccess, &attr);
302  break;
303  case kPropagate:
304  vars.RecordAccessForSubmatrix(c.arg3, kReadAccess, &attr);
305  if (nnet.GetComponent(c.arg1)->Properties() & kPropagateAdds)
306  vars.RecordAccessForSubmatrix(c.arg4, kReadWriteAccess, &attr);
307  else
308  vars.RecordAccessForSubmatrix(c.arg4, kWriteAccess, &attr);
309  break;
 310  case kBackprop:
 311  case kBackpropNoModelUpdate:
 312  vars.RecordAccessForSubmatrix(c.arg3, kReadAccess, &attr);
313  vars.RecordAccessForSubmatrix(c.arg4, kReadAccess, &attr);
314  vars.RecordAccessForSubmatrix(c.arg5, kReadAccess, &attr);
315  if (nnet.GetComponent(c.arg1)->Properties() & kBackpropAdds)
316  vars.RecordAccessForSubmatrix(c.arg6, kReadWriteAccess, &attr);
317  else
318  vars.RecordAccessForSubmatrix(c.arg6, kWriteAccess, &attr);
319  if (c.command_type == kBackprop &&
320  nnet.GetComponent(c.arg1)->Properties() & kUpdatableComponent)
321  attr.has_side_effects = true;
322  break;
323  case kMatrixCopy:
324  vars.RecordAccessForSubmatrix(c.arg1, kWriteAccess, &attr);
325  vars.RecordAccessForSubmatrix(c.arg2, kReadAccess, &attr);
326  break;
327  case kMatrixAdd:
328  vars.RecordAccessForSubmatrix(c.arg1, kReadWriteAccess, &attr);
329  vars.RecordAccessForSubmatrix(c.arg2, kReadAccess, &attr);
330  break;
331  case kAddRows:
332  vars.RecordAccessForSubmatrix(c.arg1, kReadWriteAccess, &attr);
333  vars.RecordAccessForSubmatrix(c.arg2, kReadAccess, &attr);
334  break;
335  case kCopyRows: {
336  const std::vector<int32> &indexes = computation.indexes[c.arg3];
337  // if there are -1's in "indexes", then the result of the operation
338  // will depend on the initial value of the matrix, so it's
339  // a "rw" operation, not a "write" operation.
340  if (std::count(indexes.begin(), indexes.end(), -1) > 0)
341  vars.RecordAccessForSubmatrix(c.arg1, kReadWriteAccess, &attr);
342  else
343  vars.RecordAccessForSubmatrix(c.arg1, kWriteAccess, &attr);
344  vars.RecordAccessForSubmatrix(c.arg2, kReadAccess, &attr);
345  break;
346  }
347  case kAddRowsMulti: {
348  vars.RecordAccessForSubmatrix(c.arg1, kReadWriteAccess, &attr);
349  std::vector<int32> submatrix_indexes;
350  IndexesMultiToSubmatrixIndexes(computation.indexes_multi[c.arg2],
351  &submatrix_indexes);
352  for (size_t i = 0; i < submatrix_indexes.size(); i++)
353  vars.RecordAccessForSubmatrix(submatrix_indexes[i],
354  kReadAccess, &attr);
355  break;
356  }
357  case kCopyRowsMulti: {
358  std::vector<int32> submatrix_indexes;
359  IndexesMultiToSubmatrixIndexes(computation.indexes_multi[c.arg2],
360  &submatrix_indexes);
361  // note: the CopyRows command assigns zero in cases where
362  // there is no source for some row
363  vars.RecordAccessForSubmatrix(c.arg1, kWriteAccess, &attr);
364  for (size_t i = 0; i < submatrix_indexes.size(); i++)
365  vars.RecordAccessForSubmatrix(submatrix_indexes[i],
366  kReadAccess, &attr);
367  break;
368  }
369  case kAddToRowsMulti:
370  case kCopyToRowsMulti: {
371  vars.RecordAccessForSubmatrix(c.arg1, kReadAccess, &attr);
372  // if the submatrixes we're writing to (in kCopyToRowsMulti) had all
373  // rows covered, it would be a pure write operation.
374  std::vector<int32> submatrix_indexes;
375  IndexesMultiToSubmatrixIndexes(computation.indexes_multi[c.arg2],
376  &submatrix_indexes);
377  for (size_t i = 0; i < submatrix_indexes.size(); i++)
378  vars.RecordAccessForSubmatrix(submatrix_indexes[i], kReadWriteAccess,
379  &attr);
380  break;
381  }
382  case kAddRowRanges: {
383  vars.RecordAccessForSubmatrix(c.arg1, kReadWriteAccess, &attr);
384  vars.RecordAccessForSubmatrix(c.arg2, kReadAccess, &attr);
385  break;
386  }
387  case kCompressMatrix: {
388  vars.RecordAccessForSubmatrix(c.arg1, kReadWriteAccess, &attr);
389  break;
390  }
391  case kDecompressMatrix: {
392  vars.RecordAccessForSubmatrix(c.arg1, kWriteAccess, &attr);
393  break;
394  }
395  case kAcceptInput: {
396  vars.RecordAccessForSubmatrix(c.arg1, kWriteAccess, &attr);
397  break;
398  }
399  case kProvideOutput: {
400  vars.RecordAccessForSubmatrix(c.arg1, kReadAccess, &attr);
401  break;
402  }
 403  case kNoOperation:
 404  case kNoOperationPermanent:
 405  case kNoOperationMarker:
406  case kNoOperationLabel:
407  case kGotoLabel:
408  break;
409  default:
410  KALDI_ERR << "Unknown command type.";
411  }
412  SortAndUniq(&attr.variables_read);
413  SortAndUniq(&attr.variables_written);
414  SortAndUniq(&attr.submatrices_read);
415  SortAndUniq(&attr.submatrices_written);
416  SortAndUniq(&attr.matrices_read);
417  SortAndUniq(&attr.matrices_written);
418  }
419 }
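
A sketch of the typical setup, following what Analyzer::Init() does (a compiled NnetComputation 'computation' and the corresponding 'nnet' are assumed):

  ComputationVariables variables;
  variables.Init(computation);
  std::vector<CommandAttributes> attributes;
  ComputeCommandAttributes(nnet, computation, variables, &attributes);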

◆ ComputeCommandPairs()

static void kaldi::nnet3::ComputeCommandPairs ( const std::pair< std::vector< int32 >, std::vector< int32 > > &  lists,
std::vector< std::pair< int32, int32 > > *  pairs 
)
static

Definition at line 328 of file nnet-optimize.cc.

References kaldi::CopyVectorToSet(), rnnlm::d, and KALDI_PARANOID_ASSERT.

Referenced by RemoveUnnecessaryAllocation().

330  {
331  std::vector<int32> d_list = lists.first;
332 
333  std::set<int32> a_set;
334  CopyVectorToSet(lists.second, &a_set);
335 
336  std::vector<int32>::reverse_iterator iter = d_list.rbegin(),
337  end = d_list.rend();
338 
339  // from the latest to the earliest deallocation command...
340  for (; iter != end; ++iter) {
341  int32 d = *iter;
342  std::set<int32>::iterator a_iter = a_set.upper_bound(d);
343  // a_iter is an iterator to the first element a of the set 'a_set' such
344  // that a > d, or a_set.end() if no such element exists.
345  if (a_iter == a_set.end())
346  continue; // we will output no pair for this d.
347  int32 a = *a_iter;
348  KALDI_PARANOID_ASSERT(a > d); // or code error
349  a_set.erase(a_iter); // remove this a from 'a_set' so it doesn't get used
350  // twice
351  pairs->push_back(std::pair<int32,int32>(d, a));
352  }
353 }
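
A worked example of the matching logic (illustrative only; the function is local to nnet-optimize.cc, so this shows how RemoveUnnecessaryAllocation() uses it internally). With deallocation commands {1, 4, 6} and allocation commands {2, 5, 9}, the deallocations are visited from latest to earliest and each is paired with the first later allocation:

  std::pair<std::vector<int32>, std::vector<int32> > lists;
  lists.first.push_back(1); lists.first.push_back(4); lists.first.push_back(6);
  lists.second.push_back(2); lists.second.push_back(5); lists.second.push_back(9);
  std::vector<std::pair<int32, int32> > pairs;
  ComputeCommandPairs(lists, &pairs);
  // pairs now holds (6,9), (4,5), (1,2), in that order.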

◆ ComputeComputationGraph()

void kaldi::nnet3::ComputeComputationGraph ( const Nnet &  nnet,
const ComputationRequest &  request,
ComputationGraph *  graph 
)

Definition at line 1152 of file nnet-computation-graph.cc.

References kaldi::nnet3::computation_graph::AddInputToGraph(), kaldi::nnet3::computation_graph::AddOutputToGraph(), ComputationGraph::cindexes, NetworkNode::component_index, ComputationGraph::dependencies, NetworkNode::descriptor, ComputationGraph::GetCindexId(), Nnet::GetComponent(), Descriptor::GetDependencies(), Component::GetInputIndexes(), Nnet::GetNode(), rnnlm::i, ComputationGraph::is_input, KALDI_ASSERT, KALDI_ERR, kComponent, kDescriptor, kDimRange, kInput, ComputationRequest::misc_info, rnnlm::n, NetworkNode::node_index, NetworkNode::node_type, kaldi::SortAndUniq(), and NetworkNode::u.

1154  {
1155  using namespace computation_graph;
1156  // make sure graph is empty at the start.
1157  KALDI_ASSERT(graph->cindexes.empty());
1158 
1159  AddInputToGraph(request, nnet, graph);
1160  AddOutputToGraph(request, nnet, graph);
1161 
1162  // queue of cindex_ids to process.
1163  std::vector<int32> queue(graph->cindexes.size());
1164  for (int32 i = 0; i < graph->cindexes.size(); i++)
 1165  queue[i] = i;
1166 
1167  while (!queue.empty()) {
1168  int32 cindex_id = queue.back();
1169  queue.pop_back();
1170  if (static_cast<int32>(graph->dependencies.size()) <= cindex_id)
1171  graph->dependencies.resize(cindex_id + 1);
1172 
1173  if (graph->is_input[cindex_id])
1174  continue;
1175  Cindex cindex = graph->cindexes[cindex_id];
1176 
1177  // find the dependencies of this cindex.
1178  int32 n = cindex.first;
1179  const Index &index = cindex.second;
1180  const NetworkNode &node = nnet.GetNode(n);
1181 
1182  std::vector<Cindex> input_cindexes;
1183 
1184  // the following switch statement sets up "input_cindexes".
1185  switch (node.node_type) {
1186  case kDescriptor: {
1187  // desc describes how this node obtains its input from other nodes.
1188  const Descriptor &desc = node.descriptor;
1189  desc.GetDependencies(index, &input_cindexes);
1190  break;
1191  }
1192  case kComponent: {
1193  int32 c = node.u.component_index;
1194  const Component *component = nnet.GetComponent(c);
1195  std::vector<Index> input_indexes;
1196  component->GetInputIndexes(request.misc_info, index,
1197  &input_indexes);
1198  // each Component node should be preceded by a node that describes its
1199  // input, of type kDescriptor
1200  KALDI_ASSERT(nnet.GetNode(n-1).node_type ==
1201  kDescriptor);
1202 
1203  input_cindexes.resize(input_indexes.size());
1204  for (size_t i = 0; i < input_indexes.size(); i++) {
1205  input_cindexes[i].first = n - 1; // preceding node.
1206  input_cindexes[i].second = input_indexes[i];
1207  }
1208  break;
1209  }
1210  case kDimRange: {
1211  input_cindexes.resize(1);
1212  input_cindexes[0] = Cindex(node.u.node_index, index);
1213  break;
1214  }
1215  case kInput: default:
1216  // for kInput, you should have hit the "continue" statement above.
1217  KALDI_ERR << "Invalid node type";
1218  }
1219  std::vector<int32> &this_dep = graph->dependencies[cindex_id];
1220 
1221  int32 num_dependencies = input_cindexes.size();
1222  this_dep.resize(num_dependencies);
1223  for (size_t i = 0; i < num_dependencies; i++) {
1224  bool is_input = false, is_new;
1225  int32 dep_cindex_id = graph->GetCindexId(input_cindexes[i],
1226  is_input, &is_new);
1227  this_dep[i] = dep_cindex_id;
1228  if (is_new)
1229  queue.push_back(dep_cindex_id);
1230  }
1231 
1232  // remove duplicates of dependencies.
1233  SortAndUniq(&this_dep);
1234  }
1235 }
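
A usage sketch (assumes 'nnet' and a ComputationRequest 'request' are already set up):

  ComputationGraph graph;
  ComputeComputationGraph(nnet, request, &graph);
  // graph.cindexes now lists every (node, Index) pair in the computation,
  // and graph.dependencies[c] the cindex_ids that cindex_id c depends on.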

◆ ComputeComputationPhases()

void ComputeComputationPhases ( const Nnet &  nnet,
const ComputationGraph &  graph,
std::vector< std::vector< std::vector< int32 > > > *  phases_per_segment 
)

This function divides a computation into 'phases', where a 'phase' is a collection of cindexes which can (as far as the computation graph is concerned) all be computed at the same time, and depend only on cindexes previously computed in earlier phases.

So the phases are an ordering of the Cindexes in the computation, but an ordering that depends on graph-theoretic considerations only, and not practical concerns like whether the cindexes belong to the same node [for that, see the notion of steps].

Parameters
[in]  nnet  The neural network this computation is for.
[in]  graph  The computation graph that we're computing phases for.
[out]  phases_per_segment  The phases, listed separately for each segment of the computation [there will be just one segment in the normal case, more in the online-recognition case]. Consider just one segment for now. Suppose the computation can be completed in 20 phases; then (*phases_per_segment)[0].size() will be 20 at exit, and (*phases_per_segment)[0][0] will be a sorted list of the cindex_ids that belong to the first phase, and so on. (Remember, a cindex_id is an index into graph->cindexes; it compactly identifies a cindex.) The sets represented by the int32's in 'phases_per_segment' will be disjoint and will cover all elements in [0 .. computation.cindexes.size() - 1].

Note: we assume you have called PruneComputationGraph() before this function. Even so, this function will crash if the computation cannot actually be computed -- there are some malformed computations where you can build the computation graph but not the ordering of cindexes, because there are dependencies forward and backward in time that intertwine.

Definition at line 1406 of file nnet-computation-graph.cc.

References ComputationGraph::cindexes, ComputeComputationPhasesForEpoch(), kaldi::nnet3::computation_graph::ComputeDependenciesSubset(), kaldi::nnet3::computation_graph::ComputeEpochInfo(), ComputeGraphTranspose(), KALDI_ASSERT, ComputationGraph::segment_ends, and SumVectorSizes().

Referenced by Compiler::CreateComputation().

1409  {
1410  using namespace computation_graph;
1411  int32 num_cindex_ids = graph.cindexes.size();
1412 
1413  std::vector<int32> cindex_id_to_segment_and_epoch;
1414  std::vector<std::vector<std::vector<int32 > > > epochs_per_segment;
1415  std::vector<bool> epoch_is_trivial;
1416  ComputeEpochInfo(nnet, graph, &cindex_id_to_segment_and_epoch,
1417  &epochs_per_segment, &epoch_is_trivial);
1418 
1419  KALDI_ASSERT(SumVectorSizes(epochs_per_segment) == num_cindex_ids);
1420 
1421  // dependencies_subset contains just the subset of dependencies
1422  // of each cindex_id, that have the same epoch index as
1423  // cindex_id itself. This will be used to correctly order
1424  // cindexes within a certain epoch (relevant for things like
1425  // LSTMs).
1426  std::vector<std::vector<int32> > dependencies_subset;
1427  ComputeDependenciesSubset(graph, cindex_id_to_segment_and_epoch,
1428  &dependencies_subset);
1429  // destroy cindex_id_to_segment_and_epoch, it's no longer needed.
1430  { std::vector<int32> temp; temp.swap(cindex_id_to_segment_and_epoch); }
1431 
1432  // depend_on_subset is a subset of the normal "depend_on" list (i.e. a list of
1433  // all cindex_ids that depend on the current cindex_id), limited to just those
1434  // cindex_ids that have the same epoch index.
1435  std::vector<std::vector<int32> > depend_on_subset;
1436  ComputeGraphTranspose(dependencies_subset, &depend_on_subset);
1437 
1438  int32 num_epoch_indexes = epoch_is_trivial.size(),
1439  num_segments = graph.segment_ends.size();
1440 
1441  // "phase_indexes" is used inside ComputeComputationPhasesForEpoch.
1442  std::vector<int32> phase_indexes(num_cindex_ids, -1);
1443 
1444  phases_per_segment->clear();
1445  phases_per_segment->resize(num_segments);
1446 
1447  for (int32 segment = 0; segment < num_segments; segment++) {
 1448  (*phases_per_segment)[segment].reserve(50); // minimize unnecessary
 1449  // copies; 50 is very arbitrarily chosen.
 1450  for (int32 epoch = 0; epoch < num_epoch_indexes; epoch++)
 1451  ComputeComputationPhasesForEpoch(nnet, graph,
 1452  epochs_per_segment[segment][epoch],
1453  dependencies_subset,
1454  depend_on_subset,
1455  epoch_is_trivial[epoch],
1456  &phase_indexes,
1457  &((*phases_per_segment)[segment]));
1458  }
1459 
1460 
1461  // make sure everything was computable. If the next assert fails it's likely
 1462  // a bug in this function or in PruneComputationGraph.
1463  KALDI_ASSERT(SumVectorSizes(*phases_per_segment) == num_cindex_ids);
1464 }
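
A usage sketch (a pruned ComputationGraph 'graph' is assumed):

  std::vector<std::vector<std::vector<int32> > > phases_per_segment;
  ComputeComputationPhases(nnet, graph, &phases_per_segment);
  // In the normal (non-looped) case there is one segment, and
  // phases_per_segment[0][p] is the sorted list of cindex_ids in phase p.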

◆ ComputeComputationPhasesForEpoch()

static void kaldi::nnet3::ComputeComputationPhasesForEpoch ( const Nnet &  nnet,
const ComputationGraph &  graph,
const std::vector< int32 > &  this_epoch,
const std::vector< std::vector< int32 > > &  dependencies_subset,
const std::vector< std::vector< int32 > > &  depend_on_subset,
bool  epoch_is_trivial,
std::vector< int32 > *  phase_indexes,
std::vector< std::vector< int32 > > *  phases 
)
inline static

Definition at line 1307 of file nnet-computation-graph.cc.

References rnnlm::d, KALDI_ASSERT, KALDI_ERR, and kaldi::SortAndUniq().

Referenced by ComputeComputationPhases().

1315  {
1316  std::vector<int32> this_phase, next_phase_candidates;
1317 
1318  if (this_epoch.empty())
1319  return;
1320 
1321  if (epoch_is_trivial) { // an optimization
1322  this_phase = this_epoch;
1323  } else {
1324  // Start out with all elements of this epoch that have no
1325  // dependencies within the same epoch (i.e. those that
1326  // can be computed first).
1327  std::vector<int32>::const_iterator iter = this_epoch.begin(),
1328  end = this_epoch.end();
1329  for (; iter != end; ++iter) {
1330  int32 cindex_id = *iter;
1331  if (dependencies_subset[cindex_id].empty())
1332  this_phase.push_back(cindex_id);
1333  }
1334  }
1335 
1336  // if the next assert fails, the graph at the level of cindex_ids is not acyclic.
1337  KALDI_ASSERT(!this_phase.empty() &&
1338  "Trying to process computation with cycles");
1339 
1340  while (!this_phase.empty()) {
1341  // The next two lines are a more efficient version of:
1342  // phases->push_back(this_phase);
1343  phases->resize(phases->size() + 1);
1344  phases->back().swap(this_phase);
1345  // The next if-statement is an optimization: if for this epoch index
1346  // there is just one node, we can skip the rest of this loop. Note: if
1347  // epoch == 0, even if there is just one node, cindex_ids from
1348  // multiple nodes may be put here because of the rule that cindex_ids which
1349  // are inputs always get epoch 0. But it's still true that they
1350  // will have no dependencies, so we can still skip the code below.
1351  if (epoch_is_trivial)
1352  return;
1353 
1354  int32 cur_phase_index = phases->size() - 1;
1355 
 1356  // next_phase_candidates is a list of cindexes that we should check
1357  // whether they are computable now, because one of the things they depend
1358  // on just became computable.
1359  next_phase_candidates.clear();
1360  std::vector<int32>::const_iterator this_phase_iter = phases->back().begin(),
1361  this_phase_end = phases->back().end();
1362 
1363  for (; this_phase_iter != this_phase_end; ++this_phase_iter) {
1364  int32 c = *this_phase_iter; // c is a cindex_id with phase cur_phase_index.
1365  (*phase_indexes)[c] = cur_phase_index;
1366  std::vector<int32>::const_iterator iter = depend_on_subset[c].begin(),
1367  end = depend_on_subset[c].end();
1368  for (; iter != end; ++iter) {
1369  int32 d = *iter; // cindex_id that depends on c.
1370  next_phase_candidates.push_back(d);
1371  }
1372  }
1373  SortAndUniq(&next_phase_candidates);
1374  // note, at this point 'this_phase' will be the empty vector [see the 'swap'
1375  // above].
1376  this_phase.reserve(next_phase_candidates.size());
1377  // now check the candidates that might be in the next phase, and put any
1378  // members that we are currently able to compute into "this_phase".
1379  std::vector<int32>::const_iterator iter = next_phase_candidates.begin(),
1380  end = next_phase_candidates.end();
1381  for (; iter != end; ++iter) {
1382  int32 c = *iter;
1383  std::vector<int32>::const_iterator
1384  dep_iter = dependencies_subset[c].begin(),
1385  dep_end = dependencies_subset[c].end();
1386  for (; dep_iter != dep_end; ++dep_iter) {
1387  int32 d = *dep_iter; // d is cindex_id that c depends on.
1388  if ((*phase_indexes)[d] < 0) // we can't compute c yet because something we depend
1389  break; // on has not yet been computed.
1390  }
1391  if (dep_iter == dep_end) {
1392  // we reached the end and did not break -> all dependencies satisfied
1393  this_phase.push_back(c);
1394  }
1395  }
1396  if (!next_phase_candidates.empty() && this_phase.empty()) {
1397  // this should have been caught earlier so likely a code error rather than
1398  // a problem with user input.
1399  KALDI_ERR << "Your model has a type of recurrence that cannot be computed. "
1400  << "E.g. if x[t] depends on both x[t+1] and x[t-1]... no order "
1401  << "of computation will work.";
1402  }
1403  }
1404 }

◆ ComputeExampleComputationRequestSimple()

void ComputeExampleComputationRequestSimple ( const Nnet &  nnet,
ComputationRequest *  request,
std::vector< Matrix< BaseFloat > > *  inputs 
)

This function computes an example computation request, for testing purposes.

The "Simple" in the name means that it currently only supports neural nets that satisfy IsSimple(nnet) (defined in nnet-utils.h). If there are 2 inputs, the "input" will be first, followed by "ivector".

In order to expand the range of things you can test with this (mainly to stop crashes with statistics-pooling/statistics-extraction components), this function always generates computation-requests where at least 3 successive frames of input are requested.

Definition at line 1338 of file nnet-test-utils.cc.

References ComputeSimpleNnetContext(), Nnet::InputDim(), ComputationRequest::inputs, IsSimpleNnet(), KALDI_ASSERT, rnnlm::n, ComputationRequest::need_model_derivative, ComputationRequest::outputs, kaldi::Rand(), and ComputationRequest::store_component_stats.

Referenced by NnetGenerationOptions::NnetGenerationOptions(), UnitTestNnetAnalyze(), UnitTestNnetCompile(), UnitTestNnetCompileMulti(), UnitTestNnetCompute(), UnitTestNnetInputDerivatives(), UnitTestNnetModelDerivatives(), and UnitTestNnetOptimizeWithOptions().

1341  {
1342  KALDI_ASSERT(IsSimpleNnet(nnet));
1343 
1344  int32 left_context, right_context;
1345  ComputeSimpleNnetContext(nnet, &left_context, &right_context);
1346 
1347  int32 num_output_frames = 1 + Rand() % 10,
1348  output_start_frame = Rand() % 10,
1349  num_examples = 1 + Rand() % 4,
1350  output_end_frame = output_start_frame + num_output_frames,
1351  input_start_frame = output_start_frame - left_context - (Rand() % 3),
1352  input_end_frame = output_end_frame + right_context + (Rand() % 3),
1353  n_offset = Rand() % 2;
1354  bool need_deriv = (Rand() % 2 == 0);
1355  // make sure there are at least 3 frames of input available. this makes a
1356  // difference for our tests of statistics-pooling and statistics-extraction
1357  // component.
1358  if (input_end_frame < input_start_frame + 3)
1359  input_end_frame = input_start_frame + 3;
1360 
1361  request->inputs.clear();
1362  request->outputs.clear();
1363  inputs->clear();
1364 
1365  std::vector<Index> input_indexes, ivector_indexes, output_indexes;
1366  for (int32 n = n_offset; n < n_offset + num_examples; n++) {
1367  for (int32 t = input_start_frame; t < input_end_frame; t++)
1368  input_indexes.push_back(Index(n, t, 0));
1369  for (int32 t = output_start_frame; t < output_end_frame; t++)
1370  output_indexes.push_back(Index(n, t, 0));
1371  ivector_indexes.push_back(Index(n, 0, 0));
1372  }
1373  request->outputs.push_back(IoSpecification("output", output_indexes));
1374  if (need_deriv || (Rand() % 3 == 0))
1375  request->outputs.back().has_deriv = true;
1376  request->inputs.push_back(IoSpecification("input", input_indexes));
1377  if (need_deriv && (Rand() % 2 == 0))
1378  request->inputs.back().has_deriv = true;
1379  int32 input_dim = nnet.InputDim("input");
1380  KALDI_ASSERT(input_dim > 0);
1381  inputs->push_back(
1382  Matrix<BaseFloat>((input_end_frame - input_start_frame) * num_examples,
1383  input_dim));
1384  inputs->back().SetRandn();
1385  int32 ivector_dim = nnet.InputDim("ivector"); // may not exist.
1386  if (ivector_dim != -1) {
1387  request->inputs.push_back(IoSpecification("ivector", ivector_indexes));
1388  inputs->push_back(Matrix<BaseFloat>(num_examples, ivector_dim));
1389  inputs->back().SetRandn();
1390  if (need_deriv && (Rand() % 2 == 0))
1391  request->inputs.back().has_deriv = true;
1392  }
1393  if (Rand() % 2 == 0)
1394  request->need_model_derivative = need_deriv;
1395  if (Rand() % 2 == 0)
1396  request->store_component_stats = true;
1397 }
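
A typical test-time usage sketch (cf. the unit tests listed above):

  ComputationRequest request;
  std::vector<Matrix<BaseFloat> > inputs;
  ComputeExampleComputationRequestSimple(nnet, &request, &inputs);
  // 'request' can now be compiled, and 'inputs' fed to an NnetComputer.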

◆ ComputeGraphTranspose()

void ComputeGraphTranspose ( const std::vector< std::vector< int32 > > &  graph,
std::vector< std::vector< int32 > > *  graph_transpose 
)

Outputs a graph in which the direction of each arc is reversed.

Definition at line 63 of file nnet-graph.cc.

References rnnlm::n.

Referenced by ComputeComputationPhases(), FindOrphanNodes(), and UnitTestComputeGraphTranspose().

64  {
65  int32 size = graph.size();
66  graph_transpose->clear();
67  graph_transpose->resize(size);
68  for (int32 n = 0; n < size; n++) {
69  const std::vector<int32> &nodes = graph[n];
70  std::vector<int32>::const_iterator iter = nodes.begin(), end = nodes.end();
71  for (; iter != end; ++iter) {
72  int32 dest = *iter;
73  (*graph_transpose)[dest].push_back(n);
74  }
75  }
76 }
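
A small worked example; the result follows directly from the loop above:

  // Arcs 0->1, 0->2, 1->2:
  std::vector<std::vector<int32> > graph(3), transpose;
  graph[0].push_back(1); graph[0].push_back(2); graph[1].push_back(2);
  ComputeGraphTranspose(graph, &transpose);
  // transpose[0] is empty, transpose[1] == {0}, transpose[2] == {0, 1}.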

◆ ComputeMatrixAccesses()

void ComputeMatrixAccesses ( const Nnet &  nnet,
const NnetComputation &  computation,
const ComputationVariables &  variables,
const std::vector< CommandAttributes > &  command_attributes,
std::vector< MatrixAccesses > *  matrix_accesses 
)

This function organizes information in the CommandAttributes in a way that is convenient to access per matrix.

See the declaration of struct MatrixAccesses for the output format; the output "matrix_accesses" is indexed by matrix index (the same index as computation.matrices).

Definition at line 467 of file nnet-analyze.cc.

References NnetComputation::Command::arg1, NnetComputation::Command::arg2, NnetComputation::Command::command_type, NnetComputation::commands, kaldi::IsSortedAndUniq(), NnetComputation::IsWholeMatrix(), kAcceptInput, KALDI_ASSERT, KALDI_ERR, kAllocMatrix, kDeallocMatrix, kProvideOutput, kReadAccess, kReadWriteAccess, kSwapMatrix, kWriteAccess, NnetComputation::matrices, CommandAttributes::matrices_read, CommandAttributes::matrices_written, kaldi::SortAndUniq(), and NnetComputation::submatrices.

Referenced by Analyzer::Init(), MatrixAccesses::MatrixAccesses(), and MoveSizingCommands().

472  {
473  int32 num_matrices = computation.matrices.size(),
474  num_commands = command_attributes.size();
475  matrix_accesses->clear();
476  matrix_accesses->resize(num_matrices);
477  for (int32 c = 0; c < num_commands; c++) {
478  const CommandAttributes &attr = command_attributes[c];
479  KALDI_ASSERT(IsSortedAndUniq(attr.matrices_read));
480  KALDI_ASSERT(IsSortedAndUniq(attr.matrices_written));
481  std::vector<int32> all_matrices;
482  all_matrices.reserve(attr.matrices_read.size() +
483  attr.matrices_written.size());
484  all_matrices.insert(all_matrices.end(), attr.matrices_read.begin(),
485  attr.matrices_read.end());
486  all_matrices.insert(all_matrices.end(), attr.matrices_written.begin(),
487  attr.matrices_written.end());
488  SortAndUniq(&all_matrices);
489 
490  std::vector<int32>::const_iterator iter = all_matrices.begin(),
491  end = all_matrices.end();
492  for (; iter != end; ++iter) {
493  int32 matrix_index = *iter;
494  bool is_read = std::binary_search(attr.matrices_read.begin(),
495  attr.matrices_read.end(),
496  matrix_index),
497  is_written = (!is_read ? true :
498  std::binary_search(attr.matrices_written.begin(),
499  attr.matrices_written.end(),
500  matrix_index));
501  if (is_read && is_written) {
502  (*matrix_accesses)[matrix_index].accesses.push_back(
503  Access(c, kReadWriteAccess));
504  } else if (is_read) {
505  (*matrix_accesses)[matrix_index].accesses.push_back(
506  Access(c, kReadAccess));
507  } else {
508  (*matrix_accesses)[matrix_index].accesses.push_back(
509  Access(c, kWriteAccess));
510  }
511  }
512  // Now set up allocate_command, deallocate_command,
513  // is_input and is_output.
514  const NnetComputation::Command &command = computation.commands[c];
515  int32 matrix_index1, matrix_index2;
516 
517  switch (command.command_type) {
518  case kAllocMatrix:
519  if (!computation.IsWholeMatrix(command.arg1))
520  KALDI_ERR << "Command does not operate on whole matrix";
521  matrix_index1 = computation.submatrices[command.arg1].matrix_index;
522  if ((*matrix_accesses)[matrix_index1].allocate_command != -1)
523  KALDI_ERR << "Matrix " << matrix_index1 << " initialized twice.";
524  (*matrix_accesses)[matrix_index1].allocate_command = c;
525  break;
526  case kSwapMatrix:
527  if (!computation.IsWholeMatrix(command.arg1))
528  KALDI_ERR << "Command does not operate on whole matrix";
529  matrix_index1 = computation.submatrices[command.arg1].matrix_index;
530  KALDI_ASSERT(computation.IsWholeMatrix(command.arg2));
531  matrix_index2 = computation.submatrices[command.arg2].matrix_index;
532  if ((*matrix_accesses)[matrix_index1].allocate_command != -1)
533  KALDI_ERR << "Matrix " << matrix_index1 << " initialized twice.";
534  (*matrix_accesses)[matrix_index1].allocate_command = c;
535  if ((*matrix_accesses)[matrix_index2].deallocate_command != -1)
536  KALDI_ERR << "Matrix " << matrix_index2 << " destroyed twice.";
537  (*matrix_accesses)[matrix_index2].deallocate_command = c;
538  break;
539  case kDeallocMatrix:
540  if (!computation.IsWholeMatrix(command.arg1))
541  KALDI_ERR << "Command does not operate on whole matrix";
542  matrix_index1 = computation.submatrices[command.arg1].matrix_index;
543  if ((*matrix_accesses)[matrix_index1].deallocate_command != -1)
544  KALDI_ERR << "Matrix " << matrix_index1 << " destroyed twice.";
545  (*matrix_accesses)[matrix_index1].deallocate_command = c;
546  break;
547  case kAcceptInput:
548  if (!computation.IsWholeMatrix(command.arg1))
549  KALDI_ERR << "Command does not operate on whole matrix";
550  matrix_index1 = computation.submatrices[command.arg1].matrix_index;
551  (*matrix_accesses)[matrix_index1].is_input = true;
552  // If a certain matrix is accepted as input multiple times, we
553  // count the first one as allocating it (the second will just
554  // allocate it again, which is harmless).
555  if ((*matrix_accesses)[matrix_index1].allocate_command == -1)
556  (*matrix_accesses)[matrix_index1].allocate_command = c;
557  break;
558  case kProvideOutput:
559  if (!computation.IsWholeMatrix(command.arg1))
560  KALDI_ERR << "Command does not operate on whole matrix";
561  matrix_index1 = computation.submatrices[command.arg1].matrix_index;
562  (*matrix_accesses)[matrix_index1].is_output = true;
563  break;
564  default:
565  ;
566  }
567  }
568 }
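
A sketch continuing the setup shown under ComputeCommandAttributes() above:

  std::vector<MatrixAccesses> matrix_accesses;
  ComputeMatrixAccesses(nnet, computation, variables, attributes,
                        &matrix_accesses);
  // matrix_accesses[m] records, in command order, which commands read
  // and/or write matrix m, plus its allocation/deallocation commands.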

◆ ComputeMatrixToSubmatrix()

void ComputeMatrixToSubmatrix ( const NnetComputation &  computation,
std::vector< std::vector< int32 > > *  mat_to_submat 
)

This function computes a vector 'mat_to_submat', indexed by matrix index, such that (*mat_to_submat)[m] is a list of all the submatrix indexes that refer to matrix m.

Note, (*mat_to_submat)[0] will be the empty vector.

Definition at line 1166 of file nnet-analyze.cc.

References KALDI_ASSERT, NnetComputation::matrices, and NnetComputation::submatrices.

Referenced by VariableMergingOptimizer::VariableMergingOptimizer().

1168  {
1169  int32 num_matrices = computation.matrices.size(),
1170  num_submatrices = computation.submatrices.size();
1171  mat_to_submat->clear();
1172  mat_to_submat->resize(num_matrices);
1173  for (int32 submatrix_index = 1;
1174  submatrix_index < num_submatrices;
1175  submatrix_index++) {
1176  int32 matrix_index = computation.submatrices[submatrix_index].matrix_index;
1177  KALDI_ASSERT(matrix_index > 0 && matrix_index < num_matrices);
1178  (*mat_to_submat)[matrix_index].push_back(submatrix_index);
1179  }
1180 }
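
A usage sketch (an NnetComputation 'computation' is assumed):

  std::vector<std::vector<int32> > mat_to_submat;
  ComputeMatrixToSubmatrix(computation, &mat_to_submat);
  // mat_to_submat[m] lists all submatrix indexes that refer to matrix m.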

◆ ComputeMinAndMaxTimes()

void kaldi::nnet3::ComputeMinAndMaxTimes ( const std::vector< Index > &  indexes,
int32 *  min_t,
int32 *  max_t 
)

Definition at line 31 of file nnet-derivative-test.cc.

References KALDI_ASSERT, and rnnlm::n.

Referenced by SetDerivTimesOptions().

33  {
34  KALDI_ASSERT(!indexes.empty());
35  *min_t = indexes[0].t;
36  *max_t = *min_t;
37  for (int32 n = 1; n < static_cast<int32>(indexes.size()); n++) {
38  *min_t = std::min(*min_t, indexes[n].t);
39  *max_t = std::max(*max_t, indexes[n].t);
40  }
41 }

◆ ComputeNnetComputationEpochs()

void ComputeNnetComputationEpochs ( const Nnet &  nnet,
std::vector< int32 > *  node_to_epoch 
)

This function computes the order in which we need to compute each node in the graph, where each node-index n maps to an epoch-index t = 0, 1, ... that says when we should compute it.

Nodes that are part of a strongly connected component (SCC) will all be computed at the same time, but any two nodes that are not part of an SCC will have different epoch-indexes, and these epoch-indexes will be such that a node computed at a larger epoch-index may depend on a node computed at a smaller epoch-index, but not vice versa.

Internally it calls NnetToDirectedGraph, FindSccs, MakeSccGraph and ComputeTopSortOrder.

Definition at line 265 of file nnet-graph.cc.

References ComputeTopSortOrder(), FindSccs(), kaldi::GetVerboseLevel(), rnnlm::i, rnnlm::j, KALDI_ASSERT, KALDI_VLOG, MakeSccGraph(), NnetToDirectedGraph(), and PrintGraphToString().

Referenced by kaldi::nnet3::computation_graph::ComputeEpochInfo().

266  {
267  KALDI_ASSERT(node_to_epoch != NULL);
268 
269  std::vector<std::vector<int32> > graph;
270  NnetToDirectedGraph(nnet, &graph);
271  KALDI_VLOG(6) << "graph is: " << PrintGraphToString(graph);
272 
273  std::vector<std::vector<int32> > sccs;
274  FindSccs(graph, &sccs);
275 
276  std::vector<std::vector<int32> > scc_graph;
277  MakeSccGraph(graph, sccs, &scc_graph);
278  KALDI_VLOG(6) << "scc graph is: " << PrintGraphToString(scc_graph);
279 
280  std::vector<int32> scc_node_to_epoch;
281  ComputeTopSortOrder(scc_graph, &scc_node_to_epoch);
282  if (GetVerboseLevel() >= 6) {
283  std::ostringstream os;
284  for (int32 i = 0; i < scc_node_to_epoch.size(); i++)
285  os << scc_node_to_epoch[i] << ", ";
286  KALDI_VLOG(6) << "scc_node_to_epoch is: " << os.str();
287  }
288 
289  node_to_epoch->clear();
290  node_to_epoch->resize(graph.size());
291  for (int32 i = 0; i < sccs.size(); ++i) {
292  for (int32 j = 0; j < sccs[i].size(); ++j) {
293  int32 node = sccs[i][j];
294  KALDI_ASSERT(node >= 0 && node < graph.size());
295  (*node_to_epoch)[node] = scc_node_to_epoch[i];
296  }
297  }
298 }
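
A usage sketch:

  std::vector<int32> node_to_epoch;
  ComputeNnetComputationEpochs(nnet, &node_to_epoch);
  // Nodes in the same strongly connected component (e.g. a recurrence)
  // share an epoch index; a node's dependencies always have an epoch
  // index that is less than or equal to its own.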

◆ ComputeObjectiveFunction()

void ComputeObjectiveFunction ( const GeneralMatrix &  supervision,
ObjectiveType  objective_type,
const std::string &  output_name,
bool  supply_deriv,
NnetComputer *  computer,
BaseFloat *  tot_weight,
BaseFloat *  tot_objf 
)

This function computes the objective function, and if supply_deriv = true, supplies its derivative to the NnetComputer object.

See also the function ComputeAccuracy(), declared in nnet-diagnostics.h.

Parameters
[in]  supervision  A GeneralMatrix, typically derived from a NnetExample, containing the supervision posteriors or features.
[in]  objective_type  The objective function type: kLinear = output * supervision, or kQuadratic = -0.5 * (output - supervision)^2. kLinear is used for softmax objectives, where the network ends in a LogSoftmax layer that correctly normalizes its output.
[in]  output_name  The name of the output node (e.g. "output"), used to look up the output in the NnetComputer object.
[in]  supply_deriv  If this is true, this function will compute the derivative of the objective function and supply it to the network using the function NnetComputer::AcceptInput() (as the code below does).
[in,out]  computer  The NnetComputer object, from which we get the output using GetOutput() and to which we may supply the derivatives using AcceptInput().
[out]  tot_weight  The total weight of the training examples. In the kLinear case, this is the sum of the supervision matrix; in the kQuadratic case, it is the number of rows of the supervision matrix. In order to make it possible to weight samples with quadratic objective functions, we may at some point make it possible for the supervision matrix to have an extra column containing weights. At the moment, this is not supported.
[out]  tot_objf  The total objective function; divide this by tot_weight to get the normalized objective function.

Definition at line 339 of file nnet-training.cc.

References NnetComputer::AcceptInput(), CuMatrixBase< Real >::CopyFromGeneralMat(), CuSparseMatrix< Real >::CopyToMat(), GeneralMatrix::GetFullMatrix(), GeneralMatrix::GetMatrix(), NnetComputer::GetOutput(), GeneralMatrix::GetSparseMatrix(), KALDI_ERR, kaldi::kCompressedMatrix, kaldi::kFullMatrix, kLinear, kQuadratic, kaldi::kSparseMatrix, kaldi::kTrans, kaldi::kUndefined, CuMatrixBase< Real >::NumCols(), GeneralMatrix::NumCols(), CuMatrixBase< Real >::NumRows(), GeneralMatrix::NumRows(), CuSparseMatrix< Real >::Sum(), CuMatrixBase< Real >::Sum(), CuMatrix< Real >::Swap(), kaldi::TraceMatMat(), kaldi::TraceMatSmat(), and GeneralMatrix::Type().

Referenced by NnetComputeProb::ProcessOutputs(), and NnetTrainer::ProcessOutputs().

345  {
346  const CuMatrixBase<BaseFloat> &output = computer->GetOutput(output_name);
347 
348  if (output.NumCols() != supervision.NumCols())
349  KALDI_ERR << "Nnet versus example output dimension (num-classes) "
350  << "mismatch for '" << output_name << "': " << output.NumCols()
351  << " (nnet) vs. " << supervision.NumCols() << " (egs)\n";
352 
353  switch (objective_type) {
354  case kLinear: {
355  // objective is x * y.
356  switch (supervision.Type()) {
357  case kSparseMatrix: {
358  const SparseMatrix<BaseFloat> &post = supervision.GetSparseMatrix();
359  CuSparseMatrix<BaseFloat> cu_post(post);
360  // The cross-entropy objective is computed by a simple dot product,
361  // because after the LogSoftmaxLayer, the output is already in the form
362  // of log-likelihoods that are normalized to sum to one.
363  *tot_weight = cu_post.Sum();
364  *tot_objf = TraceMatSmat(output, cu_post, kTrans);
365  if (supply_deriv) {
366  CuMatrix<BaseFloat> output_deriv(output.NumRows(), output.NumCols(),
367  kUndefined);
368  cu_post.CopyToMat(&output_deriv);
369  computer->AcceptInput(output_name, &output_deriv);
370  }
371  break;
372  }
373  case kFullMatrix: {
374  // there is a redundant matrix copy in here if we're not using a GPU
375  // but we don't anticipate this code branch being used in many cases.
376  CuMatrix<BaseFloat> cu_post(supervision.GetFullMatrix());
377  *tot_weight = cu_post.Sum();
378  *tot_objf = TraceMatMat(output, cu_post, kTrans);
379  if (supply_deriv)
380  computer->AcceptInput(output_name, &cu_post);
381  break;
382  }
383  case kCompressedMatrix: {
384  Matrix<BaseFloat> post;
385  supervision.GetMatrix(&post);
386  CuMatrix<BaseFloat> cu_post;
387  cu_post.Swap(&post);
388  *tot_weight = cu_post.Sum();
389  *tot_objf = TraceMatMat(output, cu_post, kTrans);
390  if (supply_deriv)
391  computer->AcceptInput(output_name, &cu_post);
392  break;
393  }
394  }
395  break;
396  }
397  case kQuadratic: {
398  // objective is -0.5 (x - y)^2
399  CuMatrix<BaseFloat> diff(supervision.NumRows(),
400  supervision.NumCols(),
401  kUndefined);
402  diff.CopyFromGeneralMat(supervision);
403  diff.AddMat(-1.0, output);
404  *tot_weight = diff.NumRows();
405  *tot_objf = -0.5 * TraceMatMat(diff, diff, kTrans);
406  if (supply_deriv)
407  computer->AcceptInput(output_name, &diff);
408  break;
409  }
410  default:
411  KALDI_ERR << "Objective function type " << objective_type
412  << " not handled.";
413  }
414 }
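
A usage sketch in the style of NnetTrainer::ProcessOutputs() (illustrative; 'supervision' and an NnetComputer 'computer' are assumed to be in scope):

  BaseFloat tot_weight, tot_objf;
  ComputeObjectiveFunction(supervision, kLinear, "output",
                           true /* supply_deriv */, &computer,
                           &tot_weight, &tot_objf);
  KALDI_LOG << "Objective: " << (tot_objf / tot_weight) << " per frame.";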

◆ ComputeObjf()

double kaldi::nnet3::ComputeObjf ( bool  batchnorm_test_mode,
bool  dropout_test_mode,
const std::vector< NnetExample > &  egs,
const Nnet &  nnet,
NnetComputeProb *  prob_computer 
)

Definition at line 35 of file nnet3-combine.cc.

References NnetComputeProb::Compute(), NnetComputeProb::GetTotalObjective(), KALDI_ASSERT, NnetComputeProb::Reset(), SetBatchnormTestMode(), and SetDropoutTestMode().

Referenced by main().

37  {
38  if (batchnorm_test_mode || dropout_test_mode) {
39  Nnet nnet_copy(nnet);
40  if (batchnorm_test_mode)
41  SetBatchnormTestMode(true, &nnet_copy);
42  if (dropout_test_mode)
43  SetDropoutTestMode(true, &nnet_copy);
44  NnetComputeProbOptions compute_prob_opts;
45  NnetComputeProb prob_computer_test(compute_prob_opts, nnet_copy);
46  return ComputeObjf(false, false, egs, nnet_copy, &prob_computer_test);
47  } else {
48  prob_computer->Reset();
49  std::vector<NnetExample>::const_iterator iter = egs.begin(),
50  end = egs.end();
51  for (; iter != end; ++iter)
52  prob_computer->Compute(*iter);
53  double tot_weights,
54  tot_objf = prob_computer->GetTotalObjective(&tot_weights);
55  KALDI_ASSERT(tot_weights > 0.0);
 56  // if tot_objf is inf or nan, return a -inf objective.
57  if (!(tot_objf == tot_objf && tot_objf - tot_objf == 0))
58  return -std::numeric_limits<double>::infinity();
59  // we prefer to deal with normalized objective functions.
60  return tot_objf / tot_weights;
61  }
62 }
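
A usage sketch along the lines of main() in nnet3-combine.cc (names illustrative):

  NnetComputeProbOptions opts;
  NnetComputeProb prob_computer(opts, nnet);
  double objf = ComputeObjf(false, false, egs, nnet, &prob_computer);
  KALDI_LOG << "Normalized objective: " << objf;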

◆ ComputeSimpleNnetContext()

void ComputeSimpleNnetContext ( const Nnet &  nnet,
int32 *  left_context,
int32 *  right_context 
)

ComputeSimpleNnetContext computes the left-context and right-context of a nnet.

The nnet must satisfy IsSimpleNnet(nnet).

It does this by constructing a ComputationRequest with a certain window of inputs available and working out which outputs can be computed from it. It does the same after shifting the time index of the output to all values 0, 1, ..., n-1, where n is the output of nnet.Modulus(); then it returns the largest left context and the largest right context that it infers from any of these computation requests.

Definition at line 146 of file nnet-utils.cc.

References ComputeSimpleNnetContextForShift(), IsSimpleNnet(), KALDI_ASSERT, KALDI_ERR, and Nnet::Modulus().

Referenced by ComputeExampleComputationRequestSimple(), CreateLoopedComputationRequestSimple(), CachingOptimizingCompiler::GetSimpleNnetContext(), Nnet::Info(), DecodableNnetSimpleLoopedInfo::Init(), main(), NnetBatchComputer::NnetBatchComputer(), NnetInfo(), AmNnetSimple::SetContext(), and UnitTestNnetContext().

148  {
149  KALDI_ASSERT(IsSimpleNnet(nnet));
150  int32 modulus = nnet.Modulus();
151  // modulus >= 1 is a number such that the network ought to be
152  // invariant to time shifts (of both the input and output) that
153  // are a multiple of this number. We need to test all shifts modulo
154  // this number in case the left and right context vary at all within
155  // this range.
156 
157  std::vector<int32> left_contexts(modulus + 1);
158  std::vector<int32> right_contexts(modulus + 1);
159 
160  // window_size is a number which needs to be greater than the total context
161  // of the nnet, else we won't be able to work out the context. Large window
162  // size will make this code slow, so we start off with small window size, and
163  // if it isn't enough, we keep doubling it up to a maximum.
164  int32 window_size = 40, max_window_size = 800;
165 
166  while (window_size < max_window_size) {
167 
168  // by going "<= modulus" instead of "< modulus" we do one more computation
169  // than we really need; it becomes a sanity check.
170  int32 input_start;
171  for (input_start = 0; input_start <= modulus; input_start++) {
172  if (!ComputeSimpleNnetContextForShift(nnet, input_start, window_size,
173  &(left_contexts[input_start]),
174  &(right_contexts[input_start])))
175  break;
176  }
177  if (input_start <= modulus) {
178  // We broke from the loop over 'input_start', which means there was
 179  // a failure in ComputeSimpleNnetContextForShift -- we assume at
180  // this point that it was because window_size was too small.
181  window_size *= 2;
182  continue;
183  }
184 
185  KALDI_ASSERT(left_contexts[0] == left_contexts[modulus] &&
186  "nnet does not have the properties we expect.");
187  KALDI_ASSERT(right_contexts[0] == right_contexts[modulus] &&
188  "nnet does not have the properties we expect.");
189  *left_context =
190  *std::max_element(left_contexts.begin(), left_contexts.end());
191  *right_context =
192  *std::max_element(right_contexts.begin(), right_contexts.end());
193  // Success.
194  return;
195  }
196  KALDI_ERR << "Failure in ComputeSimpleNnetContext (perhaps not a simple nnet?)";
197 }
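
A usage sketch:

  int32 left_context, right_context;
  ComputeSimpleNnetContext(nnet, &left_context, &right_context);
  // For a net whose output at time t needs input frames t-13 .. t+9,
  // this sets left_context = 13 and right_context = 9.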
kaldi::int32 int32
#define KALDI_ERR
Definition: kaldi-error.h:147
static bool ComputeSimpleNnetContextForShift(const Nnet &nnet, int32 input_start, int32 window_size, int32 *left_context, int32 *right_context)
Definition: nnet-utils.cc:92
bool IsSimpleNnet(const Nnet &nnet)
This function returns true if the nnet has the following properties: It has an output called "output"...
Definition: nnet-utils.cc:52
#define KALDI_ASSERT(cond)
Definition: kaldi-error.h:185
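
A minimal usage sketch (the model path is illustrative; ReadKaldiObject and KALDI_LOG are the standard Kaldi helpers):

  // Query the temporal context of a 'simple' nnet.
  Nnet nnet;
  ReadKaldiObject("final.raw", &nnet);  // "final.raw" is a hypothetical path
  int32 left_context, right_context;
  ComputeSimpleNnetContext(nnet, &left_context, &right_context);
  KALDI_LOG << "left-context=" << left_context
            << ", right-context=" << right_context;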

◆ ComputeSimpleNnetContextForShift()

static bool kaldi::nnet3::ComputeSimpleNnetContextForShift ( const Nnet &  nnet,
int32  input_start,
int32  window_size,
int32 *  left_context,
int32 *  right_context 
)
static

Definition at line 92 of file nnet-utils.cc.

References EvaluateComputationRequest(), Nnet::GetNodeIndex(), IoSpecification::indexes, ComputationRequest::inputs, KALDI_ASSERT, Nnet::Modulus(), rnnlm::n, IoSpecification::name, and ComputationRequest::outputs.

Referenced by ComputeSimpleNnetContext().

97  {
98 
99  int32 input_end = input_start + window_size;
100  IoSpecification input;
101  input.name = "input";
102  IoSpecification output;
103  output.name = "output";
104  IoSpecification ivector; // we might or might not use this.
105  ivector.name = "ivector";
106 
107  int32 n = rand() % 10;
 108  // in the IoSpecification, for now, we will request all the same indexes at
109  // output that we requested at input.
110  for (int32 t = input_start; t < input_end; t++) {
111  input.indexes.push_back(Index(n, t));
112  output.indexes.push_back(Index(n, t));
113  }
114 
115  // most networks will just require the ivector at time t = 0,
116  // but this might not always be the case, and some might use rounding
117  // descriptors with the iVector which might require it at an earlier
118  // frame than the regular input, so we provide the iVector in as wide a range
119  // as it might possibly be needed.
120  for (int32 t = input_start - nnet.Modulus(); t < input_end; t++) {
121  ivector.indexes.push_back(Index(n, t));
122  }
123 
124  ComputationRequest request;
125  request.inputs.push_back(input);
126  request.outputs.push_back(output);
127  if (nnet.GetNodeIndex("ivector") != -1)
128  request.inputs.push_back(ivector);
129  std::vector<std::vector<bool> > computable;
130  EvaluateComputationRequest(nnet, request, &computable);
131 
132  KALDI_ASSERT(computable.size() == 1);
133  std::vector<bool> &output_ok = computable[0];
134  std::vector<bool>::iterator iter =
135  std::find(output_ok.begin(), output_ok.end(), true);
136  int32 first_ok = iter - output_ok.begin();
137  int32 first_not_ok = std::find(iter, output_ok.end(), false) -
138  output_ok.begin();
139  if (first_ok == window_size || first_not_ok <= first_ok)
140  return false;
141  *left_context = first_ok;
142  *right_context = window_size - first_not_ok;
143  return true;
144 }
void EvaluateComputationRequest(const Nnet &nnet, const ComputationRequest &request, std::vector< std::vector< bool > > *is_computable)
Given an nnet and a computation request, this function works out which requested outputs in the compu...
Definition: nnet-utils.cc:71
kaldi::int32 int32
struct rnnlm::@11::@12 n
#define KALDI_ASSERT(cond)
Definition: kaldi-error.h:185

◆ ComputeTopSortOrder()

void ComputeTopSortOrder ( const std::vector< std::vector< int32 > > &  graph,
std::vector< int32 > *  node_to_order 
)

Given an acyclic graph (where each std::vector<int32> is a list of destination-nodes of arcs coming from the current node), compute a topological ordering of the graph nodes.

The output format is that node_to_order[n] contains an integer t = 0, 1, ... which is the order of node n in a topological sorting. node_to_order will contain some permutation of the numbers 0 ... graph.size() - 1. This function will crash if the graph contains cycles.

Definition at line 223 of file nnet-graph.cc.

References ComputeTopSortOrderRecursive(), rnnlm::i, and KALDI_ASSERT.

Referenced by ComputeNnetComputationEpochs(), UnitTestComputeTopSortOrder(), and UnitTestComputeTopSortOrder2().

224  {
 225  // Internally we use DFS, but we only add a node to <reversed_orders>
 226  // once all of its children have been visited.
227  KALDI_ASSERT(node_to_order != NULL);
228  node_to_order->resize(graph.size());
229 
230  std::vector<bool> cycle_detector(graph.size(), false);
231  std::vector<bool> is_visited(graph.size(), false);
232 
233  std::vector<int32> reversed_orders;
234  for(int32 i = 0; i < graph.size(); ++i) {
235  if (!is_visited[i]) {
236  ComputeTopSortOrderRecursive(i, graph, &cycle_detector,
237  &is_visited, &reversed_orders);
238  }
239  }
240 
241  KALDI_ASSERT(node_to_order->size() == reversed_orders.size());
242  for (int32 i = 0; i < reversed_orders.size(); ++i) {
243  KALDI_ASSERT(reversed_orders[i] >= 0 && reversed_orders[i] < graph.size());
244  (*node_to_order)[reversed_orders[i]] = graph.size() - i - 1;
245  }
246 }
void ComputeTopSortOrderRecursive(int32 node, const std::vector< std::vector< int32 > > &graph, std::vector< bool > *cycle_detector, std::vector< bool > *is_visited, std::vector< int32 > *reversed_orders)
Definition: nnet-graph.cc:196
kaldi::int32 int32
#define KALDI_ASSERT(cond)
Definition: kaldi-error.h:185
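
A concrete example of the input and output format (a sketch; the graph is arbitrary):

  // Topologically sort the acyclic graph 0 -> 1 -> 2.
  std::vector<std::vector<int32> > graph(3);
  graph[0].push_back(1);  // arc from node 0 to node 1
  graph[1].push_back(2);  // arc from node 1 to node 2
  std::vector<int32> node_to_order;
  ComputeTopSortOrder(graph, &node_to_order);
  // node_to_order is now {0, 1, 2}: node_to_order[n] is the position
  // of node n in the topological order, so node 0 comes first.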

◆ ComputeTopSortOrderRecursive()

void kaldi::nnet3::ComputeTopSortOrderRecursive ( int32  node,
const std::vector< std::vector< int32 > > &  graph,
std::vector< bool > *  cycle_detector,
std::vector< bool > *  is_visited,
std::vector< int32 > *  reversed_orders 
)

Definition at line 196 of file nnet-graph.cc.

References rnnlm::i, KALDI_ASSERT, and KALDI_ERR.

Referenced by ComputeTopSortOrder().

200  {
201  KALDI_ASSERT(node >= 0 && node < graph.size());
202  KALDI_ASSERT(cycle_detector != NULL);
203  KALDI_ASSERT(is_visited != NULL);
204  KALDI_ASSERT(reversed_orders != NULL);
205  if ((*cycle_detector)[node]) {
206  KALDI_ERR << "Cycle detected when computing the topological sorting order";
207  }
208 
209  if (!(*is_visited)[node]) {
210  (*cycle_detector)[node] = true;
211  for (int32 i = 0; i < graph[node].size(); ++i) {
212  ComputeTopSortOrderRecursive(graph[node][i], graph,
213  cycle_detector, is_visited, reversed_orders);
214  }
215  (*cycle_detector)[node] = false;
216  (*is_visited)[node] = true;
217  // At this point we have added all the children to <reversed_orders>, so we
218  // can add the current now.
219  reversed_orders->push_back(node);
220  }
221 }
void ComputeTopSortOrderRecursive(int32 node, const std::vector< std::vector< int32 > > &graph, std::vector< bool > *cycle_detector, std::vector< bool > *is_visited, std::vector< int32 > *reversed_orders)
Definition: nnet-graph.cc:196
kaldi::int32 int32
#define KALDI_ERR
Definition: kaldi-error.h:147
#define KALDI_ASSERT(cond)
Definition: kaldi-error.h:185

◆ ComputeVariableAccesses()

void ComputeVariableAccesses ( const ComputationVariables &  variables,
const std::vector< CommandAttributes > &  command_attributes,
std::vector< std::vector< Access > > *  variable_accesses 
)

After the command-level attributes have been computed, this function organizes them per variable (see class ComputationVariables for how a variable is defined; it is part of a matrix).

Parameters
[in]  variables            The definition of variables for this computation.
[in]  command_attributes   A vector of attributes, one per command, as obtained from ComputeCommandAttributes().
[out] variable_accesses    The output will have a size equal to the number of variables, and each element will be a vector of accesses, sorted by command index; each command will only be listed once in this vector.

Definition at line 421 of file nnet-analyze.cc.

References kaldi::IsSortedAndUniq(), KALDI_ASSERT, kReadAccess, kReadWriteAccess, kWriteAccess, ComputationVariables::NumVariables(), kaldi::SortAndUniq(), CommandAttributes::variables_read, and CommandAttributes::variables_written.

Referenced by Analyzer::Init(), MoveSizingCommands(), and Access::operator<().

424  {
425  int32 num_variables = variables.NumVariables(),
426  num_commands = command_attributes.size();
427  variable_accesses->clear();
428  variable_accesses->resize(num_variables);
429  for (int32 c = 0; c < num_commands; c++) {
430  const CommandAttributes &attr = command_attributes[c];
431  KALDI_ASSERT(IsSortedAndUniq(attr.variables_read));
432  KALDI_ASSERT(IsSortedAndUniq(attr.variables_written));
433  std::vector<int32> all_variables;
434  all_variables.reserve(attr.variables_read.size() +
435  attr.variables_written.size());
436  all_variables.insert(all_variables.end(), attr.variables_read.begin(),
437  attr.variables_read.end());
438  all_variables.insert(all_variables.end(), attr.variables_written.begin(),
439  attr.variables_written.end());
440  SortAndUniq(&all_variables);
441 
442  std::vector<int32>::const_iterator iter = all_variables.begin(),
443  end = all_variables.end();
444  for (; iter != end; ++iter) {
445  int32 variable_index = *iter;
446  bool is_read = std::binary_search(attr.variables_read.begin(),
447  attr.variables_read.end(),
448  variable_index),
449  is_written = (!is_read ? true :
450  std::binary_search(attr.variables_written.begin(),
451  attr.variables_written.end(),
452  variable_index));
453  if (is_read && is_written) {
454  (*variable_accesses)[variable_index].push_back(
455  Access(c, kReadWriteAccess));
456  } else if (is_read) {
457  (*variable_accesses)[variable_index].push_back(
458  Access(c, kReadAccess));
459  } else {
460  (*variable_accesses)[variable_index].push_back(
461  Access(c, kWriteAccess));
462  }
463  }
464  }
465 }
kaldi::int32 int32
void SortAndUniq(std::vector< T > *vec)
Sorts and uniq&#39;s (removes duplicates) from a vector.
Definition: stl-utils.h:39
#define KALDI_ASSERT(cond)
Definition: kaldi-error.h:185
bool IsSortedAndUniq(const std::vector< T > &vec)
Returns true if the vector is sorted and contains each element only once.
Definition: stl-utils.h:63
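
The typical call sequence is the one used by Analyzer::Init(); a sketch, assuming 'nnet' is an Nnet and 'computation' is an already-compiled NnetComputation:

  ComputationVariables variables;
  variables.Init(computation);
  std::vector<CommandAttributes> attributes;
  ComputeCommandAttributes(nnet, computation, variables, &attributes);
  std::vector<std::vector<Access> > variable_accesses;
  ComputeVariableAccesses(variables, attributes, &variable_accesses);
  // variable_accesses[v] now lists, ordered by command index, the
  // commands that read and/or write variable v.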

◆ ConsolidateIoOperations()

void ConsolidateIoOperations ( const Nnet &  nnet,
NnetComputation *  computation 
)

This optimization puts the input operations (kAcceptInput) and output operations (kProvideOutput) at the very beginning or end of segments of computation, respectively.

This is actually necessary for computations to be run easily, because if these commands were interspersed with the regular commands, you'd have to call computer.Run() between the individual AcceptInput() and GetOutput() function calls.

Definition at line 869 of file nnet-optimize.cc.

References NnetComputation::commands, kAcceptInput, KALDI_ASSERT, kNoOperationMarker, kProvideOutput, and SplitComputationIntoSegments().

Referenced by Compiler::CreateComputation(), and Optimize().

870  {
871  // These segments, represented as (start-index, end-index),
872  // are segments of the computation separated by kNoOperationMarker.
873  std::vector<std::pair<int32, int32> > segments;
874  SplitComputationIntoSegments(*computation, &segments);
875 
876  int32 num_commands = computation->commands.size();
877  std::vector<NnetComputation::Command> reordered_commands(num_commands);
878  // put kNoOperationMarker between all segments in the reordered commands.
879  for (size_t s = 0; s + 1 < segments.size(); s++)
880  reordered_commands[segments[s].second].command_type = kNoOperationMarker;
881 
882  // for each segment we'll divide the commands up into those that must appear
883  // at the left of the segment (kAcceptInput for inputs and output-derivs), those
884  // that must appear in the middle (most commands), those that must appear
885  // on the right (kProvideOutput for output nodes and input derivatives).
886  std::vector<int32> left_commands, middle_commands, right_commands;
887 
888  for (size_t s = 0; s < segments.size(); s++) {
889  int32 segment_start = segments[s].first,
890  segment_end = segments[s].second;
891  left_commands.clear();
892  middle_commands.clear();
893  right_commands.clear();
894  for (int32 c = segment_start; c < segment_end; c++) {
895  if (computation->commands[c].command_type == kProvideOutput) {
896  right_commands.push_back(c);
897  } else if (computation->commands[c].command_type == kAcceptInput) {
898  left_commands.push_back(c);
899  } else {
900  middle_commands.push_back(c);
901  }
902  }
903  std::vector<int32>::const_iterator iter = left_commands.begin(),
904  end = left_commands.end();
905  int32 c = segment_start;
906  for (; iter != end; ++iter, ++c)
907  reordered_commands[c] = computation->commands[*iter];
908  iter = middle_commands.begin();
909  end = middle_commands.end();
910  for (; iter != end; ++iter, ++c)
911  reordered_commands[c] = computation->commands[*iter];
912  iter = right_commands.begin();
913  end = right_commands.end();
914  for (; iter != end; ++iter, ++c)
915  reordered_commands[c] = computation->commands[*iter];
916  KALDI_ASSERT(c == segment_end);
917  }
918  computation->commands.swap(reordered_commands);
919 }
kaldi::int32 int32
#define KALDI_ASSERT(cond)
Definition: kaldi-error.h:185
static void SplitComputationIntoSegments(const NnetComputation &computation, std::vector< std::pair< int32, int32 > > *segments)
Split the computation up into segments bounded by kNoOperationMarker.
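
To see why this ordering matters at runtime, here is a sketch of how a consolidated computation is driven ('computation', 'nnet' and 'input_feats' are assumed to already exist):

  NnetComputeOptions opts;
  NnetComputer computer(opts, computation, nnet, NULL);
  // All kAcceptInput commands are at the start of the segment, so we can
  // provide every input first, run once, then collect the outputs.
  computer.AcceptInput("input", &input_feats);  // consumes input_feats
  computer.Run();
  CuMatrix<BaseFloat> output;
  computer.GetOutputDestructive("output", &output);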

◆ ConsolidateMemory()

void ConsolidateMemory ( Nnet *  nnet )

This just calls ConsolidateMemory() on all the components of the nnet.

This is called by the training code after processing the first minibatch. On some components this will do nothing; on some components it will reallocate certain quantities that have been allocated during training (mostly the contents of NaturalGradientOnline objects, and stats for NonlinearComponents) so that they can be put into low memory. This will tend to minimize memory fragmentation. Read comments in ../cudamatrix/cu-allocator.h for more explanation.

Definition at line 1147 of file nnet-utils.cc.

References Component::ConsolidateMemory(), Nnet::GetComponent(), kaldi::GetVerboseLevel(), KALDI_VLOG, and Nnet::NumComponents().

Referenced by CollapseModelConfig::CollapseModelConfig(), NonlinearComponent::OutputDim(), NnetChainTrainer::Train(), and NnetTrainer::Train().

1147  {
1148 #if HAVE_CUDA == 1
1149  if (CuDevice::Instantiate().Enabled()) {
1150  bool print_memory_info = (GetVerboseLevel() >= 1);
1151  if (print_memory_info) {
1152  KALDI_VLOG(1) << "Consolidating memory; will print memory usage before "
1153  "and after consolidating:";
1154  g_cuda_allocator.PrintMemoryUsage();
1155  }
1156  for (int32 c = 0; c < nnet->NumComponents(); c++) {
1157  Component *comp = nnet->GetComponent(c);
1158  comp->ConsolidateMemory();
1159  }
1160  if (print_memory_info) {
1161  g_cuda_allocator.PrintMemoryUsage();
1162  }
1163  }
1164 #endif
1165 }
int32 GetVerboseLevel()
Get verbosity level, usually set via command line &#39;–verbose=&#39; switch.
Definition: kaldi-error.h:60
kaldi::int32 int32
#define KALDI_VLOG(v)
Definition: kaldi-error.h:156
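
A sketch of the intended usage pattern, mirroring the trainers (ProcessMinibatch is a hypothetical stand-in for one training step):

  bool first_minibatch = true;
  while (ProcessMinibatch(&nnet)) {  // hypothetical training step
    if (first_minibatch) {
      // After the first minibatch, the training-time buffers have all
      // been allocated, so this is the moment to consolidate them.
      ConsolidateMemory(&nnet);
      first_minibatch = false;
    }
  }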

◆ ConsolidateModelUpdate()

void ConsolidateModelUpdate ( const Nnet &  nnet,
NnetComputation *  computation 
)

This optimization consolidates the model-update part of backprop commands, for components in (e.g.) recurrent networks that need to have many separate backprop commands, into more efficient single commands operating on consolidated data in larger matrices.

This consolidates the model-update parts of the backprop into larger operations (applicable mostly to recurrent setups) -- internally it uses class ModelUpdateConsolidator.

This is useful for recurrent networks. The resulting computation separates the backprop for data-derivatives from the model-update part of backprop.

Will fail if called a second time.

Definition at line 1551 of file nnet-optimize-utils.cc.

References ModelUpdateConsolidator::ConsolidateModelUpdate(), and NnetComputation::need_model_derivative.

Referenced by Optimize().

1552  {
1553  // This following if-statement is an optimization: if the computation
1554  // request(s) had need_model_derivative == false, there would be nothing to
1555  // optimize, so don't bother trying.
1556  if (!computation->need_model_derivative)
1557  return;
1558  ModelUpdateConsolidator consolidator(nnet, computation);
1559  consolidator.ConsolidateModelUpdate();
1560 }

◆ ConstrainOrthonormal()

void ConstrainOrthonormal ( Nnet *  nnet )

This function, to be called after processing every minibatch, is responsible for enforcing the orthogonality constraint for any components of type LinearComponent or inheriting from AffineComponent that have the "orthonormal_constraint" value set.

This function, to be called after processing every minibatch, is responsible for enforcing the orthogonality constraint for any components of type LinearComponent or inheriting from AffineComponent that have the "orthonormal-constraint" value set to a nonzero value.

Technically what we are doing is constraining the parameter matrix M to be a "semi-orthogonal" matrix times a constant alpha. That is: if num-rows > num-cols, this amounts to asserting that M M^T == alpha^2 I; otherwise, that M^T M == alpha^2 I.

If, for a particular component, orthonormal-constraint > 0.0, then that value becomes the "alpha" mentioned above. If orthonormal-constraint == 0.0, then nothing is done. If orthonormal-constraint < 0.0, then it's like letting alpha "float", i.e. we try to make M closer to (any constant alpha) times a semi-orthogonal matrix.

To keep this efficient on GPU, it does not make the matrix exactly orthonormal; it only moves it closer to being orthonormal (times the 'orthonormal-constraint' value). Over multiple iterations this rapidly makes it almost exactly orthonormal.

See http://www.danielpovey.com/files/2018_interspeech_tdnnf.pdf

Definition at line 1108 of file nnet-utils.cc.

References ConstrainOrthonormalInternal(), CuMatrixBase< Real >::CopyFromMat(), Nnet::GetComponent(), kaldi::kTrans, AffineComponent::LinearParams(), TdnnComponent::LinearParams(), CuMatrixBase< Real >::NumCols(), Nnet::NumComponents(), CuMatrixBase< Real >::NumRows(), AffineComponent::OrthonormalConstraint(), TdnnComponent::OrthonormalConstraint(), LinearComponent::OrthonormalConstraint(), LinearComponent::Params(), and kaldi::RandInt().

Referenced by CollapseModelConfig::CollapseModelConfig(), NnetChainTrainer::TrainInternal(), NnetTrainer::TrainInternal(), NnetChainTrainer::TrainInternalBackstitch(), and NnetTrainer::TrainInternalBackstitch().

1108  {
1109 
1110  for (int32 c = 0; c < nnet->NumComponents(); c++) {
1111  Component *component = nnet->GetComponent(c);
1112  CuMatrixBase<BaseFloat> *params = NULL;
1113  BaseFloat orthonormal_constraint = 0.0;
1114 
1115  LinearComponent *lc = dynamic_cast<LinearComponent*>(component);
1116  if (lc != NULL && lc->OrthonormalConstraint() != 0.0) {
1117  orthonormal_constraint = lc->OrthonormalConstraint();
1118  params = &(lc->Params());
1119  }
1120  AffineComponent *ac = dynamic_cast<AffineComponent*>(component);
1121  if (ac != NULL && ac->OrthonormalConstraint() != 0.0) {
1122  orthonormal_constraint = ac->OrthonormalConstraint();
1123  params = &(ac->LinearParams());
1124  }
1125  TdnnComponent *tc = dynamic_cast<TdnnComponent*>(component);
1126  if (tc != NULL && tc->OrthonormalConstraint() != 0.0) {
1127  orthonormal_constraint = tc->OrthonormalConstraint();
1128  params = &(tc->LinearParams());
1129  }
1130  if (orthonormal_constraint == 0.0 || RandInt(0, 3) != 0) {
1131  // For efficiency, only do this every 4 or so minibatches-- it won't have
 1132  // time to stray far from the constraint in between.
1133  continue;
1134  }
1135 
1136  int32 rows = params->NumRows(), cols = params->NumCols();
1137  if (rows <= cols) {
1138  ConstrainOrthonormalInternal(orthonormal_constraint, params);
1139  } else {
1140  CuMatrix<BaseFloat> params_trans(*params, kTrans);
1141  ConstrainOrthonormalInternal(orthonormal_constraint, &params_trans);
1142  params->CopyFromMat(params_trans, kTrans);
1143  }
1144  }
1145 }
kaldi::int32 int32
void ConstrainOrthonormalInternal(BaseFloat scale, CuMatrixBase< BaseFloat > *M)
Definition: nnet-utils.cc:982
float BaseFloat
Definition: kaldi-types.h:29
int32 RandInt(int32 min_val, int32 max_val, struct RandomState *state)
Definition: kaldi-math.cc:95
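
Usage sketch, mirroring NnetTrainer::TrainInternal() (TrainOneMinibatch is a hypothetical stand-in for the parameter update):

  // Called once per minibatch, after the parameter update.
  TrainOneMinibatch(&nnet, minibatch);  // hypothetical training step
  // Nudges any component with orthonormal-constraint != 0 back toward
  // (a scalar times) a semi-orthogonal parameter matrix.
  ConstrainOrthonormal(&nnet);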

◆ ConstrainOrthonormalInternal()

void kaldi::nnet3::ConstrainOrthonormalInternal ( BaseFloat  scale,
CuMatrixBase< BaseFloat > *  M 
)

Definition at line 982 of file nnet-utils.cc.

References CuMatrixBase< Real >::AddMat(), CuMatrixBase< Real >::AddMatMat(), CuMatrixBase< Real >::AddToDiag(), CuMatrixBase< Real >::CopyLowerToUpper(), CuMatrixBase< Real >::FrobeniusNorm(), kaldi::GetVerboseLevel(), KALDI_ASSERT, KALDI_VLOG, kaldi::kNoTrans, kaldi::kTrans, CuMatrixBase< Real >::NumCols(), CuMatrixBase< Real >::NumRows(), CuMatrixBase< Real >::SymAddMat2(), CuMatrixBase< Real >::Trace(), and kaldi::TraceMatMat().

Referenced by ConstrainOrthonormal().

982  {
983  KALDI_ASSERT(scale != 0.0);
984 
985  // We'd like to enforce the rows of M to be orthonormal.
986  // define P = M M^T. If P is unit then M has orthonormal rows.
987  // We actually want P to equal scale^2 * I, so that M's rows are
988  // orthogonal with 2-norms equal to 'scale'.
989  // We (notionally) add to the objective function, the value
990  // -alpha times the sum of squared elements of Q = (P - scale^2 * I).
991  int32 rows = M->NumRows(), cols = M->NumCols();
992  CuMatrix<BaseFloat> M_update(rows, cols);
993  CuMatrix<BaseFloat> P(rows, rows);
994  P.SymAddMat2(1.0, *M, kNoTrans, 0.0);
995  P.CopyLowerToUpper();
996 
997  // The 'update_speed' is a constant that determines how fast we approach a
998  // matrix with the desired properties (larger -> faster). Larger values will
999  // update faster but will be more prone to instability. 0.125 (1/8) is the
1000  // value that gives us the fastest possible convergence when we are already
 1001  // close to being a semi-orthogonal matrix (in fact, it will lead to quadratic
1002  // convergence).
1003  // See http://www.danielpovey.com/files/2018_interspeech_tdnnf.pdf
1004  // for more details.
1005  BaseFloat update_speed = 0.125;
1006  bool floating_scale = (scale < 0.0);
1007 
1008 
1009  if (floating_scale) {
1010  // This (letting the scale "float") is described in Sec. 2.3 of
1011  // http://www.danielpovey.com/files/2018_interspeech_tdnnf.pdf,
1012  // where 'scale' here is written 'alpha' in the paper.
1013  //
1014  // We pick the scale that will give us an update to M that is
1015  // orthogonal to M (viewed as a vector): i.e., if we're doing
1016  // an update M := M + X, then we want to have tr(M X^T) == 0.
1017  // The following formula is what gives us that.
 1018  // With P = M M^T, our update formula is going to be:
1019  // M := M + (-4 * alpha * (P - scale^2 I) * M).
1020  // (The math below explains this update formula; for now, it's
1021  // best to view it as an established fact).
1022  // So X (the change in M) is -4 * alpha * (P - scale^2 I) * M,
1023  // where alpha == update_speed / scale^2.
1024  // We want tr(M X^T) == 0. First, forget the -4*alpha, because
1025  // we don't care about constant factors. So we want:
1026  // tr(M * M^T * (P - scale^2 I)) == 0.
1027  // Since M M^T == P, that means:
1028  // tr(P^2 - scale^2 P) == 0,
1029  // or scale^2 = tr(P^2) / tr(P).
1030  // Note: P is symmetric so it doesn't matter whether we use tr(P P) or
1031  // tr(P^T P); we use tr(P^T P) because I believe it's faster to compute.
1032 
1033  BaseFloat trace_P = P.Trace(), trace_P_P = TraceMatMat(P, P, kTrans);
1034 
1035  scale = std::sqrt(trace_P_P / trace_P);
1036 
1037  // The following is a tweak to avoid divergence when the eigenvalues aren't
1038  // close to being the same. trace_P is the sum of eigenvalues of P, and
1039  // trace_P_P is the sum-square of eigenvalues of P. Treat trace_P as a sum
1040  // of positive values, and trace_P_P as their sumsq. Then mean = trace_P /
1041  // dim, and trace_P_P cannot be less than dim * (trace_P / dim)^2,
1042  // i.e. trace_P_P >= trace_P^2 / dim. If ratio = trace_P_P * dim /
1043  // trace_P^2, then ratio >= 1.0, and the excess above 1.0 is a measure of
1044  // how far we are from convergence. If we're far from convergence, we make
1045  // the learning rate slower to reduce the risk of divergence, since the
1046  // update may not be stable for starting points far from equilibrium.
1047  BaseFloat ratio = (trace_P_P * P.NumRows() / (trace_P * trace_P));
1048  KALDI_ASSERT(ratio > 0.99);
1049  if (ratio > 1.02) {
1050  update_speed *= 0.5; // Slow down the update speed to reduce the risk of divergence.
1051  if (ratio > 1.1) update_speed *= 0.5; // Slow it down even more.
1052  }
1053  }
1054 
1055  P.AddToDiag(-1.0 * scale * scale);
1056 
1057  // We may want to un-comment the following code block later on if we have a
1058  // problem with instability in setups with a non-floating orthonormal
1059  // constraint.
1060  /*
1061  if (!floating_scale) {
1062  // This is analogous to the stuff with 'ratio' above, but when we don't have
1063  // a floating scale. It reduces the chances of divergence when we have
1064  // a bad initialization.
1065  BaseFloat error = P.FrobeniusNorm(),
1066  error_proportion = error * error / P.NumRows();
1067  // 'error_proportion' is the sumsq of elements in (P - I) divided by the
1068  // sumsq of elements of I. It should be much less than one (i.e. close to
1069  // zero) if the error is small.
1070  if (error_proportion > 0.02) {
1071  update_speed *= 0.5;
1072  if (error_proportion > 0.1)
1073  update_speed *= 0.5;
1074  }
1075  }
1076  */
1077 
1078  if (GetVerboseLevel() >= 1) {
1079  BaseFloat error = P.FrobeniusNorm();
1080  KALDI_VLOG(2) << "Error in orthogonality is " << error;
1081  }
1082 
1083  // see Sec. 2.2 of http://www.danielpovey.com/files/2018_interspeech_tdnnf.pdf
1084  // for explanation of the 1/(scale*scale) factor, but there is a difference in
1085  // notation; 'scale' here corresponds to 'alpha' in the paper, and
1086  // 'update_speed' corresponds to 'nu' in the paper.
1087  BaseFloat alpha = update_speed / (scale * scale);
1088 
1089  // At this point, the matrix P contains what, in the math, would be Q =
1090  // P-scale^2*I. The derivative of the objective function w.r.t. an element q(i,j)
1091  // of Q is now equal to -2*alpha*q(i,j), i.e. we could write q_deriv(i,j)
 1092  // = -2*alpha*q(i,j). This is also the derivative of the objective function
1093  // w.r.t. p(i,j): i.e. p_deriv(i,j) = -2*alpha*q(i,j).
 1094  // Suppose we have defined this matrix as 'P_deriv'.
1095  // The derivative of the objective w.r.t M equals
1096  // 2 * P_deriv * M, which equals -4*alpha*(P-scale^2*I)*M.
1097  // (Currently the matrix P contains what, in the math, is P-scale^2*I).
1098  M_update.AddMatMat(-4.0 * alpha, P, kNoTrans, *M, kNoTrans, 0.0);
1099  M->AddMat(1.0, M_update);
1100 }
int32 GetVerboseLevel()
Get verbosity level, usually set via command line &#39;–verbose=&#39; switch.
Definition: kaldi-error.h:60
kaldi::int32 int32
float BaseFloat
Definition: kaldi-types.h:29
Real TraceMatMat(const MatrixBase< Real > &A, const MatrixBase< Real > &B, MatrixTransposeType trans)
We need to declare this here as it will be a friend function.
#define KALDI_ASSERT(cond)
Definition: kaldi-error.h:185
#define KALDI_VLOG(v)
Definition: kaldi-error.h:156

◆ ContainsSingleExample()

bool kaldi::nnet3::ContainsSingleExample ( const NnetExample &  eg,
int32 *  min_input_t,
int32 *  max_input_t,
int32 *  min_output_t,
int32 *  max_output_t 
)

Returns true if the "eg" contains just a single example, meaning that all the "n" values in the indexes are zero, and the example has NnetIo members named both "input" and "output".

Also computes the minimum and maximum "t" values in the "input" and "output" NnetIo members.

Definition at line 82 of file nnet3-copy-egs.cc.

References rnnlm::i, NnetIo::indexes, NnetExample::io, KALDI_ASSERT, KALDI_WARN, and NnetIo::name.

Referenced by SelectFromExample().

86  {
87  bool done_input = false, done_output = false;
88  int32 num_indexes = eg.io.size();
89  for (int32 i = 0; i < num_indexes; i++) {
90  const NnetIo &io = eg.io[i];
91  std::vector<Index>::const_iterator iter = io.indexes.begin(),
92  end = io.indexes.end();
93  // Should not have an empty input/output type.
94  KALDI_ASSERT(!io.indexes.empty());
95  if (io.name == "input" || io.name == "output") {
96  int32 min_t = iter->t, max_t = iter->t;
97  for (; iter != end; ++iter) {
98  int32 this_t = iter->t;
99  min_t = std::min(min_t, this_t);
100  max_t = std::max(max_t, this_t);
101  if (iter->n != 0) {
102  KALDI_WARN << "Example does not contain just a single example; "
103  << "too late to do frame selection or reduce context.";
104  return false;
105  }
106  }
107  if (io.name == "input") {
108  done_input = true;
109  *min_input_t = min_t;
110  *max_input_t = max_t;
111  } else {
112  KALDI_ASSERT(io.name == "output");
113  done_output = true;
114  *min_output_t = min_t;
115  *max_output_t = max_t;
116  }
117  } else {
118  for (; iter != end; ++iter) {
119  if (iter->n != 0) {
120  KALDI_WARN << "Example does not contain just a single example; "
121  << "too late to do frame selection or reduce context.";
122  return false;
123  }
124  }
125  }
126  }
127  if (!done_input) {
128  KALDI_WARN << "Example does not have any input named 'input'";
129  return false;
130  }
131  if (!done_output) {
132  KALDI_WARN << "Example does not have any output named 'output'";
133  return false;
134  }
135  return true;
136 }
kaldi::int32 int32
std::vector< Index > indexes
"indexes" is a vector the same length as features.NumRows(), explaining the meaning of each row of th...
Definition: nnet-example.h:42
#define KALDI_WARN
Definition: kaldi-error.h:150
#define KALDI_ASSERT(cond)
Definition: kaldi-error.h:185
std::string name
the name of the input in the neural net; in simple setups it will just be "input".
Definition: nnet-example.h:36
std::vector< NnetIo > io
"io" contains the input and output.
Definition: nnet-example.h:116
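
A sketch of how it is used for frame selection in nnet3-copy-egs ('eg' is an NnetExample just read from an archive):

  int32 min_input_t, max_input_t, min_output_t, max_output_t;
  if (ContainsSingleExample(eg, &min_input_t, &max_input_t,
                            &min_output_t, &max_output_t)) {
    // Safe to select frames or reduce context using the t ranges.
  } else {
    // 'eg' was already merged into a minibatch (some n != 0), so
    // frame selection is no longer possible.
  }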

◆ ConvertAdditionToAssignment()

void ConvertAdditionToAssignment ( const Nnet &  nnet,
NnetComputation *  computation 
)

This converts addition operations (things with Add in their names) to copy operations (things with Copy in their names).

This is slightly more efficient, and it may later allow us to remove unnecessary zeroing.

Definition at line 430 of file nnet-optimize.cc.

References NnetComputation::Command::alpha, Analyzer::command_attributes, NnetComputation::Command::command_type, NnetComputation::commands, ComputationAnalysis::FirstNontrivialAccess(), Analyzer::Init(), kAddRows, kAddRowsMulti, kAddToRowsMulti, KALDI_ASSERT, KALDI_ERR, kCopyRows, kCopyRowsMulti, kCopyToRowsMulti, kMatrixAdd, and kMatrixCopy.

Referenced by Optimize().

431  {
432  Analyzer analyzer;
433  analyzer.Init(nnet, *computation);
434  ComputationAnalysis analysis(*computation, analyzer);
435  int32 num_commands = computation->commands.size();
436  for (int32 command = 0; command < num_commands; command++) {
437  NnetComputation::Command &c = computation->commands[command];
438  switch (c.command_type) {
439  case kMatrixAdd: case kAddRows: case kAddRowsMulti:
440  case kAddToRowsMulti: {
441  const std::vector<int32> &submatrices_written =
442  analyzer.command_attributes[command].submatrices_written;
443  KALDI_ASSERT(!submatrices_written.empty());
444  std::vector<int32>::const_iterator iter = submatrices_written.begin(),
445  end = submatrices_written.end();
446  bool can_convert = true;
447  for (; iter != end; ++iter) {
448  int32 submatrix_written = *iter;
449  int32 first_access_command = analysis.FirstNontrivialAccess(
450  submatrix_written);
451  // first_access_command is first command other than zeroing and
452  // allocation that accesses this submatrix. It can be assumed to be a
453  // write command, since it makes no sense to read a variable before
454  // it's written to. If it's before this command then we need to add
455  // rather than copy; we can't do the conversion to a copy command.
456  if (first_access_command != command) {
457  can_convert = false;
458  break;
459  }
460  }
461  if (can_convert) { // convert to a copy command.
462  switch (c.command_type) {
463  case kMatrixAdd: c.command_type = kMatrixCopy;
464  break;
465  case kAddRows: c.command_type = kCopyRows;
466  break;
467  case kAddRowsMulti: c.command_type = kCopyRowsMulti;
468  break;
469  // note: kCopyToRowsMulti does not currently support alpha != 1.0.
470  case kAddToRowsMulti: if (c.alpha == 1.0) c.command_type = kCopyToRowsMulti;
471  break;
472  default: KALDI_ERR << "Unexpected command type.";
473  }
474  }
475  break;
476  }
477  default:
478  break;
479  }
480  }
481 }
kaldi::int32 int32
#define KALDI_ERR
Definition: kaldi-error.h:147
#define KALDI_ASSERT(cond)
Definition: kaldi-error.h:185

◆ ConvertNumNValues()

static void kaldi::nnet3::ConvertNumNValues ( int32  n_stride,
int32  old_N,
int32  new_N,
const std::vector< Index > &  indexes_in,
std::vector< Index > *  indexes_out 
)
static

Definition at line 3106 of file nnet-optimize-utils.cc.

References KALDI_ASSERT, Index::n, and rnnlm::n.

Referenced by ComputationExpander::ExpandIndexes(), and IoSpecificationIsDecomposable().

3108  {
3109  int32 size_in = indexes_in.size();
3110  KALDI_ASSERT(size_in > 0 && indexes_in[size_in - 1].n == old_N - 1);
3111  int32 block_size_in = n_stride * old_N,
3112  block_size_out = n_stride * new_N;
3113 
3114  indexes_out->resize((size_in / old_N) * new_N);
3115  for (int32 i_in = 0; i_in < size_in; i_in++) {
3116  if (indexes_in[i_in].n != 0)
3117  continue;
3118  Index index(indexes_in[i_in]);
3119  int32 block_index = i_in / block_size_in,
3120  offset_within_block = i_in % block_size_in;
3121 
3122 
3123  int32 i_out = block_index * block_size_out +
3124  offset_within_block;
3125  for (int32 n = 0; n < new_N; n++, i_out += n_stride) {
3126  index.n = n;
3127  (*indexes_out)[i_out] = index;
3128  }
3129  }
3130 }
kaldi::int32 int32
struct rnnlm::@11::@12 n
#define KALDI_ASSERT(cond)
Definition: kaldi-error.h:185

◆ ConvertRepeatedToBlockAffine() [1/2]

void kaldi::nnet3::ConvertRepeatedToBlockAffine ( CompositeComponent *  c_component )

Definition at line 447 of file nnet-utils.cc.

References CompositeComponent::GetComponent(), rnnlm::i, KALDI_ASSERT, CompositeComponent::NumComponents(), CompositeComponent::SetComponent(), and Component::Type().

Referenced by ConvertRepeatedToBlockAffine(), main(), UnitTestConvertRepeatedToBlockAffine(), and UnitTestConvertRepeatedToBlockAffineComposite().

447  {
448  for(int32 i = 0; i < c_component->NumComponents(); i++) {
449  const Component *c = c_component->GetComponent(i);
450  KALDI_ASSERT(c->Type() != "CompositeComponent" &&
451  "Nesting CompositeComponent within CompositeComponent is not allowed.\n"
452  "(We may change this as more complicated components are introduced.)");
453 
454  if(c->Type() == "RepeatedAffineComponent" ||
455  c->Type() == "NaturalGradientRepeatedAffineComponent") {
456  // N.B.: NaturalGradientRepeatedAffineComponent is a subclass of
457  // RepeatedAffineComponent.
458  const RepeatedAffineComponent *rac =
459  dynamic_cast<const RepeatedAffineComponent*>(c);
460  KALDI_ASSERT(rac != NULL);
461  BlockAffineComponent *bac = new BlockAffineComponent(*rac);
462  // following call deletes rac
463  c_component->SetComponent(i, bac);
464  }
465  }
466 }
kaldi::int32 int32
#define KALDI_ASSERT(cond)
Definition: kaldi-error.h:185

◆ ConvertRepeatedToBlockAffine() [2/2]

void ConvertRepeatedToBlockAffine ( Nnet *  nnet )

Convert all components of type RepeatedAffineComponent or NaturalGradientRepeatedAffineComponent to BlockAffineComponent in nnet.

Definition at line 468 of file nnet-utils.cc.

References ConvertRepeatedToBlockAffine(), Nnet::GetComponent(), rnnlm::i, KALDI_ASSERT, Nnet::NumComponents(), Nnet::SetComponent(), and Component::Type().

468  {
469  for(int32 i = 0; i < nnet->NumComponents(); i++) {
470  const Component *const_c = nnet->GetComponent(i);
471  if(const_c->Type() == "RepeatedAffineComponent" ||
472  const_c->Type() == "NaturalGradientRepeatedAffineComponent") {
473  // N.B.: NaturalGradientRepeatedAffineComponent is a subclass of
474  // RepeatedAffineComponent.
475  const RepeatedAffineComponent *rac =
476  dynamic_cast<const RepeatedAffineComponent*>(const_c);
477  KALDI_ASSERT(rac != NULL);
478  BlockAffineComponent *bac = new BlockAffineComponent(*rac);
479  // following call deletes rac
480  nnet->SetComponent(i, bac);
481  } else if (const_c->Type() == "CompositeComponent") {
482  // We must modify the composite component, so we use the
483  // non-const GetComponent() call here.
484  Component *c = nnet->GetComponent(i);
485  CompositeComponent *cc = dynamic_cast<CompositeComponent*>(c);
 486  KALDI_ASSERT(cc != NULL);
 487  ConvertRepeatedToBlockAffine(cc);
 488  }
489  }
490 }
void ConvertRepeatedToBlockAffine(Nnet *nnet)
Convert all components of type RepeatedAffineComponent or NaturalGradientRepeatedAffineComponent to B...
Definition: nnet-utils.cc:468
kaldi::int32 int32
#define KALDI_ASSERT(cond)
Definition: kaldi-error.h:185
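
A usage sketch (the file names are illustrative; ReadKaldiObject/WriteKaldiObject are the standard Kaldi I/O helpers):

  Nnet nnet;
  ReadKaldiObject("final.raw", &nnet);
  // Replaces every (NaturalGradient)RepeatedAffineComponent with an
  // equivalent BlockAffineComponent, descending into CompositeComponents.
  ConvertRepeatedToBlockAffine(&nnet);
  WriteKaldiObject(nnet, "final_block.raw", true /* binary */);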

◆ ConvertToIndexes()

bool ConvertToIndexes ( const std::vector< std::pair< int32, int32 > > &  location_vector,
int32 *  first_value,
std::vector< int32 > *  second_values 
)

If it is the case for some i >= 0 that all the .first elements of "location_vector" are either i or -1, then output i to first_value and the .second elements into "second_values", and return true.

Otherwise return false and the outputs are don't-cares.

Definition at line 190 of file nnet-compile-utils.cc.

Referenced by Compiler::CompileBackwardFromSubmatLocations(), Compiler::CompileForwardFromSubmatLocations(), SplitLocationsBackward(), UnitTestSplitLocations(), and UnitTestSplitLocationsBackward().

193  {
194  *first_value = -1;
195  second_values->clear();
196  second_values->reserve(location_vector.size());
197  std::vector<std::pair<int32, int32> >::const_iterator iter;
198  for (iter = location_vector.begin(); iter < location_vector.end(); ++iter) {
199  if (iter->first != -1) {
200  if (*first_value == -1)
201  *first_value = iter->first;
202  if (iter->first != *first_value)
203  return false;
204  second_values->push_back(iter->second);
205  } else {
206  second_values->push_back(-1);
207  }
208  }
209  return true;
210 }
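
A concrete example of the contract (a sketch):

  // All the .first values are either 2 or -1, so conversion succeeds.
  std::vector<std::pair<int32, int32> > locations;
  locations.push_back(std::make_pair(2, 5));
  locations.push_back(std::make_pair(-1, -1));
  locations.push_back(std::make_pair(2, 7));
  int32 first_value;
  std::vector<int32> second_values;
  bool ok = ConvertToIndexes(locations, &first_value, &second_values);
  // ok == true, first_value == 2, second_values == {5, -1, 7}.
  // Had any .first value been something other than 2 or -1, the call
  // would have returned false.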

◆ CopyPairVector() [1/2]

static void kaldi::nnet3::CopyPairVector ( const CuArray< Int32Pair > &  in,
std::vector< std::pair< int32, int32 > > *  out 
)
static

Definition at line 31 of file nnet-general-component.cc.

References CuArrayBase< T >::CopyToVec().

Referenced by StatisticsExtractionComponentPrecomputedIndexes::Read(), StatisticsPoolingComponentPrecomputedIndexes::Read(), StatisticsExtractionComponentPrecomputedIndexes::Write(), and StatisticsPoolingComponentPrecomputedIndexes::Write().

32  {
33  in.CopyToVec(reinterpret_cast<std::vector<Int32Pair>*>(out));
34 }
void CopyToVec(std::vector< T > *dst) const
This function resizes *dst if needed.
Definition: cu-array-inl.h:177

◆ CopyPairVector() [2/2]

static void kaldi::nnet3::CopyPairVector ( const std::vector< std::pair< int32, int32 > > &  in,
CuArray< Int32Pair > *  out 
)
static

Definition at line 36 of file nnet-general-component.cc.

References CuArray< T >::CopyFromVec().

37  {
38  const std::vector<Int32Pair> *in_cast =
39  reinterpret_cast<const std::vector<Int32Pair>*>(&in);
40  out->CopyFromVec(*in_cast);
41 }
void CopyFromVec(const std::vector< T > &src)
This function resizes if needed.
Definition: cu-array-inl.h:120

◆ CreateComputationRequestInternal()

static void kaldi::nnet3::CreateComputationRequestInternal ( int32  begin_input_t,
int32  end_input_t,
int32  begin_output_t,
int32  end_output_t,
int32  num_sequences,
int32  frame_subsampling_factor,
const std::set< int32 > &  ivector_times,
ComputationRequest *  request 
)
static

Definition at line 113 of file nnet-compile-looped.cc.

References ComputationRequest::inputs, rnnlm::n, and ComputationRequest::outputs.

Referenced by CreateLoopedComputationRequest().

119  {
120  request->inputs.reserve(2);
121  request->inputs.clear();
122  request->inputs.resize(1 + (ivector_times.empty() ? 0 : 1));
123  request->inputs[0].name = "input";
124  request->inputs[0].has_deriv = false;
125  request->outputs.clear();
126  request->outputs.resize(1);
127  request->outputs[0].name = "output";
128  request->outputs[0].has_deriv = false;
129  if (!ivector_times.empty()) {
130  request->inputs[1].name = "ivector";
131  request->inputs[1].has_deriv = false;
132  }
133 
 134  // in the computation request the 'n' indexes (the sequence/utterance indexes)
 135  // have a larger stride than 't', although this is the opposite of the way it's
 136  // done inside the computation. This is for user convenience, since it may be
 137  // easier to deal with submatrices per sequence.
138  for (int32 n = 0; n < num_sequences; n++) {
139  int32 x = 0;
140  for (int32 t = begin_input_t; t < end_input_t; t++) {
141  request->inputs[0].indexes.push_back(Index(n, t, x));
142  }
143  for (int32 t = begin_output_t;
144  t < end_output_t;
145  t += frame_subsampling_factor)
146  request->outputs[0].indexes.push_back(Index(n, t, x));
147  }
148  if (!ivector_times.empty()) {
149  request->inputs.resize(2);
150  request->inputs[1].name = "ivector";
151  request->inputs[1].has_deriv = false;
152  for (int32 n = 0; n < num_sequences; n++) {
153  // note: std::sets store things in sorted order.
154  for (std::set<int32>::const_iterator iter = ivector_times.begin();
155  iter != ivector_times.end(); ++iter) {
156  int32 t = *iter, x = 0;
157  request->inputs[1].indexes.push_back(Index(n, t, x));
158  }
159  }
160  }
161 }
kaldi::int32 int32
struct rnnlm::@11::@12 n

◆ CreateLoopedComputationRequest()

void CreateLoopedComputationRequest ( const Nnet &  nnet,
int32  chunk_size,
int32  frame_subsampling_factor,
int32  ivector_period,
int32  left_context_begin,
int32  right_context,
int32  num_sequences,
ComputationRequest *  request1,
ComputationRequest *  request2,
ComputationRequest *  request3 
)

This function creates computation requests suitable for giving to CompileLooped().

It's intended for use with a 'simple' nnet (one satisfying IsSimpleNnet()), and this basically means that the inputs must be named "input" and possibly "ivector", and that there is an output named "output", and that those are the ones you care about (it won't generate any other outputs or use any other inputs).

If you want to use looped computation for different types of neural net, you should use the deeper interface, CompileLooped().

Parameters
[in]  nnet                       The neural net this computation request is to be used with. This is used to check whether the neural net accepts iVectors, and to work out the left-context and right-context required by the network.
[in]  chunk_size                 The number of frames of output that will be generated for each chunk (note: this is the shift in the t-index, which will not equal the number of output frames if frame_subsampling_factor != 1). Note: it is required that chunk_size be a multiple of ivector_period, frame_subsampling_factor, and nnet.Modulus(). You should use GetChunkSize() to compute the chunk size, giving it an advisory/minimum chunk size, to make sure it satisfies these properties.
[in]  frame_subsampling_factor   This will normally be 1, but may be more than 1 (e.g. 3) in chain systems; it determines the frame-skipping on the output, so we evaluate the output with 't' at multiples of this value.
[in]  ivector_period             The period with which iVectors are to be supplied to the network (if you're using iVectors). Not necessarily the same as the period with which the ivectors are extracted or stored on disk (--online-ivector-period). You will normally set this to the chunk size. It must divide the chunk size (if you're using iVectors). Note: you should call ModifyNnetIvectorPeriod on 'nnet' before calling this function; otherwise the neural net will most likely not actually be able to consume the iVector with this frequency.
[in]  left_context_begin         This should be the left-context of the network plus any additional left-context (provided via the option --extra-left-context-begin) that should be supplied to the network on top of the minimum that the network requires. We call this left_context_begin because it only relates to the start of the utterance (t=0).
[in]  right_context              This should be the right-context of the network, plus any additional right-context ("extra-right-context") that should be supplied to the network on top of the minimum that the network requires (currently extra-right-context != 0 is not supported at the command-line level).
[in]  num_sequences              The number of separate 'n' values to put in the computation; normally this will be just 1, but it can be increased to allow simultaneous operation on multiple streams of input.
[out] request1                   The first of the 3 requests that this function generates, which the user should then supply to CompileLooped(). Note: this will tend to be the largest computation request in terms of input, because we have to provide enough left and right context that it can evaluate the first chunk. Note: as elsewhere, the job of duplicating first and last frames enough to provide the required left/right context to the network is left to the caller (at runtime, not during compilation).
[out] request2                   The second of the 3 requests that this function generates. Caution: none of the inputs and outputs should overlap.
[out] request3                   The third of the 3 requests that this function generates. It will be the same as request2, except for a time offset.

Definition at line 164 of file nnet-compile-looped.cc.

References CreateComputationRequestInternal(), Nnet::InputDim(), KALDI_ASSERT, Mod(), and Nnet::Modulus().

Referenced by CreateLoopedComputationRequestSimple(), and DecodableNnetSimpleLoopedInfo::Init().

173  {
174  bool has_ivector = (nnet.InputDim("ivector") > 0);
175  KALDI_ASSERT(chunk_size % frame_subsampling_factor == 0 &&
176  chunk_size % nnet.Modulus() == 0 &&
177  chunk_size % ivector_period == 0);
178  KALDI_ASSERT(left_context_begin >= 0 && right_context >= 0);
179  // note, 'end' is one past the last one.
180  int32 chunk1_input_begin_t = - left_context_begin,
181  chunk1_input_end_t = chunk_size + right_context,
182  chunk2_input_begin_t = chunk1_input_end_t,
183  chunk2_input_end_t = chunk2_input_begin_t + chunk_size,
184  chunk3_input_begin_t = chunk2_input_end_t,
185  chunk3_input_end_t = chunk3_input_begin_t + chunk_size;
186 
187 
188  // work out the times at which i-vectors are required.
189  std::set<int32> ivector_times1, ivector_times2, ivector_times3;
190  if (has_ivector) {
191  for (int32 t = chunk1_input_begin_t; t < chunk1_input_end_t; t++) {
192  int32 ivector_t = t - Mod(t, ivector_period);
193  ivector_times1.insert(ivector_t);
194  }
195  for (int32 t = chunk2_input_begin_t; t < chunk2_input_end_t; t++) {
196  int32 ivector_t = t - Mod(t, ivector_period);
197  if (ivector_times2.count(ivector_t) == 0 &&
198  ivector_times1.count(ivector_t) == 0)
199  ivector_times2.insert(ivector_t);
200  }
201  for (int32 t = chunk3_input_begin_t; t < chunk3_input_end_t; t++) {
202  int32 ivector_t = t - Mod(t, ivector_period);
203  if (ivector_times3.count(ivector_t) == 0 &&
204  ivector_times2.count(ivector_t) == 0 &&
205  ivector_times1.count(ivector_t) == 0)
206  ivector_times3.insert(ivector_t);
207  }
208  }
209 
 210  CreateComputationRequestInternal(
 211  chunk1_input_begin_t, chunk1_input_end_t,
212  0, chunk_size,
213  num_sequences, frame_subsampling_factor,
214  ivector_times1,
215  request1);
216 
 217  CreateComputationRequestInternal(
 218  chunk2_input_begin_t, chunk2_input_end_t,
219  chunk_size, chunk_size * 2,
220  num_sequences, frame_subsampling_factor,
221  ivector_times2,
222  request2);
223 
 224  CreateComputationRequestInternal(
 225  chunk3_input_begin_t, chunk3_input_end_t,
226  chunk_size * 2, chunk_size * 3,
227  num_sequences, frame_subsampling_factor,
228  ivector_times3,
229  request3);
230 
231 }
static void CreateComputationRequestInternal(int32 begin_input_t, int32 end_input_t, int32 begin_output_t, int32 end_output_t, int32 num_sequences, int32 frame_subsampling_factor, const std::set< int32 > &ivector_times, ComputationRequest *request)
kaldi::int32 int32
I Mod(I m, I n)
Mod(m, n), defined for integers m and n where n > 0, returns the modulus m % n, defined as the intege...
#define KALDI_ASSERT(cond)
Definition: kaldi-error.h:185
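
A sketch of the end-to-end usage (the option values are illustrative, and we assume ModifyNnetIvectorPeriod has already been called on 'nnet' if iVectors are used):

  int32 chunk_size = 60, frame_subsampling_factor = 3,
      ivector_period = 60, left_context_begin = 40,
      right_context = 0, num_sequences = 1;
  ComputationRequest request1, request2, request3;
  CreateLoopedComputationRequest(nnet, chunk_size,
                                 frame_subsampling_factor,
                                 ivector_period, left_context_begin,
                                 right_context, num_sequences,
                                 &request1, &request2, &request3);
  NnetOptimizeOptions optimize_opts;
  NnetComputation computation;
  CompileLooped(nnet, optimize_opts, request1, request2, request3,
                &computation);
  // 'computation' can now be run chunk by chunk in a loop.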

◆ CreateLoopedComputationRequestSimple()

void CreateLoopedComputationRequestSimple ( const Nnet &  nnet,
int32  chunk_size,
int32