Currently, this computes the approximation of negentropy, which is the objective function to maximize.
To understand this, let w be a single row vector of W, let x be a single data vector, and let v be a standard normal random variable. To find this one independent component, we maximize
J(w^T x) \approx ( Expec[G(w^T x)] - Expec[G(v)] )^2,
where G is the function selected by opts.G_function. As long as the matrix W (capital "W") is orthogonal, which we do enforce, w^T x satisfies the requirement that its variance be one. To extend this to the whole matrix W, take the sum over all of its rows, so the problem is: maximize{ \sum_w J(w^T x) }.
In practice, batchSize should be much greater than one, so "data" consists of many columns. Denoting the data matrix as X, we obtain the expectations by taking sample means. In other words, we take the previously stored "user" matrix, W*X, apply the function G to it elementwise, and THEN take the mean of each row, i.e. mean(G(W*X),2). The per-row mean gives what we want, since each row applies the same row of W to the different x (column) vectors in our data.
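For concreteness, here is a minimal plain-Scala sketch of this batch estimate, using the logcosh G; it is illustrative only (plain arrays rather than BIDMach matrices, and Expec[G(v)] estimated by Monte Carlo rather than a precomputed constant):

    import scala.util.Random

    object NegentropyApprox {
      // G for the logcosh contrast: G(u) = log(cosh(u))
      def G(u: Double): Double = math.log(math.cosh(u))

      // Approximates sum_j ( Expec[G(w_j^T x)] - Expec[G(v)] )^2 over a batch.
      // W is n x n (rows are the w_j); X is n x batchSize (columns are samples).
      def negentropy(W: Array[Array[Double]], X: Array[Array[Double]]): Double = {
        val n = W.length
        val m = X(0).length
        // Expec[G(v)] for v ~ N(0,1), estimated here by simple Monte Carlo.
        val rng = new Random(0)
        val EGv = (1 to 100000).map(_ => G(rng.nextGaussian())).sum / 100000.0
        var total = 0.0
        for (j <- 0 until n) {
          // sample mean of G(w_j^T x^(i)) over the columns of X
          var meanG = 0.0
          for (i <- 0 until m) {
            var dot = 0.0
            for (k <- 0 until n) dot += W(j)(k) * X(k)(i)
            meanG += G(dot)
          }
          meanG /= m
          total += (meanG - EGv) * (meanG - EGv)
        }
        total
      }
    }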
An n x batchSize matrix, where each column corresponds to a data sample.
An intermediate matrix that stores the w_j^T * x^(i) values.
The current pass through the data.
This performs the matrix fixed point update to the estimated W = A^{-1}:
W^+ = W + diag(alpha_i) * [ diag(beta_i) - Expec[g(Wx)*(Wx)^T] ] * W,
where g = G', beta_i = -Expec[(Wx)_i * g(Wx)_i], and alpha_i = -1/(beta_i - Expec[g'(Wx)_i]). We need to be careful to take expectations of the appropriate terms. The gwtx and g_wtx matrices hold useful intermediate values computed over the full data matrix X rather than a single column/element x. The above update for W^+ goes in updatemats(0), except for the additive W term, since that is handled by the ADAGrad updater.
I don't THINK anything here changes if the data is not white, since one of Hyvärinen's papers implied that the update here includes an approximation to the inverse covariance matrix.
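For reference, a plain-Scala sketch of this fixed point step is below. It mirrors the formula above with the logcosh choice g = tanh and g' = 1 - tanh^2; the array-based matrix helpers are illustrative, not BIDMach's actual matrix code:

    object FixedPointUpdate {
      type Mat = Array[Array[Double]]

      // naive dense matrix product
      def matmul(a: Mat, b: Mat): Mat = {
        val (n, k, m) = (a.length, b.length, b(0).length)
        Array.tabulate(n, m)((i, j) => (0 until k).map(p => a(i)(p) * b(p)(j)).sum)
      }

      def g(u: Double): Double  = math.tanh(u)                       // g = G' for logcosh
      def gp(u: Double): Double = 1.0 - math.tanh(u) * math.tanh(u)  // g'

      // One update W^+ = W + diag(alpha) * ( diag(beta) - Expec[g(Y)*Y^T] ) * W, with Y = W*X.
      def update(w: Mat, x: Mat): Mat = {
        val n = w.length
        val m = x(0).length
        val y = matmul(w, x)                      // n x batchSize; row j holds the w_j^T x^(i) values
        // Expec[g(Y)*Y^T]: average of g(y^(i)) * (y^(i))^T over the batch (an n x n matrix)
        val EgyyT = Array.tabulate(n, n)((a, b) =>
          (0 until m).map(i => g(y(a)(i)) * y(b)(i)).sum / m)
        // beta_i = -Expec[y_i * g(y_i)],  alpha_i = -1 / (beta_i - Expec[g'(y_i)])
        val beta  = Array.tabulate(n)(i => -((0 until m).map(c => y(i)(c) * g(y(i)(c))).sum / m))
        val alpha = Array.tabulate(n)(i => -1.0 / (beta(i) - (0 until m).map(c => gp(y(i)(c))).sum / m))
        // bracket = diag(beta) - Expec[g(Y)*Y^T], then delta = bracket * W
        val bracket = Array.tabulate(n, n)((a, b) => (if (a == b) beta(a) else 0.0) - EgyyT(a)(b))
        val delta = matmul(bracket, w)
        Array.tabulate(n, n)((a, b) => w(a)(b) + alpha(a) * delta(a)(b))  // W + diag(alpha) * delta
      }
    }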
An n x batchSize matrix, where each column corresponds to a data sample.
An intermediate matrix that stores the w_j^T * x^(i) values.
The current pass through the data.
Store data in "user" for use in the next mupdate() call, and updates the moving average if necessary.
Store data in "user" for use in the next mupdate() call, and updates the moving average if necessary. Also "orthogonalizes" the model matrix after each update, as required by the algorithm.
First, it checks if this is the first pass over the data, and if so, updates the moving average assuming that the number of data samples in each block is the same for all blocks. After the first pass, the data mean vector is fixed in modelmats(1). Then the data gets centered via: "data ~ data - modelmats(1)".
We also use "user ~ mm * data" to store all (w_jT) * (x{i}) values, where w_jT is the jth row of our estimated W = A{-1}, and x{i} is the i^{th} sample in this block of data. These values are later used as part of fixed point updates.
An n x batchSize matrix, where each column corresponds to a data sample.
An intermediate matrix that stores the w_j^T * x^(i) values.
The current pass through the data.
Independent Component Analysis, using FastICA. It has the ability to center and whiten data. It is based on the method presented in:
A. Hyvärinen and E. Oja. Independent Component Analysis: Algorithms and Applications. Neural Networks, 13(4-5):411-430, 2000.
In particular, we provide the logcosh, exponential, and kurtosis "G" functions.
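For reference, the usual forms of these contrast functions and their derivatives, as given in the paper above, are sketched below in plain Scala (the exact scalings and constants used by this implementation may differ):

    object ContrastFunctions {
      // logcosh:     G(u) = log(cosh(u)),      g(u) = G'(u) = tanh(u)
      def logcoshG(u: Double): Double = math.log(math.cosh(u))
      def logcoshg(u: Double): Double = math.tanh(u)

      // exponential: G(u) = -exp(-u^2 / 2),    g(u) = u * exp(-u^2 / 2)
      def expG(u: Double): Double = -math.exp(-u * u / 2)
      def expg(u: Double): Double = u * math.exp(-u * u / 2)

      // kurtosis:    G(u) = u^4 / 4,           g(u) = u^3
      def kurtG(u: Double): Double = u * u * u * u / 4
      def kurtg(u: Double): Double = u * u * u
    }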
This algorithm computes the following modelmats array:
> modelmats(0) stores the inverse of the mixing matrix. If X = A*S represents the data, then it's the estimated A^{-1}, which we assume is square and invertible for now.
> modelmats(1) stores the mean vector of the data, which is computed entirely on the first pass. This means once we estimate A^{-1} in modelmats(0), we need to first shift the data by this amount, and then multiply to recover the (centered) sources. Example:

    modelmats(0) * (data - modelmats(1))
Here, data is an n x N matrix, whereas modelmats(1) is an n x 1 matrix. For efficiency reasons, we assume a constant batch size for each block of data when we take the mean across all batches. This holds for every batch except (usually) the last one, which is almost never enough to make a difference.
Thus, modelmats(1) helps to center the data. The whitening in this algorithm happens during the updates to W, in both the orthogonalization and the fixed point steps. The former uses the computed covariance matrix, and the latter relies on W^T*W approximating the inverse covariance matrix. It is fine if the data is already pre-whitened before being passed to BIDMach.
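As a concrete (toy) illustration of the recovery expression above, here is a small plain-Scala sketch that uses a known 2 x 2 mixing matrix and its exact inverse in place of the learned modelmats(0) and modelmats(1); it only demonstrates the algebra, not the BIDMach API:

    object RecoverSources {
      type Mat = Array[Array[Double]]

      def matmul(a: Mat, b: Mat): Mat = {
        val (n, k, m) = (a.length, b.length, b(0).length)
        Array.tabulate(n, m)((i, j) => (0 until k).map(p => a(i)(p) * b(p)(j)).sum)
      }

      def main(args: Array[String]): Unit = {
        val A    = Array(Array(2.0, 1.0), Array(1.0, 1.0))             // mixing matrix
        val Ainv = Array(Array(1.0, -1.0), Array(-1.0, 2.0))           // its inverse (stands in for modelmats(0))
        val S    = Array(Array(1.0, -1.0, 2.0), Array(0.5, 0.0, -0.5)) // 2 sources, 3 samples
        val X    = matmul(A, S)                                        // observed data, X = A*S

        // column mean of X (stands in for modelmats(1))
        val m  = X(0).length
        val mu = X.map(row => row.sum / m)

        // centered recovery: modelmats(0) * (data - modelmats(1))
        val centered  = Array.tabulate(X.length, m)((r, c) => X(r)(c) - mu(r))
        val recovered = matmul(Ainv, centered)  // equals S with its own column mean subtracted
        recovered.foreach(row => println(row.mkString("  ")))
      }
    }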
Currently, we are thinking about the following extensions:
> Allowing ICA to handle non-square mixing matrices. Most research about ICA assumes that A is n x n.
> Improving the way we handle the computation of the mean, so it doesn't rely on the last batch being of similar size to all prior batches. Again, this is minor, especially for large data sets.
> Thinking of ways to make this scale better to a large variety of datasets.
For additional references, see Aapo Hyvärinen's other papers, and visit: http://research.ics.aalto.fi/ica/fastica/