API¶

cartgen is module with models which could be used with sklearn library.

This program is free software: you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation, either version 3 of the License, or (at your option) any later version.

This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.

You should have received a copy of the GNU General Public License along with this program. If not, see <https://www.gnu.org/licenses/>.

class cartgen.CartGenModel(metric_to_minimize=None, n_generations=20, samples_in_gen=50, elitarity_n=5, mutation_points=3, tqdm=None, n_inputs=None, n_outputs=None, depth=None, n_rows=1, basis_funcs=None, recurse_depth=5, arity=None, full_mutate_prob=0.0, seed=None, cgf=None)[source]¶

CartGenModel is a class with model which could process any ML task (regression, classification, multiclass, etc). It utilizes sklearn interface for usage. It consists of simple generations-based optimizer for CartesianGenomeFunc and any custom metric.

Parameters

metric_to_minimize (callable) – metric function with signature analogous to sklearn (see sklearn metrics) to minimize
n_generations (int) – number of generations to evolve
samples_in_gen (int) – number of samples in each generation for each elitary sample
elitarity_n (int) – number of elite best samples to save
mutation_points (int) – number of points to mutate in each elitary sample on new samples acquisition
tqdm (callable) – tqdm function with signature: lambda x: x . Use tqdm or tqdm_notebook from tqdm package
n_inputs (int) – number on inputs
n_outputs (int) – number of outputs
depth (int) – depth of genome func representation
n_rows (int) – number of functions on each layer of depth
recurse_depth (int) – depth of previous layers allowed to transmit inputs to each next level
arity (int) – arity of basis functions, if not set then would be determined automatically on given basis
seed (int) – random seed for random operations (init_random_genome and such)
full_mutate_prob (float) – probability of all possible mutation occurs for some individual
basis_funcs (list) – list of callable, basis functions for genome func representations
cgf (CartesianGenomeFunc) – function to use as cgf if you don’t want to create one

Examples

import numpy as np
from cartgen import CartGenModel
from tqdm import tqdm_notebook
from sklearn.datasets import load_digits
from sklearn.metrics import mean_absolute_error
from sklearn.model_selection import train_test_split
from sklearn.utils import shuffle
from sklearn.preprocessing import StandardScaler

dataset = load_digits()
data,target = dataset['data'],dataset['target']

data,target = shuffle(data,target,random_state=1)
data = StandardScaler().fit_transform(data)

X_train,X_test,y_train,y_test = train_test_split(data,target,test_size=0.33,random_state=42)

def sqrt(x):
    return np.sqrt(np.abs(x))

def log(x):
    return np.log(np.abs(x+0.00001))

def summ(x,y):
    return x+y

def diff(x,y):
    return x-y

def div(x,y):
    return x/(y+0.1)

def neg(x):
    return -x

def mult(x,y):
    return x*y

def div_2(x):
    return x/2

def mult_3(x):
    return x*3

def abss(x):
    return np.abs(x)


basis = [sqrt,log,neg,summ,mult,div,abss,div_2,mult_3,diff]

model = CartGenModel(metric_to_minimize=mean_absolute_error,
                     n_generations=150,
                    samples_in_gen=50,
                    mutation_points=3,
                    elitarity_n=9,
                    tqdm=tqdm_notebook,
                    n_inputs=X_train.shape[1],
                    n_outputs=1,
                    depth=36,
                    recurse_depth=9,
                    basis_funcs=basis,
                    seed=9,
                    n_rows=1)

model.fit(X_train,y_train)

test_preds = model.predict(X_test)
print(mean_absolute_error(test_preds,y_test))

from sklearn.ensemble import BaggingRegressor

bclf = BaggingClassifier(base_estimator=model, n_estimators=10, max_samples=1.0, max_features=1.0, bootstrap=True,
                               bootstrap_features=False, oob_score=False,
                               warm_start=False, n_jobs=None, random_state=None, verbose=0)

bclf.fit(X_train,y_train)

test_preds = bclf.predict(X_test)
print(mean_absolute_error(test_preds,y_test))

fit(X, y)[source]¶

Fit X and y: run genetic evolution for some generations and acquire best learned CGF

Parameters

X (numpy.array) – numpy array matrix with features to learn
y (numpy.array) – numpy array matrix with target to learn

Returns

learned model with best learned self._cgf

Return type

CartGenModel

get_params(deep=False)[source]¶

Get parameters of fitted estimator (sklearn interface here: https://scikit-learn.org/stable/developers/develop.html#cloning)

Parameters: deep (bool) – sklearn parameter stub
Returns: dict with parameters of estimator

predict(X)[source]¶

Predict X by running best fitted CGF function

Parameters

X (numpy.array) – numpy array matrix with features to learn
y (numpy.array) – numpy array matrix with target to learn

Returns

learned model with best learned self._cgf

Return type

CartGenModel

set_params(**params)[source]¶

Set parameters of fitted estimator (sklearn interface here: https://scikit-learn.org/stable/developers/develop.html#cloning)

Parameters: params (kwargs) – parameters kwargs
Returns: model with parameters from kwargs
Return type: CartGenModel

cartesian_genome_func is module with cartesian genome functions representation. Currently consists of pure python 3.x non-optimized CartesianGenomeFunc.

This program is free software: you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation, either version 3 of the License, or (at your option) any later version.

This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.

You should have received a copy of the GNU General Public License along with this program. If not, see <https://www.gnu.org/licenses/>.

class cartesian_genetics_base.cartesian_genome_func.CartesianGenomeFunc(n_inputs=None, n_outputs=None, depth=None, n_rows=None, basis_funcs=None, recurse_depth=1, arity=None, seed=None)[source]¶

CartesianGenomeFunc class is simple and naive CGP function implementation (https://en.wikipedia.org/wiki/Cartesian_genetic_programming). It is still not optimized, goes front to back, propagate through all nodes to calculate result

Parameters

n_inputs (int) – number on inputs
n_outputs (int) – number of outputs
depth (int) – depth of genome func representation
n_rows (int) – number of functions on each layer of depth
recurse_depth (int) – depth of previous layers allowed to transmit inputs to each next level
arity (int) – arity of basis functions, if not set then would be determined automatically on given basis
seed (int) – random seed for random operations (init_random_genome and such)
basis_funcs (list) – list of callable, basis functions for genome func representations

call(input_vals)[source]¶

Call genome function with input vals

Parameters: input_vals (list) – list of input arguments (arguments type depends on basis functions)
Returns: output values from output layer
Return type: list

get_genome()[source]¶

Get current genome representation

Returns: list of floats with current genome
Return type: list

get_int_genome()[source]¶

Get current genome representation as int type, could be used to well-build optimisation. Todo in 0.0.2 version

Returns: list of ints with current genome
Return type: list

init_random_genome()[source]¶

Inits random genome with uniform distribution

Returns: inits random genome inplace, doesn’t return anything
Return type: None

set_basis(new_basis)[source]¶

Set basis functions to this genome function

Parameters: new_basis (list) – list of callables for use as basis functions
Returns: nothing to return
Return type: None

set_genome(new_genome)[source]¶

Validate and set genome using current basis

Parameters: new_genome – list of floats
Returns: inplace operation, returns None
Return type: None

API¶

Previous topic

This Page