sconce package

Submodules

sconce.trainer module

class sconce.trainer.Trainer(*, model, training_data_generator, test_data_generator, optimizer, monitor=None, rate_controller=None)[source]

Bases: object

A class used to train PyTorch models. It defines the training loop and orchestrates the various other sconce objects (DataGenerator, Monitor, RateController, etc.).

Keyword Arguments:
 
  • model (torch.nn.Module) – the torch model to be trained. See sconce.models for examples.
  • training_data_generator (DataGenerator) – yields training inputs and targets.
  • test_data_generator (DataGenerator) – yields test inputs and targets. These are never used for back-propagation.
  • optimizer (torch.optim.Optimizer) – the torch optimizer that updates model parameters during training.
  • monitor (Monitor, optional) – the sconce monitor that records data during training. This data can be sent to external systems during training, or kept until training completes, allowing you to analyze the training run or make plots. If None, a composite monitor consisting of a StdoutMonitor and a DataframeMonitor will be created for you and used.
  • rate_controller (RateController, optional) – a sconce rate controller that adjusts the optimizer’s learning rate during training. If None, a CosineRateController with max_learning_rate=1e-4 will be created for you and used.
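
For example, constructing a Trainer might look like this (a sketch: the Linear model is a stand-in for your own module, and train_gen/test_gen are assumed to be already-built sconce DataGenerator instances, not shown here):

    import torch
    from sconce.trainer import Trainer

    # Stand-in model; substitute your own torch.nn.Module
    # (see sconce.models for examples).
    model = torch.nn.Linear(784, 10)
    optimizer = torch.optim.SGD(model.parameters(), lr=1e-4, momentum=0.9)

    # train_gen and test_gen are assumed to be sconce DataGenerator
    # instances; their construction is not shown here.
    trainer = Trainer(
        model=model,
        training_data_generator=train_gen,
        test_data_generator=test_gen,
        optimizer=optimizer,
    )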
checkpoint(filename=None)[source]

Save the model state and retain the filename for a later call to restore().

Parameters: filename (path, optional) – the filename to save the model state to.
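
A typical pattern is to snapshot the weights before an experiment and roll them back afterwards (a sketch, assuming trainer is an already-constructed Trainer):

    trainer.checkpoint()         # snapshot the current weights
    trainer.train(num_epochs=1)  # an experiment that mutates the weights
    trainer.restore()            # roll back to the checkpointed state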
load_model_state(filename)[source]

Restore model state from a file.

Parameters: filename (path) – the filename where the model’s state was saved.
multi_train(*, num_cycles, cycle_length=1, cycle_multiplier=2.0, **kwargs)[source]

Runs multiple training sessions one after another.

Parameters:
  • num_cycles (int) – [1, inf) the number of cycles to train for.
  • cycle_length (float) – (0.0, inf) the length (in epochs) of the first cycle.
  • cycle_multiplier (float) – (0.0, inf) a factor used to determine the length of a cycle. The length of a cycle is equal to the length of the previous cycle (or cycle_length if it is the first cycle) multiplied by cycle_multiplier.
Keyword Arguments:
 
  • **kwargs – passed to the underlying train() method.
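
For example (a sketch, assuming trainer is an already-constructed Trainer), the call below runs three cycles of 1, 2, and 4 epochs, 7 epochs in total:

    # Cycle lengths: 1, then 1 * 2.0 = 2, then 2 * 2.0 = 4 epochs.
    trainer.multi_train(num_cycles=3, cycle_length=1, cycle_multiplier=2.0)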

num_trainable_parameters

The number of trainable parameters that the model has.
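
This is the usual PyTorch count; for a given torch.nn.Module model, a roughly equivalent stand-alone computation (a sketch, not necessarily sconce’s exact implementation) would be:

    num_params = sum(p.numel() for p in model.parameters() if p.requires_grad)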

restore()[source]

Restore model to previously checkpointed state. See also checkpoint().

save_model_state(filename=None)[source]

Save model state to a file.

Parameters: filename (path, optional) – the filename to save the model state to. If None, a system-dependent temporary location will be chosen.
Returns: the passed-in filename, or the temporary filename chosen if None was passed in.
Return type: filename (path)
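
For example (a sketch, assuming trainer is an already-constructed Trainer; the filename is arbitrary):

    # Save to an explicit path; that same path is returned.
    path = trainer.save_model_state('model_state.pt')

    # Later, load the saved weights back into the model.
    trainer.load_model_state(path)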
survey_learning_rate(*, num_epochs=1.0, min_learning_rate=1e-12, max_learning_rate=10, monitor=None, batch_multiplier=1, rate_controller_class=<class 'sconce.rate_controllers.exponential_rate_controller.ExponentialRateController'>, stop_factor=10, **rate_controller_kwargs)[source]

Checkpoints the model, runs a learning rate survey, then restores the model to its checkpointed state.

Keyword Arguments:
 
  • num_epochs (float, optional) – (0.0, inf) the number of epochs to train the model for.
  • min_learning_rate (float, optional) – (0.0, inf) the minimum learning rate used in the survey.
  • max_learning_rate (float, optional) – (0.0, inf) the maximum learning rate used in the survey.
  • monitor (Monitor, optional) – the sconce monitor that records data during the learning rate survey. If None, a composite monitor consisting of a StdoutMonitor and a DataframeMonitor will be created for you and used.
  • batch_multiplier (int, optional) – [1, inf) determines how often parameter updates will occur during training. If greater than 1, this simulates large batch sizes without increasing memory usage. For example, if the batch size were 100 and batch_multiplier=10, the effective batch size would be 1,000, but the memory usage would be that of a batch size of 100.
  • rate_controller_class (class, optional) – the subclass of RateController to be used during this learning rate survey. In practice, only ExponentialRateController and LinearRateController are typically used.
  • stop_factor (float) – (1.0, inf) determines early stopping. If the training loss rises by more than this factor from its minimum value, the survey will stop.
  • **rate_controller_kwargs – passed to the constructor of the rate_controller_class along with min_learning_rate, max_learning_rate, and stop_factor.
Returns: the monitor used during this learning rate survey.
Return type: monitor (Monitor)
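
For example (a sketch, assuming trainer is an already-constructed Trainer):

    # Sweep learning rates over one epoch; the model is checkpointed first
    # and restored afterwards, so its weights are left unchanged.
    monitor = trainer.survey_learning_rate(
        min_learning_rate=1e-6,
        max_learning_rate=10,
    )
    # Inspect the losses recorded in `monitor` to choose a max_learning_rate
    # for a subsequent call to train().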

test(*, monitor=None)[source]

Run all samples of self.test_data_generator through the model in test (inference) mode.

Parameters: monitor (Monitor, optional) – the sconce monitor that records data during this testing. If None, a composite monitor consisting of a StdoutMonitor and a DataframeMonitor will be created for you and used.
Returns: the monitor used during this testing.
Return type: monitor (Monitor)
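
For example (a sketch, assuming trainer is an already-constructed Trainer):

    # Run every test sample through the model without updating any weights.
    monitor = trainer.test()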
train(*, num_epochs, monitor=None, rate_controller=None, test_to_train_ratio=None, batch_multiplier=1)[source]

Train the model for a given number of epochs.

Parameters:
  • num_epochs (float) – the number of epochs to train the model for.
  • monitor (Monitor, optional) – a monitor to use for this training session. If None, then self.monitor will be used.
  • rate_controller (RateController, optional) – a rate_controller to use for this training session. If None, then self.rate_controller will be used.
  • test_to_train_ratio (float, optional) – [0.0, 1.0] determines how often (relative to training samples) that test samples are run through the model during training. If None, then the relative size of the training and test datasets is used. For example, for MNIST with 60,000 training samples and 10,000 test samples, the value would be 1/6th.
  • batch_multiplier (int, optional) – [1, inf) determines how often parameter updates will occur during training. If greater than 1, this simulates large batch sizes without increasing memory usage. For example, if the batch size were 100 and batch_multiplier=10, the effective batch size would be 1,000, but the memory usage would be that of a batch size of 100.
Returns: the monitor used during training.
Return type: monitor (Monitor)
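
For example (a sketch, assuming trainer is an already-constructed Trainer):

    monitor = trainer.train(
        num_epochs=10,
        batch_multiplier=4,  # effective batch size is 4x the generator's
    )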

sconce.utils module

class sconce.utils.Progbar(target, width=30, verbose=1, interval=0.05, stateful_metrics=None)[source]

Bases: object

Displays a progress bar.

Arguments:
 
  • target – Total number of steps expected, None if unknown.
  • width – Progress bar width on screen.
  • verbose – Verbosity mode: 0 (silent), 1 (verbose), 2 (semi-verbose).
  • stateful_metrics – Iterable of string names of metrics that should not be averaged over time. Metrics in this list will be displayed as-is. All others will be averaged by the progbar before display.
  • interval – Minimum visual progress update interval (in seconds).

add(n, values=None)[source]

update(current, values=None)[source]

Updates the progress bar.

Arguments:
 
  • current – Index of current step.
  • values – List of tuples (name, value_for_last_step). If name is in stateful_metrics, value_for_last_step will be displayed as-is. Else, an average of the metric over time will be displayed.
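
A minimal usage sketch (the sleep call and the loss value are stand-ins for real work and a real metric):

    import time
    from sconce.utils import Progbar

    progbar = Progbar(target=100)
    for step in range(100):
        time.sleep(0.01)         # stand-in for real work
        loss = 1.0 / (step + 1)  # fabricated metric, for illustration only
        progbar.update(step + 1, values=[('loss', loss)])

add(n, values=None) advances the bar by n steps relative to the current position rather than to an absolute index (mirroring the Keras Progbar this class appears to be based on), so progbar.add(1, values=[('loss', loss)]) inside the loop above would be equivalent.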