ecoli.library.schema
Simulation Helper Functions
This is a collection of helper functions used thoughout our code base.
- class ecoli.library.schema.MetadataArray(input_array, metadata=None)[source]
Bases:
ndarray
Subclass of Numpy array that allows for metadata to be stored with the array. Currently used to store next unique molecule index for unique molecule arrays.
- ecoli.library.schema.UNIQUE_DIVIDERS
A mapping of unique molecules to the names of their divider functions ars they are registered in the
divider_registry
inecoli/__init__.py
- class ecoli.library.schema.UniqueNumpyUpdater[source]
Bases:
object
Updates that set attributes of currently active unique molecules must be applied before any updates that add or delete molecules. If this is not enforced, in a single timestep, an update might delete a molecule and allow a subsequent update to add a new molecule in the same row. Then, an update that intends to modify an attribute of the original molecule in that row will actually corrupt the data for the new molecule.
To fix this, this unique molecule updater is a bound method with access to instance attributes that allow it to accumulate updates until given the signal to apply the accumulated updates in the proper order. The signal to apply these updates is given by a special process (
ecoli.processes.unique_update.UniqueUpdate
) that is automatically added to the simulation byecoli.composites.ecoli_master.Ecoli.generate_processes_and_steps()
Sets up instance attributes to accumulate updates.
- add_updates
List of updates that add unique molecules
- set_updates
List of updates that modify existing unique molecules
- delete_updates
List of updates that delete unique molecules
- updater(current, update)[source]
Accumulates updates in instance attributes until given signal to apply all updates in the following order:
set
,add
,delete
- Parameters:
current (MetadataArray) – Structured Numpy array for a given unique molecule
Dictionary of updates to apply that can contain any combination of the following keys:
set
: Dictionary or list of dictionariesEach key is an attribute of the given unique molecule and each value is an array. Each array contains the new attribute values for all active unique molecules in a givne timestep. Can have multiple such dictionaries in a list to apply multiple
set
updates.
add
: Dictionary or list of dictionariesEach key is an attribute of the given unique moleucle and each value is an array. The nth element of each array is the value for the corresponding attribute for the nth unique molecule to be added. If not provided, unique indices for the
unique_index
attribute are automatically generated for each new molecule. If you need to reference the unique indices of new molecules in the same process and time step in which you generated them, you MUST use theecoli.library.schema.create_unique_indices()
to generate the indices and supply them under theunique_index
key of youradd
update. Can have multiple such dictionaries in a list to apply multipleadd
updates.
delete
: List or 1D Numpy array of integers, or list of thoseList of active molecule indices to delete. Note that
current
may have rows that are marked as inactive, so deleting the 10th active molecule may not equate to deleting the value in the 10th row ofcurrent
. Can have multiple such lists in a list to apply multipledelete
updates.
update
: BooleanSpecial key that should only be included in the update of
ecoli.processes.unique_update.UniqueUpdate
. Tells updater to apply all cached updates (e.g. at the end of an “execution layer”; see Partitioning).
- Returns:
Updated unique molecule structured Numpy array.
- Return type:
- ecoli.library.schema.attrs(states, attributes)[source]
Helper function to pull out arrays for unique molecule attributes
- Parameters:
states (MetadataArray) – Structured Numpy array for all unique molecules of a given type (e.g. RNA, active RNAP, etc.)
attributes (List[str]) – List of field names (attributes) whose data should be retrieved for all active unique molecules in
states
- Returns:
List of arrays, one for each attribute. nth entry in each array corresponds to the value of that attribute for the nth active unique molecule in
states
- Return type:
- ecoli.library.schema.bulk_name_to_idx(names, bulk_names)[source]
Primarily used to retrieve indices for groups of bulk molecules (e.g. NTPs) in the first run of a process and cache for future runs
- ecoli.library.schema.bulk_numpy_updater(current, update)[source]
Updater function for bulk molecule structured array.
- Parameters:
current (ndarray) – Bulk molecule structured array
update (List[Tuple[int | ndarray, int | ndarray]]) – List of tuples
(mol_idx, add_val)
, wheremol_idx
is the index (or array of indices) for the molecule(s) to be updated andadd_val
is the count (or array of counts) to be added to the current count(s) for the specified molecule(s).
- Returns:
Updated bulk molecule structured array
- Return type:
- ecoli.library.schema.counts(states, idx)[source]
Helper function to pull out counts at given indices.
- ecoli.library.schema.create_unique_indices(n_indexes, unique_molecules)[source]
We strongly recommend letting
UniqueNumpyUpdater.updater()
generate unique indices for new unique molecules. If that is not possible, this function can be used to generate unique indices that should not conflict with any existing unique indices.- Parameters:
n_indexes (int) – Number of indexes to generate.
unique_molecules (MetadataArray) – Structured Numpy array of unique molecules.
- Returns:
List of unique indexes for new unique molecules.
- Return type:
- ecoli.library.schema.divide_RNAs_by_domain(values, state)[source]
Divider function for RNA unique molecules. Ensures that incomplete transcripts are divided in accordance with how active RNAPs are divided (which themselves depend on how chromosome domains are divided).
- Parameters:
values (MetadataArray) – Structured Numpy array of RNA unique molecule state
state (Dict[str, Any]) – View of relevant unique molecule states according to the topology under the
RNAs
key inecoli.library.schema.UNIQUE_DIVIDERS
.
- Returns:
List of two structured Numpy arrays, each containing the RNA unique molecule state of a daughter cell.
- Return type:
- ecoli.library.schema.divide_bulk(state)[source]
Divider function for bulk molecules. Automatically added to bulk molecule ports schemas by
ecoli.library.schema.numpy_schema()
whenname == 'bulk'
. Uses binomial distribution withp=0.5
to randomly partition counts.
- ecoli.library.schema.divide_by_domain(values, state)[source]
Divider function for unique molecules that are attached to the chromsome. Ensures that these molecules are divided in accordance with the way that chromosome domains are divided.
- Parameters:
values (ndarray) – Structured Numpy array of unique molecule state
state (Dict[str, Any]) – View of
full_chromosome
andchromosome_domain
state as configured under any of the unique molecules with a divider of by_domain inecoli.library.schema.UNIQUE_DIVIDERS
.
- Returns:
List of two structured Numpy arrays, each containing the unique molecule state of a daughter cell.
- Return type:
- ecoli.library.schema.divide_domains(state)[source]
Divider function for chromosome domains. Ensures that all chromosome domains associated with a full chromosome go to the same daughter cell that the full chromosome does.
- ecoli.library.schema.divide_ribosomes_by_RNA(values, state)[source]
Divider function for active ribosome unique molecules. Automatically added to ports schema by
ecoli.library.schema.numpy_schema()
whenname == 'active_ribosome'
. Ensures that ribosomes are divided the same way that their associated mRNAs are.- Parameters:
values (MetadataArray) – Structured Numpy array of active ribosome unique molecule state
state (Dict[str, Any]) – View into relevant unique molecule states according to the topology defined under the
active_ribosome
key inecoli.library.schema.UNIQUE_DIVIDERS
.
- Returns:
List of two structured Numpy arrays, each containing the active ribosome unique molecule state of a daughter cell
- Return type:
- ecoli.library.schema.divide_set_none(values)[source]
Divider function that sets both daughter cell states to
None
.
- ecoli.library.schema.empty_dict_divider(values)[source]
Divider function that sets both daughter cell states to empty dicts.
- ecoli.library.schema.follow_domain_tree(domain, domain_index, child_domains, place_holder)[source]
Recursive function that returns all the descendents of a single node in the domain tree, including itself.
- Parameters:
domain (int) – Domain index to find all descendents for
domain_index (ndarray) – Array of all domain indices
child_domains (ndarray) – Array of child domains for each index in
domain_index
place_holder (int) – Placeholder domain index (e.g. used in
child_domains
for domain indices that do not have child domains)
- Return type:
- class ecoli.library.schema.get_bulk_counts[source]
Bases:
Serializer
Serializer for bulk molecules that saves counts without IDs or masses.
- ecoli.library.schema.get_descendent_domains(root_domains, domain_index, child_domains, place_holder)[source]
Returns an array of domain indexes that are descendents of the indexes listed in root_domains, including the indexes in root_domains themselves.
- Parameters:
root_domains – List of domains to get descendents of
domain_index – Array of all domain indices for chromosome domains
child_domains – Array of child domains for each index in
domain_index
place_holder – Placeholder domain index (e.g. used in
child_domains
for domain indices that do not have any child domains)
- ecoli.library.schema.get_free_indices(result, n_objects)[source]
Find inactive rows for new molecules and expand array if needed
- Parameters:
result (MetadataArray) – Structured Numpy array for all unique molecules of a given type (e.g. RNA, active RNAP, etc.)
n_objects (int) – Number of new unique molecules to be added
- Returns:
A tuple
(result, free_idx)
.result
is the same as the input argument unlessn_objects
is greater than the number of inactive rows inresult
. In this case,result
is grown by at least 10% by concatenating new rows (all zeros).free_idx
is an array of sizen_objects
that contains the indices of rows inresult
that are inactive (_entryState
field is 0).- Return type:
- class ecoli.library.schema.get_unique_fields[source]
Bases:
Serializer
Serializer for unique molecules.
- ecoli.library.schema.listener_schema(elements)[source]
Helper function that can be used in
ports_schema
to create generic schema for a collection of listeners.- Parameters:
elements (Dict[str, Any]) – Dictionary where keys are listener names and values are the defaults for each listener. Alternatively, if the value is a tuple, assume that the first element is the default and the second is metadata that will be emitted at the beginning of a simulation when
emitter
is set todatabase
andemit_config
is set toTrue
(seeecoli.experiments.ecoli_master_sim
). This metadata can then be retrieved later to aid in interpreting listener values (seevivarium.core.emitter.data_from_database()
for sample code to query experiment configuration collection). As an example, this metadata might be an array of molecule names for a listener whose emits are arrays of counts, where the nth molecule name in the metadata corresponds to the nth value in the counts that are emitted.- Returns:
Ports schemas for all listeners in
elements
.- Return type:
- ecoli.library.schema.not_a_process(value)[source]
Returns
True
if not avivarium.core.process.Process
instance.
- ecoli.library.schema.numpy_schema(name, emit=True)[source]
Helper function used in ports schemas for bulk and unique molecules