Processor tools¶
Clustering of AMBER candidates¶
-
class
darc.processor_tools.clustering.
Clustering
(obs_config, output_dir, logger, input_queue, output_queue, ncluster, config_file='/opt/hostedtoolcache/Python/3.8.6/x64/lib/python3.8/site-packages/darc/config.yaml')¶ Bases:
multiprocessing.context.Process
Clustering and thresholding of AMBER triggers
- Parameters
obs_config (dict) – Observation settings
output_dir (str) – Output directory for data products
logger (Logger) – Processor logger object
input_queue (Queue) – Input queue for triggers
output_queue (Queue) – Output queue for clusters
ncluster (mp.Value) – 0
config_file (str) – Path to config file
-
_cluster
(triggers)¶ Execute trigger clustering
- Parameters
triggers (np.ndarray) – Input triggers, columns: DM, S/N, time, integration_step, sb
-
_load_config
()¶ Load configuration
-
run
()¶ Main loop
-
stop
()¶ Stop this thread
Filterbank reader¶
-
class
darc.processor_tools.filterbank_reader.
ARTSFilterbankReader
(fname, cb, ntab=12, config_file='/opt/hostedtoolcache/Python/3.8.6/x64/lib/python3.8/site-packages/darc/config.yaml')¶ Bases:
object
Filterbank reader for ARTS data, one file per TAB
- Parameters
fname (str) – path to filterbank files, with {cb:02d} and {tab:02d} for CB and TAB indices
cb (int) – CB index
ntab (int) – Number of TABs (Default: NTAB from constants)
config_file (str) – Path to config file, passed to SBGenerator
-
get_header
(tab=0)¶ Read filterbank parameters to self.header
- Parameters
tab (int) – TAB index (Default: 0)
-
get_sb
(sb)¶ Construct an SB. TAB data must be read before calling this method
- Parameters
sb (int) – SB index
- Returns
Spectra object with SB data
-
load_single_sb
(sb, startbin, chunksize)¶ Convenience tool to read only a single SB and its associated TABs. Note: Any internal TAB data is cleared after calling this method
- Parameters
sb (int) – SB index
startbin (int) – Index of first time sample to read
chunksize (int) – Number of time samples to read
- Returns
Spectra object with SB data
-
read_filterbank
(tab, startbin, chunksize)¶ Read a chunk of filterbank data
- Parameters
tab (int) – TAB index
startbin (int) – Index of first time sample to read
chunksize (int) – Number of time samples to read
- Returns
chunk of data with shape (nfreq, chunksize)
-
read_tabs
(startbin, chunksize, tabs=None)¶ Read TAB data
- Parameters
startbin (int) – Index of first time sample to read
chunksize (int) – Number of time samples to read
tabs (list) – which TABs to read (Default: all)
-
exception
darc.processor_tools.filterbank_reader.
ARTSFilterbankReaderError
¶ Bases:
Exception
-
class
darc.processor_tools.spectra.
Spectra
(freqs, dt, data, starttime=0, dm=0)¶ Bases:
object
A class to store spectra. This is mainly to provide reusable functionality.
Spectra constructor
- Parameters
freqs (list) – Observing frequencies for each channel.
dt (float) – Sampling time (seconds)
data (array) – A 2D numpy array containing pulsar data. Axis 0 should contain channels. (e.g. data[0,:]) Axis 1 should contain spectra. (e.g. data[:,0])
starttime (float) – Start time (in seconds) of the spectra with respect to the start of the observation. (Default: 0).
dm (float) – Dispersion measure (in pc/cm^3). (Default: 0)
- Returns
Spectra object
-
dedisperse
(dm=0, padval=0)¶ Shift channels according to the delays predicted by the given DM. * Dedispersion happens in place *
- Parameters
dm (float) – The DM (in pc/cm^3) to use.
padval (float/str) – The padding value to use when shifting channels during dedispersion. See documentation of Spectra.shift_channels. (Default: 0)
-
downsample
(factor=1, trim=True)¶ Downsample the spectra by co-adding ‘factor’ adjacent bins. * Downsampling is done in place *
- Parameters
factor (int) – Reduce the number of spectra by this factor. Must be a factor of the number of spectra if ‘trim’ is False.
trim (bool) – Trim off excess bins.
-
get_chan
(channum)¶
-
get_spectrum
(specnum)¶
-
masked
(mask, maskval='median-mid80')¶ Replace masked data with ‘maskval’. Returns a masked copy of the Spectra object.
- Parameters
mask (array) – An array of boolean values of the same size and shape as self.data. True represents an entry to be masked.
maskval (str) –
Value to use when masking. This can be a numeric value, ‘median’, ‘mean’, or ‘median-mid80’.
The values ‘median’ and ‘mean’ refer to the median and mean of the channel, respectively. The value ‘median-mid80’ refers to the median of the channel after the top and bottom 10% of the sorted channel is removed. (Default: ‘median-mid80’)
- Returns
maskedspec: A masked version of the Spectra object.
-
scaled
(indep=False)¶ Return a scaled version of the Spectra object. When scaling subtract the median from each channel, and divide by global std deviation (if indep==False), or divide by std deviation of each row (if indep==True).
- Parameters
indep (bool) – If True, scale each row independently (Default: False).
- Returns
scaled_spectra: A scaled version of the Spectra object.
-
scaled2
(indep=False)¶ Return a scaled version of the Spectra object. When scaling subtract the min from each channel, and divide by global max (if indep==False), or divide by max of each row (if indep==True).
- Parameters
indep (bool) – If True, scale each row independently (Default: False).
- Returns
scaled_spectra: A scaled version of the Spectra object.
-
shift_channels
(bins, padval=0)¶ Shift each channel to the left by the corresponding value in bins, an array. * Shifting happens in-place *
- Parameters
bins (array) – An array containing the number of bins to shift each channel by.
padval (float/str) –
Value to use when shifting near the edge of a channel. This can be a numeric value, ‘median’, ‘mean’, or ‘rotate’.
The values ‘median’ and ‘mean’ refer to the median and mean of the channel. The value ‘rotate’ takes values from one end of the channel and shifts them to the other.
-
smooth
(width=1, padval=0)¶ Smooth each channel by convolving with a top hat of given width. The height of the top had is chosen shuch that RMS=1 after smoothing. Overlap values are determined by ‘padval’. This bit of code is taken from Scott Ransom’s PRESTO’s single_pulse_search.py (line ~ 423). * Smoothing is done in place. *
- Parameters
width (int) – Number of bins to smooth by (Default: no smoothing)
padval (float/str) – Padding value to use. Possible values are float-value, ‘mean’, ‘median’, ‘wrap’. (Default: 0).
-
subband
(nsub, subdm=None, padval=0)¶ Reduce the number of channels to ‘nsub’ by subbanding. The channels within a subband are combined using the DM ‘subdm’. ‘padval’ is passed to the call to ‘Spectra.shift_channels’. * Subbanding happens in-place *
- Parameters
nsub (int) – Number of subbands. Must be a factor of the number of channels.
subdm (float) – The DM with which to combine channels within each subband (Default: don’t shift channels within each subband)
padval (float/str) – The padding value to use when shifting channels during dedispersion. See documentation of Spectra.shift_channels. (Default: 0)
-
trim
(bins=0)¶ Trim the end of the data by ‘bins’ spectra. * Trimming is done in place *
- Parameters
bins (int) – Number of spectra to trim off the end of the observation. If bins is negative trim spectra off the beginning of the observation.
-
darc.processor_tools.spectra.
delay_from_DM
(DM, freq_emitted)¶ Return the delay in seconds caused by dispersion, given a Dispersion Measure in cm-3 pc, and the emitted frequency of the pulsar in MHz.
- Parameters
DM (float) – dispersion measure
freq_emitted (float) – frequency
-
darc.processor_tools.spectra.
rotate
(arr, bins)¶ Return an array rotated by ‘bins’ places to the left
- Parameters
arr (list) – Input data
bins (int) – Number of bins to rotate by
Data extraction¶
-
class
darc.processor_tools.extractor.
Extractor
(obs_config, output_dir, logger, input_queue, output_queue, ncand_above_threshold, config_file='/opt/hostedtoolcache/Python/3.8.6/x64/lib/python3.8/site-packages/darc/config.yaml')¶ Bases:
multiprocessing.context.Process
Extract data from filterbank files
- Parameters
obs_config (dict) – Observation settings
output_dir (str) – Output directory for data products
logger (Logger) – Processor logger object
input_queue (Queue) – Input queue for clusters
output_queue (Queue) – Output queue for classifier
ncand_above_threshold (mp.Value) – 0
config_file (str) – Path to config file
-
_extract
(dm, snr, toa, downsamp, sb)¶ Execute data extraction
- Parameters
dm (float) – Dispersion Measure (pc/cc)
snr (float) – AMBER S/N
toa (float) – Arrival time at top of band (s)
downsamp (int) – Downsampling factor
sb (int) – Synthesized beam index
-
_load_config
()¶ Load configuration
-
_rficlean
()¶ Clean data of RFI
-
_store_data
(fname, sb, tsamp, dms, params_amber, params_opt)¶ Save candidate to HDF5 file
- Parameters
fname (str) – Output file
sb (int) – Synthesized beam index
tsamp (float) – Sampling time (s)
dms (Quantity) – array of DMs used in DM-time array
params_amber (tuple) – AMBER parameters: dm, snr, toa
params_opt (tuple) – Optimized parameters: snr, toa
-
init_filterbank_reader
()¶ Initialize the ARTS filterbank reader
-
run
()¶ Main loop
-
stop
()¶ Stop this thread
Neural network classification of candidates¶
-
class
darc.processor_tools.classifier.
Classifier
(logger, input_queue, conn, config_file='/opt/hostedtoolcache/Python/3.8.6/x64/lib/python3.8/site-packages/darc/config.yaml')¶ Bases:
multiprocessing.context.Process
Classify candidates from HDF5 files produced by Extractor
- Parameters
logger (Logger) – Processor logger object
input_queue (Queue) – Input queue for triggers
conn (Connection) – Pipe connection to send output to
config_file (str) – Path to config file
-
_classify
(fname)¶ Classify a candidate
- Parameters
fname (str) – Path to HDF5 file containing candidate data and metadata
-
_init_models
()¶ Load the keras models
-
_load_config
()¶ Load configuration
-
_load_tensorflow
()¶ Load tensorflow into local namespace
-
_prepare_data
()¶ Verify data shape and downsampled as needed
- Returns
success (bool)
-
run
()¶ Main loop
-
stop
()¶ Stop this thread
Candidate visualization¶
-
exception
darc.processor_tools.visualizer.
ProcessorException
¶ Bases:
Exception
-
class
darc.processor_tools.visualizer.
Visualizer
(output_dir, result_dir, logger, obs_config, files, config_file='/opt/hostedtoolcache/Python/3.8.6/x64/lib/python3.8/site-packages/darc/config.yaml')¶ Bases:
object
Visualize candidates
- Parameters
output_dir (str) – Output directory for data products
result_dir (str) – central directory to copy output PDF to
logger (Logger) – Processor logger object
obs_config (dict) – Observations settings
files (list) – HDF5 files to visualize
config_file (str) – Path to config file
-
_get_plot_order
()¶ Get the order of files to plot them in descending freq-time probability order, then by S/N if probabilities are equal
- Returns
file order (np.ndarray)
-
_load_config
()¶ Load configuration
- Returns
config (Namespace)
-
_load_data
(fname, data_type)¶ Load HDF5 data
- Parameters
fname (str) – Path to HDF5 file
data_type – which data type to get, options: freq_time, dm_time, 1d_time
- Returns
data (np.ndarray), params (dict with tsamp, dm, snr, toa, downsamp, sb, dms)
-
_visualize
()¶ Run the visualization of candidates