Running a simulation

pyuvsim takes a UVData object, instrument configuration settings, and a source
catalog, and effectively "fills" the UVData object with simulated data. The
function pyuvsim.run_uvsim (defined in uvsim.py) accepts as input a path to a
yaml obsparam file and writes out the data in uvfits, miriad, or uvh5 format.
Optionally, it can skip writing the data out and instead return a UVData object
filled with simulated data. The default behavior is to write to uvh5.
This is demonstrated in more detail in run_param_pyuvsim.py in the scripts directory.
See Parameter and configuration Files for information on the parameter files.
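For example, a minimal run might look like the following sketch (the obsparam
file name is a placeholder; return_uv is the keyword that skips the write and
returns the UVData object instead, as described above):

import pyuvsim

# Write the simulated data to disk (uvh5 by default).
pyuvsim.run_uvsim("obsparam_sim.yaml")

# Or skip the write and keep the filled UVData object in memory.
uv_out = pyuvsim.run_uvsim("obsparam_sim.yaml", return_uv=True)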
Under the hood, pyuvsim generates a pyuvdata.UVData object without data and then
fills it with simulated data. The function pyuvsim.run_uvdata_uvsim provides this
lower-level functionality if needed.
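If you already have a UVData object and configured beam and catalog objects, the
lower-level call looks roughly like this sketch (the variables are placeholders,
and the argument order should be checked against the run_uvdata_uvsim docstring
for your version):

import pyuvsim

# uv_in:     a pyuvdata.UVData object defining the times, frequencies,
#            and baselines to simulate (no data required)
# beam_list: the list of beam models for the array
# beam_dict: maps antenna names to entries in beam_list
# catalog:   the source catalog to simulate
uv_filled = pyuvsim.run_uvdata_uvsim(uv_in, beam_list, beam_dict, catalog)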
Using MPI

pyuvsim is parallelized using the Message Passing Interface (MPI). To take full
advantage of this, any wrapper must be run with mpirun:

# Running a parameter file job with 50 MPI processing units
> mpirun -n 50 python run_param_pyuvsim.py obsparam_filename.yaml
Further speedup is achieved through numpy/scipy internal threading. How effective
this is depends on the linear algebra library that numpy is compiled against,
which can be checked with numpy.show_config().
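For example, to check this from the command line:

> python -c "import numpy; numpy.show_config()"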
Enabling Profiling

The line_profiler module provides runtime estimates on a line-by-line basis. It
is built into pyuvsim to work within the MPI framework using the functions in
pyuvsim.profiling. To run a simulation with profiling enabled, call
profiling.set_profiler() before starting run_uvsim(). This function can be
passed a list of functions you wish to profile (by name), as well as which rank
to return data for (it will only profile one MPI rank at a time).
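A minimal sketch, assuming the func_list and rank keyword names suggested by the
description above (the function name listed is a hypothetical example; check the
set_profiler docstring for the exact signature):

from pyuvsim import profiling, run_uvsim

# Profile only MPI rank 0 for the named functions.
profiling.set_profiler(func_list=["interp"], rank=0)
run_uvsim("obsparam_sim.yaml")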
Generating Config Files from Data

The scripts uvdata_to_telescope_config.py and uvdata_to_config.py are provided
for convenience. These accept the path to any valid file readable by UVData as
input, along with output file name options, and will generate a telescope layout
file, a telescope configuration file, and an obsparam file for a simulation with
the same time, frequency, and baseline structure as the data file. Note that the
generated configuration files will still need to be given paths to beam models
and source catalogs before they can be used.
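A hedged example invocation (my_data.uvh5 is a placeholder; run each script with
--help to see the exact positional arguments and output file name options):

> python uvdata_to_telescope_config.py my_data.uvh5
> python uvdata_to_config.py my_data.uvh5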
Parallelization

Given Npus MPI processes, and Nsrcs, Nbls, Ntimes, Nfreqs sources, baselines,
times, and frequencies (respectively), the choice of splitting the various axes
goes like this:

- If Npus < Nsrcs and Npus > Nbls * Ntimes * Nfreqs:
  The source axis will be split across MPI processes.
  Each process will receive all baselines, times, and frequencies.
- Otherwise:
  a. Each process will receive all sources, and the times/frequencies/baselines
     will be split among MPI processes.
  b. If it is estimated that loading all sources on each process simultaneously
     will overrun the available memory, the sources will be split into chunks
     for processing.
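As an illustrative sketch (not pyuvsim's actual implementation), the decision
rule above can be written as:

# Returns True when the source axis is split across MPI processes,
# following the rule described in the list above.
def split_source_axis(Npus, Nsrcs, Nbls, Ntimes, Nfreqs):
    return Npus < Nsrcs and Npus > Nbls * Ntimes * Nfreqs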
In each case, the source axis is handled through numpy's threading. It is
recommended that jobs with especially large source axes be given more cores per
MPI process (in SLURM, for instance, this is set by the --cpus-per-task option
in sbatch or srun). Usually around two to four CPUs per process is sufficient.
For large numbers of times/baselines/frequencies, however, running with more MPI
processes offers a better speedup.
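A hedged sbatch sketch (the task and CPU counts are illustrative, and whether
srun or mpirun launches the MPI job depends on your cluster):

#!/bin/bash
#SBATCH --ntasks=25
#SBATCH --cpus-per-task=2

srun python run_param_pyuvsim.py obsparam_filename.yaml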
The source array is initially placed in immutable shared memory, and parts of it
are copied for use within each MPI process. Likewise, UVBeam-class beam objects
are loaded on the root process only and broadcast using shared memory. These
measures prevent large data arrays from being copied Npus times, which would
cause unacceptable memory bloat.