Simalma (CASA 4.2)
This guide demonstrates how to use simalma, a CASA task that simplifies the process of simulating observations that combine data from the main 12 m array with data from the ALMA Compact Array (ACA). To learn how to create a script of the Python code on this page see Extracting scripts from these tutorials.
ALMA consists of the main array of 12m antennas plus the ACA, which includes an array of 7 m antennas and a separate set of 12 m antennas used for Total Power measurements. CASA enables users to simulate observations using any or all of these components.
Simulating Observations with the Main 12 m Array and the ACA: Manual Combination of the Data
One could simulate observations that use the main array plus the ACA by generating the data for each component separately and then "manually" combining and imaging the data. The approach is to use simobserve to generate simulated uv data sets for each component separately, and then combine and image the resulting Measurement Sets using simanalyze. This technique is general and can be used to simulate observations using multiple 12m array configurations, as well. Total power observations can be simulated either in an independent run of simobserve, or integrated with one of the interferometric simulations. Note that if you simulate total power and interferometric observations simultaneously with simobserve, they must have the same set of pointing centers and the same integration and total time. These are not realistic conditions. For example, to reduce edge effects, the Total Power antennas should observe a larger area on the sky than the main array antennas, by about 1/2 of a primary beam. So, it is generally better to generate the total power data with a separate run of simobserve.
This "manual" method of combining main-array data with ACA data is described in the ACA_Simulation_(CASA_4.2) guide.
The simalma task
The simalma task has been available in CASA since CASA version 4.1. This task takes one set of parameters describing the region of the sky to observe, and makes the appropriate calls to simobserve and simanalyze. Here we give an example showing how to use simalma.
Set simalma as current task
Reset all parameters to default, and then set the project name to m51
# Set simalma to default parameters
default("simalma")
# Our project name will be "m51", and all simulation products will be placed in a subdirectory "m51/"
project="m51"
Specify sky model image
In this example, we'll use an Halpha image of M51 as the model of the sky. The curl command used below will copy a data file with the model image to our local disk and rename it.
# Model sky = Halpha image of M51
os.system('curl http://casaguides.nrao.edu/images/3/3f/M51ha.fits.txt -f -o M51ha.fits')
skymodel = "M51ha.fits"
Note that simalma will not modify your original input image. Rather, it will make a copy m51/m51.skymodel.
To make the M51 Halpha image more suitable to a sub-millimeter ALMA observation, we will respecify most of the header parameters from the FITS file. We will:
- place the source in the southern hemisphere with the indirection parameter,
- set the pixel size to 0.1arcsec, to simulate an observation of a galaxy that is smaller in angular size than M51. (M51 itself would require a quite large mosaic, and in any case we'd like the angular scale of pixels in the input model to be significantly smaller than the synthesized beam.)
- set the peak brightness to 0.004 Jy/pixel
- set the center observing frequency to 330GHz, and since it's a 2D image we'll set the single "channel" width to be 50MHz. These parameters are plausible for observing a sub-mm emission line in a galaxy.
# Set model image parameters:
indirection="J2000 23h59m59.96s -34d59m59.50s"
incell="0.1arcsec"
inbright="0.004"
incenter="330.076GHz"
inwidth="50MHz"
Set up Observing Parameters, and Run
We will simulate observations using the main array in "configuration number 3" from Cycle 1. This configuration affords ~0.5 arcsec resolution.
antennalist="alma_cycle1_3.cfg"
We'll set the 12m array observing time to 2 hours:
totaltime="7800s"
Following the Cycle 1 convention, we will instruct simalma to observe 3 times longer with the 7m array and total power dishes.
acaratio=3.0
acaconfig="aca_cycle1.cfg"
We set the precipitable water vapor to 0.6 mm to represent observations in nominal weather. The simulation will add noise to the data based on this setting.
pwv=0.6
To cover all of the galaxy according to our rescaled pixel size, we'll need a 1 arcmin mosaic, and we'll let simalma calculate the pointings for us:
mapsize="1arcmin"
Finally you can check the input settings and run the simulation.
inp
go
What does simalma do?
The 12m array observation is simulated first -- simalma simply calls simobserve with your input parameters. The simobserve task generates a figure showing the elevation of the target, the antenna layout, the uv coverage, and the synthesized dirty beam:
The 12m-only visibilities are not currently imaged separately from the 7m visibilities, but this is an expected upgrade in a future release. One could image the generated measurement set, which will be named according to the antennalist parameter above. In this example, it is called m51.alma_cycle1_3.noisy.ms/.
Next, with a second call to simobserve, the 7m ACA observation is simulated. The simobserve task follows the same conventions as the ALMA Observation Preparation Tool, and sets the mosaic pointings to cover the area requested. It takes fewer 7m pointings to cover the region than it did 12m pointings.
It is useful to know that 'simalma generates a version of the input sky model convolved to the ACA resolution. In this example it is called m51.aca_cycle1.skymodel.flat.regrid.conv/. That image can be useful to better understand the simulation results.
Next, simobserve is called a third time to generate the total power image. Again according to Cycle 1 conventions, the total power map covers the same region as the main array mosaic, except an extra pointing is added around the outside edge of the map so that the total power map is larger than the interferometric mosaic. (Total power maps usually have additional noise and artifacts at their edges). Furthermore, a square raster pattern is used instead of the hexagonal pattern of the interferometric array maps.
Transform the visibilities back into images
Next simalma uses simanalyze to combine the three measurement sets and create a single image. It accomplishes this in the following manner.
First, simalma concatenates the two sets of interferometric visibilities, and images them. Diagnostic graphics with "concat" in their names are generated:
Next it images the total power observations. The total power image is generated using gridding tools from the ASAP package inside of CASA. The simalma task attempts to find the optimal gridding kernel to achieve maximum sensitivity and resolution of the single dish map. (Finding optimal parameters is an area of active investigation.)
Finally, it uses the feather task to combine the two images.
Note, there are many ways to combine data from separate observations. If you are dealing with real ALMA data, you may wish to discuss options with scientists at your ARC.
When combining interferometric data from different arrays "manually", it is critical to set the relative data weights properly. Simulated data have weights=1, since the thermal noise is generated uniformly per baseline. However, in reality the 7m baselines have lower sensitivity than the 12m baselines, and their weights must be decreased by the sensitivity ratio. simalma uses the visweightscale parameter of concat to apply that lower weight of (7/12)**2 to the 7m visibilities. If you wish to combine data manually, you must do this step yourself.
A further note for those combining data by hand: When combining the single dish and interferometric maps in the image plane using the feather task, one must use the interferometric map without the primary beam correction, and first multiply the total power map by the interferometric sensitivity image (".flux") -- this ensures that noise effects are properly handled on the edges of each map. After running feather, the output is masked to 0.2 times the interferometric primary beam, since the total power map was created larger than the interferometric map on purpose, so the edges of the combined image do not contain any interferometric information.: