Simalma (CASA 4.2): Difference between revisions

From CASA Guides
Jbraatz (talk | contribs)
No edit summary
Jbraatz (talk | contribs)
 
(9 intermediate revisions by the same user not shown)
Line 17: Line 17:


=====Set simalma as current task=====
=====Set simalma as current task=====
Reset all parameters to default, and then set the project name to ''m51''
Reset all parameters to default, then set the project name to ''m51'' and allow <tt>simalma</tt> to overwrite files
<source lang="python">
<source lang="python">
# Set simalma to default parameters
# Set simalma to default parameters
Line 23: Line 23:
# Our project name will be "m51", and all simulation products will be placed in a subdirectory "m51/"
# Our project name will be "m51", and all simulation products will be placed in a subdirectory "m51/"
project="m51"
project="m51"
overwrite=True
</source>
</source>


Line 55: Line 56:
[[Image:M51.alma_cycle1_3.skymodel.png|thumb|hexagonal mosaic overplotted on sky model]]
[[Image:M51.alma_cycle1_3.skymodel.png|thumb|hexagonal mosaic overplotted on sky model]]


We will simulate observations using the main array in "configuration number 3" from Cycle 1.  To that we will add the ACA cycle 1 array.  This configuration affords ~0.5 arcsec resolution.
We will simulate observations using the main array in "configuration number 3" from Cycle 1.  To that we will add the ACA cycle 1 array.  This configuration affords ~0.5 arcsec resolution.  A list of configuration files available in CASA 4.2 is available [[Antenna_Configurations_Models_in_CASA| here]].
<source lang="python">
<source lang="python">
antennalist=["alma_cycle1_3.cfg","aca_cycle1.cfg"]  
antennalist=["alma_cycle1_3.cfg","aca_cycle1.cfg"]  
</source>
</source>


We'll set the 12m array observing time to 2 hours:
We'll set the 12m array observing time to 30 minutes:
<source lang="python">
<source lang="python">
totaltime="7800s"  
totaltime="1800s"  
</source>
</source>


The ACA observing times are set by default as a multiple of the 12 m Array time.  The multiples are 2 and 4 for the 7 m Array and TP array, respectively.  The user can set these to different values by entering an array for the <tt>totaltime</tt> parameter.  See the online help for <tt>simalma</tt> for details.
The 7 m array observing time is set by default to be twice the 12 m Array time.  So in this instance, the 7 m array will observe for 1 hour.  The user can set this to a different value by entering the 12 m time and the 7 m time as a 2-element array in the <tt>totaltime</tt> parameter.  See the online help for <tt>simalma</tt> for details.
 
For this simulation we will also simulate "single-dish" total power observations with two antennas from the 12 m Total Power Array.  A typical observation with the Total Power Array will last 4 times the total time of the 12 m main array.
 
In CASA 4.2 simulations, the total power observations are simulated in a single track.  So the total time of a TP observation should be roughly 8 hours or less, depending on the target declination, to ensure that the target position remains above the horizon.  This limitation will be addressed in a future CASA release.  For now, you should limit simulations that include the TP Array to short total integration times.
 
<source lang="python">
tpnant = 2
tptime="7200s"
</source>


We set the precipitable water vapor to 0.6 mm to represent observations in nominal weather.  The simulation will add noise to the data based on this setting.
We set the precipitable water vapor to 0.6 mm to represent observations in nominal weather.  The simulation will add noise to the data based on this setting.
Line 92: Line 102:
</source>
</source>


=====What does simalma do?=====
=== What does simalma do? ===


[[Image:M51.alma_cycle1_3.observe.png|thumb|12m observation]]
[[Image:M51.alma_cycle1_3.observe.png|thumb|12m observation]]


The 12m array observation is simulated first -- <tt>simalma</tt> simply calls <tt>simobserve</tt> with your input parameters.
==== Generate visibilities and images for each array ====
The <tt>simobserve</tt> task generates a figure showing the elevation of the target, the antenna layout, the uv coverage, and the synthesized dirty beam:
The <tt>simalma</tt> task first calls <tt>simobserve</tt> to simulate the visibilities for each of the three array components: the 12 m Main Array, the 7 m Array, and the 12 m Total Power Array. The <tt>simobserve</tt> task generates a figure showing the elevation of the target, the antenna layout, the uv coverage, and the synthesized dirty beam.


The 12m-only visibilities are not currently imaged separately from the 7m visibilities, but this is an expected upgrade in a future releaseOne could image the generated measurement set, which will be named according to the <tt>antennalist</tt> parameter above.  In this example, it is called <tt>m51.alma_cycle1_3.noisy.ms/</tt>.
Next <tt>simalma</tt> generates an image from each of the three array components, separately.  This step is not essential to getting the final result from the combined arrays, but it provides a useful diagnostic.  The output files are named according to the <tt>antennalist</tt> parameter above.  In this example, the 12 m measurement set is called <tt>m51.alma_cycle1_3.noisy.ms/</tt>.


<p>
<p>
----


[[Image:M51.aca_cycle1.skymodel.png|thumb|ACA hex map]]
[[Image:M51.aca_cycle1.skymodel.png|thumb|ACA hex map]]
Next, with a second call to <tt>simobserve</tt>, the 7m ACA observation is simulated.  The <tt>simobserve</tt> task follows the same conventions as the ALMA Observation Preparation Tool, and sets the mosaic pointings to cover the area requested.  It takes fewer 7m pointings to cover the region than it did 12m pointings.


It is useful to know that '''simalma'' generates a version of the input sky model convolved to the ACA resolution.  In this example it is called <tt>m51.aca_cycle1.skymodel.flat.regrid.conv/</tt>.  That image can be useful to better understand the simulation results.
It is useful to know that <tt>simalma</tt> generates a version of the input sky model convolved to the ACA resolution.  In this example it is called <tt>m51.aca_cycle1.skymodel.flat.regrid.conv/</tt>.  That image can be useful to better understand the simulation results.
----


Next, <tt>simobserve</tt> is called a third time to generate the total power image.  Again according to Cycle 1 conventions, the total power map covers the same region as the main array mosaic, except an extra pointing is added around the outside edge of the map so that the total power map is larger than the interferometric mosaic.  (Total power maps usually have additional noise and artifacts at their edges).  Furthermore, a square raster pattern is used instead of the hexagonal pattern of the interferometric array maps.
Note that the total power map covers the same region as the main array mosaic, with an extra pointing position added around the outside edge of the map so that the total power map is larger than the interferometric mosaic.  (Total power maps usually have additional noise and artifacts at their edges).  Furthermore, a square raster pattern is used instead of the hexagonal pattern of the interferometric array maps.
[[Image:M51.aca.tp.skymodel.png|thumb|TP map]]
[[Image:M51.aca.tp.skymodel.png|thumb|TP map]]


----
==== Combine the UV data from the 3 components and make the final image ====
 
====Transform the visibilities back into images====


Next <tt>simalma</tt> uses '''simanalyze''' to combine the three measurement sets and create a single image. It accomplishes this in the following manner.
Next <tt>simalma</tt> uses '''simanalyze''' to combine the three measurement sets and create a single image. It accomplishes this in the following manner.
Line 124: Line 128:
[[Image:M51.concat.image.png|thumb|combined interferometric map]]
[[Image:M51.concat.image.png|thumb|combined interferometric map]]


Next it images the total power observations.  The total power image is generated using gridding tools from the ASAP package inside of CASA.  The <tt>simalma</tt> task attempts to find the optimal gridding kernel to achieve maximum sensitivity and resolution of the single dish map.  (Finding optimal parameters is an area of active investigation.)
Finally it combines the total power image with the concatenated interferometric image using the CASA task <tt>feather</tt>.
 
Finally, it uses the <tt>feather</tt> task to combine the two images.


Note, there are many ways to combine data from separate observations.  If you are dealing with real ALMA data, you may wish to discuss options with scientists at your ARC.
Note, there are many ways to combine data from separate observations.  If you are dealing with real ALMA data, you may wish to discuss options with scientists at your ARC.


===Some notes for combining data "manually"===
==Some notes for combining data "manually"==


# When combining interferometric data from different arrays "manually", it is critical to set the relative data weights properly.  Simulated data have weights=1, since the thermal noise is generated uniformly per baseline.  However, in reality the 7m baselines have lower sensitivity than the 12m baselines, and their weights must be decreased by the sensitivity ratio.  <tt>simalma</tt> uses the <tt>visweightscale</tt> parameter of <tt>concat</tt> to apply that lower weight of (7/12)**2 to the 7m visibilities.  If you wish to combine data manually, you must do this step yourself.
# When combining interferometric data from different arrays "manually", it is critical to set the relative data weights properly.  Simulated data have weights=1, since the thermal noise is generated uniformly per baseline.  However, in reality the 7m baselines have lower sensitivity than the 12m baselines, and their weights must be decreased by the sensitivity ratio.  <tt>simalma</tt> uses the <tt>visweightscale</tt> parameter of <tt>concat</tt> to apply that lower weight of (7/12)**2 to the 7m visibilities.  If you wish to combine data manually, you must do this step yourself.

Latest revision as of 15:54, 25 October 2013


This guide demonstrates how to use simalma, a CASA task that simplifies the process of simulating observations that combine data from the main 12 m array with data from the ALMA Compact Array (ACA). To learn how to create a script of the Python code on this page see Extracting scripts from these tutorials.


ALMA consists of the main array of 12m antennas plus the ACA, which includes an array of 7 m antennas and a separate set of 12 m antennas used for Total Power measurements. CASA enables users to simulate observations using any or all of these components.

Simulating Observations with the Main 12 m Array and the ACA: Manual Combination of the Data

One could simulate observations that use the main array plus the ACA by generating the data for each component separately and then "manually" combining and imaging the data. The approach is to use simobserve to generate simulated uv data sets for each component separately, and then combine and image the resulting Measurement Sets using simanalyze. This technique is general and can be used to simulate observations using multiple 12m array configurations, as well. Total power observations can be simulated either in an independent run of simobserve, or integrated with one of the interferometric simulations. Note that if you simulate total power and interferometric observations simultaneously with simobserve, they must have the same set of pointing centers and the same integration and total time. These are not realistic conditions. For example, to reduce edge effects, the Total Power antennas should observe a larger area on the sky than the main array antennas, by about 1/2 of a primary beam. So, it is generally better to generate the total power data with a separate run of simobserve.

This "manual" method of combining main-array data with ACA data is described in the ACA_Simulation_(CASA_4.2) guide.

The simalma task

The simalma task has been available in CASA since CASA version 4.1. This task takes one set of parameters describing the region of the sky to observe, and makes the appropriate calls to simobserve and simanalyze. Here we give an example showing how to use simalma.

Set simalma as current task

Reset all parameters to default, then set the project name to m51 and allow simalma to overwrite files

# Set simalma to default parameters
default("simalma")
# Our project name will be "m51", and all simulation products will be placed in a subdirectory "m51/"
project="m51"
overwrite=True
Specify sky model image

In this example, we'll use an Halpha image of M51 as the model of the sky. The curl command used below will copy a data file with the model image to our local disk and rename it.

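# Model sky = Halpha image of M51
# os is normally preloaded in a CASA session; the import makes the snippet safe in a standalone script
import os
os.system('curl http://casaguides.nrao.edu/images/3/3f/M51ha.fits.txt -f -o M51ha.fits')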
skymodel         =  "M51ha.fits"

Note that simalma will not modify your original input image. Rather, it will make a copy m51/m51.skymodel.

To make the M51 Halpha image more suitable to a sub-millimeter ALMA observation, we will modify most of the header parameters from the FITS file. We will:

  • place the source in the southern hemisphere with the indirection parameter,
  • set the pixel size to 0.1arcsec, to simulate an observation of a galaxy that is smaller in angular size than M51. (M51 itself would require a quite large mosaic, and in any case we'd like the angular scale of pixels in the input model to be significantly smaller than the synthesized beam.)
  • set the peak brightness to 0.004 Jy/pixel
  • set the center observing frequency to 330.076 GHz, and since it's a 2D image we'll set the single "channel" width to be 50 MHz. These parameters are plausible for observing a sub-mm emission line in a galaxy.
# Set model image parameters:
indirection="J2000 23h59m59.96s -34d59m59.50s"
incell="0.1arcsec"
inbright="0.004"
incenter="330.076GHz"
inwidth="50MHz"

Set up Observing Parameters, and Run
hexagonal mosaic overplotted on sky model

We will simulate observations using the main array in "configuration number 3" from Cycle 1. To that we will add the ACA Cycle 1 array. This configuration affords ~0.5 arcsec resolution. The configuration files available in CASA 4.2 are listed on the Antenna Configurations Models in CASA page.

antennalist=["alma_cycle1_3.cfg","aca_cycle1.cfg"]

We'll set the 12m array observing time to 30 minutes:

totaltime="1800s"

The 7 m array observing time is set by default to be twice the 12 m Array time. So in this instance, the 7 m array will observe for 1 hour. The user can set this to a different value by entering the 12 m time and the 7 m time as a 2-element array in the totaltime parameter. See the online help for simalma for details.

For this simulation we will also simulate "single-dish" total power observations with two antennas from the 12 m Total Power Array. A typical observation with the Total Power Array will last 4 times the total time of the 12 m main array, which here is 2 hours.

In CASA 4.2 simulations, the total power observations are simulated in a single track. So the total time of a TP observation should be roughly 8 hours or less, depending on the target declination, to ensure that the target position remains above the horizon. This limitation will be addressed in a future CASA release. For now, you should limit simulations that include the TP Array to short total integration times.

tpnant = 2
tptime="7200s"
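The relationship between the three observing times can be checked with a few lines of plain Python. This is only a sketch: the helper names (parse_seconds, aca_times) are hypothetical and are not part of CASA, and the factors of 2 and 4 and the 8 hour single-track limit are simply the values quoted in this guide.

```python
# Sketch only: derive the 7 m and Total Power times from the 12 m time,
# using the multiples quoted in this guide (2x for the 7 m Array, 4x for TP).
# parse_seconds and aca_times are hypothetical helpers, not CASA functions.

def parse_seconds(value):
    """Convert a string such as "1800s" to a number of seconds."""
    if not value.endswith("s"):
        raise ValueError("expected a time in seconds, e.g. '1800s'")
    return float(value[:-1])

def aca_times(totaltime_12m, factor_7m=2.0, factor_tp=4.0):
    """Return the observing time in seconds for each array component."""
    t12 = parse_seconds(totaltime_12m)
    return {"12m": t12, "7m": factor_7m * t12, "tp": factor_tp * t12}

times = aca_times("1800s")

# String in the form used for the tptime parameter above
tptime_string = "%ds" % times["tp"]

# Rough single-track limit for TP simulations mentioned in this guide
TP_LIMIT_S = 8 * 3600.0
tp_fits_single_track = times["tp"] <= TP_LIMIT_S

print(times, tptime_string, tp_fits_single_track)
```

If you change the 12 m time, rerunning this check shows quickly whether the corresponding TP time still fits within a single track.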

We set the precipitable water vapor to 0.6 mm to represent observations in nominal weather. The simulation will add noise to the data based on this setting.

pwv=0.6

To cover all of the galaxy according to our rescaled pixel size, we'll need a 1 arcmin mosaic, and we'll let simalma calculate the pointings for us:

mapsize="1arcmin"
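To get a feel for why a 1 arcmin map needs a mosaic at this frequency, one can estimate the primary beam of each antenna type. The sketch below assumes the common approximation FWHM ≈ 1.13 λ/D; that scale factor and the function name are assumptions for illustration, not values taken from simalma itself.

```python
import math

# Sketch only: approximate primary beam FWHM for the 12 m and 7 m antennas
# at the observing frequency used in this guide. The 1.13 scale factor is an
# assumed approximation, and primary_beam_fwhm_arcsec is a hypothetical helper.

C_M_PER_S = 299792458.0                      # speed of light
ARCSEC_PER_RAD = 180.0 / math.pi * 3600.0    # radians -> arcseconds

def primary_beam_fwhm_arcsec(freq_ghz, dish_diameter_m, scale=1.13):
    """Approximate primary beam FWHM in arcsec for a single dish."""
    wavelength_m = C_M_PER_S / (freq_ghz * 1e9)
    return scale * wavelength_m / dish_diameter_m * ARCSEC_PER_RAD

fwhm_12m = primary_beam_fwhm_arcsec(330.076, 12.0)
fwhm_7m = primary_beam_fwhm_arcsec(330.076, 7.0)

print("12 m primary beam: %.1f arcsec" % fwhm_12m)
print(" 7 m primary beam: %.1f arcsec" % fwhm_7m)
```

Both beams are smaller than the 60 arcsec map, so each array needs several pointings; the 7 m beam is larger by the ratio 12/7, so the 7 m Array needs fewer of them.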

Finally you can check the input settings and run the simulation.

inp
go

On this first pass of the execution, simalma does a "dry run" to check input parameters that might cause a problem. After the dry run execution, you can run the actual simulation.

dryrun = False
inp
go
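For scripted use, the same settings can be collected in one place and passed to the task as keyword arguments. This is a sketch under the assumption that simalma can be called as a function with the parameter names used above; the dictionary name is hypothetical. The call is guarded so that the snippet does nothing outside a CASA session.

```python
# Sketch only: all parameter values are the ones set step by step in this guide.
# simalma_params is a hypothetical name used for illustration.
simalma_params = {
    "project": "m51",
    "overwrite": True,
    "skymodel": "M51ha.fits",
    "indirection": "J2000 23h59m59.96s -34d59m59.50s",
    "incell": "0.1arcsec",
    "inbright": "0.004",
    "incenter": "330.076GHz",
    "inwidth": "50MHz",
    "antennalist": ["alma_cycle1_3.cfg", "aca_cycle1.cfg"],
    "totaltime": "1800s",
    "tpnant": 2,
    "tptime": "7200s",
    "pwv": 0.6,
    "mapsize": "1arcmin",
    "dryrun": False,
}

# Only run the task when it is actually available (i.e. inside CASA).
simalma_task = globals().get("simalma")
if simalma_task is not None:
    simalma_task(**simalma_params)
else:
    print("simalma is not defined; run this inside a CASA session")
```

Setting dryrun to True in the dictionary reproduces the first, parameter-checking pass described above.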

What does simalma do?

12m observation

Generate visibilities and images for each array

The simalma task first calls simobserve to simulate the visibilities for each of the three array components: the 12 m Main Array, the 7 m Array, and the 12 m Total Power Array. The simobserve task generates a figure showing the elevation of the target, the antenna layout, the uv coverage, and the synthesized dirty beam.

Next simalma generates an image from each of the three array components, separately. This step is not essential to getting the final result from the combined arrays, but it provides a useful diagnostic. The output files are named according to the antennalist parameter above. In this example, the 12 m measurement set is called m51.alma_cycle1_3.noisy.ms/.

ACA hex map

It is useful to know that simalma generates a version of the input sky model convolved to the ACA resolution. In this example it is called m51.aca_cycle1.skymodel.flat.regrid.conv/. That image can be useful to better understand the simulation results.

Note that the total power map covers the same region as the main array mosaic, with an extra pointing position added around the outside edge of the map so that the total power map is larger than the interferometric mosaic. (Total power maps usually have additional noise and artifacts at their edges). Furthermore, a square raster pattern is used instead of the hexagonal pattern of the interferometric array maps.

TP map

Combine the UV data from the 3 components and make the final image

Next simalma uses simanalyze to combine the three measurement sets and create a single image. It accomplishes this in the following manner.

First, simalma concatenates the two sets of interferometric visibilities, and images them. Diagnostic graphics with "concat" in their names are generated:

combined interferometric map

Finally, it combines the total power image with the concatenated interferometric image using the CASA task feather.

Note, there are many ways to combine data from separate observations. If you are dealing with real ALMA data, you may wish to discuss options with scientists at your ARC.

Some notes for combining data "manually"

  1. When combining interferometric data from different arrays "manually", it is critical to set the relative data weights properly. Simulated data have weights=1, since the thermal noise is generated uniformly per baseline. However, in reality the 7m baselines have lower sensitivity than the 12m baselines, and their weights must be decreased by the sensitivity ratio. simalma uses the visweightscale parameter of concat to apply that lower weight of (7/12)**2 to the 7m visibilities. If you wish to combine data manually, you must do this step yourself.
  2. When combining the single dish and interferometric maps in the image plane using the feather task, one must use the interferometric map without the primary beam correction, and first multiply the total power map by the interferometric sensitivity image (".flux"). This ensures that noise effects are properly handled on the edges of each map. After running feather, the output is masked to 0.2 times the interferometric primary beam. The total power map was created larger than the interferometric map on purpose, so the edges of the combined image do not contain any interferometric information.
combined maps
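The relative weight in note 1 is easy to compute by hand. The sketch below only evaluates the (7/12)**2 factor quoted above; the helper name and the measurement set names in the commented concat call are assumptions for illustration and should be replaced with your own files.

```python
# Sketch only: relative weight of 7 m visibilities with respect to 12 m ones,
# using the (7/12)**2 factor quoted in note 1. relative_weight is a
# hypothetical helper, not a CASA function.

def relative_weight(dish_small_m=7.0, dish_large_m=12.0):
    """Weight scale factor for the smaller dishes relative to the larger ones."""
    return (dish_small_m / dish_large_m) ** 2

weight_7m = relative_weight()

# One scale factor per input measurement set, in the same order as the inputs
visweightscale = [1.0, weight_7m]
print(visweightscale)

# Inside CASA, the list would be passed to concat roughly as follows
# (file names here are assumed, not taken from an actual run):
# concat(vis=["m51.alma_cycle1_3.noisy.ms", "m51.aca_cycle1.noisy.ms"],
#        concatvis="m51.concat.ms",
#        visweightscale=visweightscale)
```

The 12 m data keep a scale of 1.0, so only the 7 m weights are reduced.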