Loading Data: Difference between revisions

== Obtaining data from the archive ==


EVLA data are available from the [https://archive.nrao.edu/archive/advquery.jsp NRAO Science Data Archive].  Details and updates regarding the archive can be found on the [http://science.nrao.edu/evla/archive/evla/ EVLA/VLA/VLBA Data Archive web page].  Here, we choose to download a publicly-available observation. If you are downloading proprietary data you will need to either sign into your NRAO account (using the link at the top of the archive page) or obtain the Project Access Key from the [mailto:analysts@aoc.nrao.edu NRAO data analysts].
EVLA data are available from the [https://archive.nrao.edu/archive/advquery.jsp NRAO Science Data Archive].  Details and updates regarding the archive can be found on the [http://science.nrao.edu/evla/archive/evla/ EVLA/VLA/VLBA Data Archive web page].  Here, we choose to download a publicly available observation. <b>If you are downloading proprietary data</b>, you will need to either sign in to your NRAO account (using the link at the top of the archive page) or obtain the Project Access Key from the [mailto:analysts@aoc.nrao.edu NRAO data analysts].


We want to find data associated with the project "TVER0002", so enter this into the "Project Code" under the "General Search Parameters" and submit the query.  This will find two archive files; we will download the first by clicking on the checkbox next to the file name "TVER0002_sb2557689_1.55517.018916574074", entering an email address, and selecting "Create tar file".  Note that this last step makes data retrieval substantially easier, since the data will comprise multiple files within a directory if no tar bundle is requested.
We want to find data associated with the project "TVER0002", so enter this into the "Project Code" under the "General Search Parameters" and submit the query.  This will find two archive files; we will be downloading the first of these, so click on the checkbox next to the file name "TVER0002_sb2557689_1.55517.018916574074".


=== Selecting a set of scans ===
Although this isn't necessary, we will also select only a subset of the scans in this observation.  You may wish to do this with your own data if you only want to retrieve data for a specific source or receiver band.  To figure out which scans you want, you can either click on the "Scans" link in the "View Scans" column or, for a long or particularly complex observation, download the SDM tables, which contain metadata about the observation but are not very large, and inspect them with the task {{listsdm}}.  For demonstration purposes, we will use the latter method.
To do this,
* Fill in your email address and the preferred download location;
* For the data download format, choose "SDM tables only (no visibilities)";
* Click on "Get My Data."
<b>Ignore the file size.</b>  In this case, it's listed as 21.76GB, which is the entire dataset; the SDM tables are in fact only ** MB.
Once the download is complete, start CASA and run {{listsdm}}:
<source lang="python">
# In CASA
listsdm('TVER0002.sb2568947.eb2579996.55518.22356400463')
</source>
This gives the output:
<pre>
</pre>


=== Choosing to average data ===


=== Retrieval process ===
Complete the request by entering an email address and selecting "Create tar file".  Note that this last step makes data retrieval substantially easier, since the data will comprise multiple files within a directory if no tar bundle is requested.


When the archive process is complete, an email notification is sent out with information about the download directory.  Copy the data to a convenient location, and unpack the tar file by typing "<tt>tar xvf TVER0002_sb2557689_1.55517.018916574074.tar</tt>".  This will create the SDM data directory, but will retain the original tar file -- to conserve disk space, you will probably wish to delete the tar file.

Revision as of 13:36, 19 April 2012


Overview

This tutorial details the process of obtaining EVLA data from the archive in measurement set (MS) format. (A similar process for importing EVLA data into AIPS can be found here.) In addition, it describes the implications of data averaging, so that one may make an informed decision about whether -- and how -- to perform frequency or time averaging to reduce the size of the dataset, and covers the initial flags that are generally applied to the data.

Obtaining data from the archive

EVLA data are available from the NRAO Science Data Archive. Details and updates regarding the archive can be found on the EVLA/VLA/VLBA Data Archive web page. Here, we choose to download a publicly available observation. If you are downloading proprietary data, you will need to either sign in to your NRAO account (using the link at the top of the archive page) or obtain the Project Access Key from the NRAO data analysts.

We want to find data associated with the project "TVER0002", so enter this into the "Project Code" under the "General Search Parameters" and submit the query. This will find two archive files; we will be downloading the first of these, so click on the checkbox next to the file name "TVER0002_sb2557689_1.55517.018916574074".

Selecting a set of scans

Although this isn't necessary, we will also select only a subset of the scans in this observation. You may wish to do this with your own data if you only want to retrieve data for a specific source or receiver band. To figure out which scans you want, you can either click on the "Scans" link in the "View Scans" column or, for a long or particularly complex observation, download the SDM tables, which contain metadata about the observation but are not very large, and inspect them with the task listsdm. For demonstration purposes, we will use the latter method.

To do this,

  • Fill in your email address and the preferred download location;
  • For the data download format, choose "SDM tables only (no visibilities)";
  • Click on "Get My Data."

Ignore the file size. In this case, it's listed as 21.76GB, which is the entire dataset; the SDM tables are in fact only ** MB.

Once the download is complete, start CASA and run listsdm:

# In CASA
listsdm('TVER0002.sb2568947.eb2579996.55518.22356400463')

This gives the output:

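With a scan listing in hand, picking out the scans for one source is a simple filter. The sketch below assumes the listing has been copied by hand into a list of (scan number, source name) pairs. The scan numbers and source names are invented placeholders, not values from this observation, and scans_for_source and format_scan_list are hypothetical helpers rather than CASA tasks. The "~" range notation follows CASA-style selection strings; the archive form may expect a different format.

```python
def scans_for_source(scan_listing, source_name):
    """Return the sorted scan numbers whose source matches source_name."""
    return sorted(scan for scan, source in scan_listing if source == source_name)


def format_scan_list(scans):
    """Collapse consecutive scan numbers into ranges, e.g. [2, 3, 4, 7] -> '2~4,7'."""
    ranges = []
    for scan in sorted(set(scans)):
        if ranges and scan == ranges[-1][1] + 1:
            ranges[-1][1] = scan  # extend the current run of consecutive scans
        else:
            ranges.append([scan, scan])  # start a new run
    return ",".join(
        str(first) if first == last else "%d~%d" % (first, last)
        for first, last in ranges
    )


# Placeholder listing: these scan numbers and source names are invented.
example_listing = [
    (1, "CALIBRATOR_A"), (2, "TARGET_X"), (3, "TARGET_X"),
    (4, "TARGET_X"), (5, "CALIBRATOR_A"), (6, "TARGET_X"),
]
target_scans = scans_for_source(example_listing, "TARGET_X")
print(format_scan_list(target_scans))  # prints 2~4,6
```

Keeping the filter and the formatting separate makes it easy to swap in a different output format if the download form requires one.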

Choosing to average data

Retrieval process

Complete the request by entering an email address and selecting "Create tar file". Note that this last step makes data retrieval substantially easier, since the data will comprise multiple files within a directory if no tar bundle is requested.

When the archive process is complete, an email notification is sent out with information about the download directory. Copy the data to a convenient location, and unpack the tar file by typing "tar xvf TVER0002_sb2557689_1.55517.018916574074.tar". This will create the SDM data directory, but will retain the original tar file -- to conserve disk space, you will probably wish to delete the tar file.
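For those who prefer to script this step, the unpack-then-delete sequence can be sketched in plain Python with the standard tarfile module. This is only an illustration, not part of CASA: unpack_sdm_tar is a hypothetical helper name, and the check for unsafe paths is an added precaution that the tar command above does not perform.

```python
import os
import posixpath
import tarfile


def unpack_sdm_tar(tar_path, dest_dir=".", delete_tar=False):
    """Extract a tar bundle into dest_dir and optionally delete the tar file.

    Returns the sorted top-level names that were unpacked.
    """
    with tarfile.open(tar_path, "r") as bundle:
        members = bundle.getmembers()
        # Refuse absolute paths and ".." components so nothing lands outside dest_dir.
        for member in members:
            if member.name.startswith("/") or ".." in member.name.split("/"):
                raise ValueError("unsafe path in tar file: %s" % member.name)
        bundle.extractall(dest_dir)
        top_level = sorted(
            {posixpath.normpath(m.name).split("/")[0] for m in members} - {"."}
        )
    if delete_tar:
        os.remove(tar_path)  # only reached if extraction succeeded
    return top_level


# Example call (commented out because the tar file must already exist locally):
# unpack_sdm_tar("TVER0002_sb2557689_1.55517.018916574074.tar", delete_tar=True)
```

Deleting the tar file only after extraction succeeds avoids losing the download if unpacking fails partway through.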

Starting CASA and initial inspection: listobs

Note that a description of importing EVLA data into AIPS can be found here.

First, be sure you have the most recent version of CASA installed. To start CASA, type "casapy"; this will start writing output to a log file called "casapy.log" as well as to the logger window, and will store any command-line input in a file called "ipython.log". (Note that a detailed description of the CASA environment, including relevant information on the Python language, can be found here.)

Initial data flagging: online flags, zero flags, and shadow flags