Minimal examples

These are minimal C, C++ and Fortran examples that load a 2-dimensional grid and print the value at (0,0). In each case the grid contains floating point values.

C example:

#include <mpi.h>
#include <asagi.h>
#include <stdio.h>
int main(int argc, char** argv)
{
  MPI_Init(&argc, &argv);
  
  asagi_grid* grid = asagi_grid_create(ASAGI_FLOAT);
  asagi_grid_set_comm(grid, MPI_COMM_WORLD);
  // with threads, set number of threads
  asagi_grid_set_threads(grid, 1);
  
  if (asagi_grid_open(grid, "/path/to/netcdf/file.nc", 0) != ASAGI_SUCCESS) {
    printf("Could not load file\n");
    return 1;
  }
  
  double pos[] = {0, 0};
  printf("Value at (0,0): %f\n", asagi_grid_get_float(grid, pos, 0));
  
  asagi_grid_close(grid);
  
  MPI_Finalize();
  
  return 0;
}

C++ example:

#include <mpi.h>
#include <asagi.h>
#include <iostream>
using namespace asagi;
int main(int argc, char** argv)
{
  MPI_Init(&argc, &argv);
  
  Grid* grid = Grid::create();
  grid->setComm(MPI_COMM_WORLD);
  // with threads, set number of threads
  grid->setThreads(1);
  
  if (grid->open("/path/to/netcdf/file.nc") != Grid::SUCCESS) {
    std::cout << "Could not load file" << std::endl;
    return 1;
  }
  
  double pos[] = {0, 0};
  std::cout << "Value at (0,0): " << grid->getFloat(pos) << std::endl;
  
  // The same as: "Grid::close(grid);"
  delete grid;
  
  MPI_Finalize();
  
  return 0;
}

Fortran example:

! You have two options:
! - Include the module file _once_ in your project:
!include 'asagi.f90'
! - Compile and link the module file as any other file in your project
program minimal
  use mpi
  use asagi
  use, intrinsic :: iso_c_binding
  implicit none
  
  integer :: grid_id
  real( kind=c_double ), dimension(2) :: pos
  integer :: error
  call mpi_init( error )
  
  grid_id = asagi_grid_create( )
  call asagi_grid_set_comm( grid_id, mpi_comm_world )
  ! with threads, set number of threads
  call asagi_grid_set_threads( grid_id, 1 );
  if( asagi_grid_open( grid_id, "/path/to/netcdf/file.nc" ) /= asagi_success ) then
    write (*,*) "Could not load file"
    call exit(1)
  end if
  pos(:) = 0
  write (*,*) "Value at (0,0):", asagi_grid_get_float( grid_id, pos )
  call asagi_grid_close( grid_id )
  call mpi_finalize( error )
end program minimal

Grid types

ASAGI distinguishes between three different grid types:

FULL The whole grid will be loaded during the initialization. The file is not accessed during runtime. (default)
CACHE ASAGI is used as a cache. After initialization, the cache will be empty. Each access to an element, will put the corresponding block into the cache for later usage.
PASS-THROUGH ASAGI will pass each access to the underlying file system without any caching, etc.

Full storage does not automatically mean, that the full grid is stored on every CPU. If asagi::Grid::setComm() and/or asagi::Grid::setThreads() are called, the initial grid will be distributed among all nodes resp. CPUs. If the cache-grid is used and asagi::Grid::setThreads() and/or asagi::Grid::setComm() are set, ASAGI will copy the data from other NUMA domains and/or other MPI processes. Only if it is not available in another cache, the data will be fetched from the file.

Dimensions

ASAGI supports grids with up to MAX_DIMENSIONS dimensions. (MAX_DIMENSIONS is 4 by default, but can be changed during compilation of ASAGI.) The number of actual dimensions in the grid cannot be specified by calling an ASAGI function but depends on the netCDF input file.

Remarks: The order in the dimension in the netCDF file is in Fortran style (column-major, see NetCDF files) but the ASAGI interface uses C/C++ ordering (row-major).

Level of detail

A grid can have multiple resolutions. Each resolution is identified by a level id (level of detail). If the number of levels is not specified when creating a grid, the grid will contain only one level of detail. In this case you can also omit the level id in all other functions, since level 0 will be used by default. (C does not support default arguments or overloading, therefore omitting arguments is not possible when using the C interface.)

For grids with multiple levels asagi::Grid::open() must be called once for each level. Several levels can be stored in a single NetCDF file with different variable names. (Use asagi::Grid::setParam() to specify the variable name.) The coarsest resolution should have the level id 0. With ascending level id, the resolution should get finer. When accessing values with any get function, the level of detail can be selected with the last argument. The function asagi::Grid::close() has to be called only once for the whole grid.

Coordinate mapping

ASAGI distinguishes between actual coordinates and internal array indexes. All functions, that return a grid value, expect actual coordinates. ASAGI maps each coordinate to an array index using the coordinate variables from the NetCDF file (see section NetCDF files on how specify coordinate variables in NetCDF files). If no coordinate variable is available, the mapping is omitted. After the mapping, the coordinate is rounded to the nearest array index. ASAGI does not interpolate between array values.

The actual range of the grid can be obtained with asagi::Grid::getMin()/asagi::Grid::getMax(). They also return coordinates, not array indexes. It is erroneous to access values outside range of the grid.

The range of a dimension can be $(-\infty,\infty)$ . This is the case if the size of the dimension in the netCDF file is one.

Value position

ASAGI supports cell-centered and vertex-centered grids. The value position can be switched with asagi::Grid::setParam().

Cell-centered and vertex-centered grids

NetCDF files

All NetCDF files opened with ASAGI should respect the COARDS conventions (http://ferret.wrc.noaa.gov/noaa_coop/coop_cdf_profile.html). However, ASAGI has some further limitations:

The attributes scale_factor and add_offset are ignored. Besides conversion between data types, ASAGI does not modify the values.
Since ASAGI does not change the NetCDF file, all values have to be present in the file. Attributes, like _FillValue and missing_value, are not supported.
ASAGI is not aware of any units. It is up to the user of the library to interpret the values correctly.
Variables with more than three dimensions are not supported.

It is possible to open a NetCDF file by different grids or levels at the same time. This allows you, for example, to store all levels of one grid in a single NetCDF file. In this case the levels must be distinguished by the variable names.

Multi-thread support

When compiled with THREADSAFE=ON (see section Compilation) all functions are thread-safe. However, there are some restrictions due to MPI implementations. If your MPI library is not thread-safe, you have to add the additional flag THREADSAFE_MPI=ON which will mare sure that ASAGI does not call MPI functions from different threads at the same time. However, in this case, you are not allowed to call MPI and ASAGI functions at the same time.

Multi-thread support is required if you want to use ASAGI's NUMA functionality (see NUMA).

In addition, for ASAGI to work correctly, it has to now about all threads the application is using. Use asagi::Grid::setThreads() to set the number of threads and call asagi::Grid::open() from all threads. asagi::Grid::open() is a collective operation for all threads.

NUMA

ASAGI is able to detect the NUMA domains of your node. If more than one NUMA domain is detected, ASAGI will place a cache on each NUMA domain to increase locality. You can control the NUMA detection, with the configuration parameter NUMA_COMMUNICATION (see Parameters).

MPI Communication

ASAGI supports two different MPI communication patterns: Via MPI remote memory access (MPI windows) or a separate communication thread. The MPI windows are used by default since they do not have any special requirement and are easy to use. However, in some MPI libraries, RMA is poorly tested and does not work well, especially with hybrid parallelization.

Therefore, you can use the communication thread. In this mode, a separate thread is required which is responsible for answering remote requests. You have to start the thread with asagi::Grid::startCommThread before any grid using the communication thread is opened. Multiple grids will share one communication thread does you must not start more than communication. However, you have to make sure that the MPI communicator for the communication thread includes all grid communicators. Once the last grid using the communication thread is closed, you should stop the additional with asagi::Grid::stopCommThread. To use the communication thread it is also necessary to have a thread-safe MPI implementation.

To disable MPI communication completely, set MPI_COMMUNICATION to OFF

Parameters

ASAGI supports several parameters for each grid:

Name	Values	Description	Grid-global (*)
GRID	FULL \| CACHE \| PASS-THROUGH	The grid type (see Grid types)	yes
NUMA_COMMUNICATION	ON \| OFF \| CACHE	Enable/disable NUMA detection. `CACHE` can be used in combination with the "full grid". It enables NUMA detection and in addition ASAGI will look into all node local NUMA caches before activating MPI communication. (default: `ON` if compiled with NUMA support)	yes
MPI_COMMUNICATION	OFF \| THREAD \| WINDOW	Use a communication thread or MPI RMA (windows) for MPI communication (default: WINDOW, see MPI Communication)	yes
VALUE-POSITION	CELL-CENTERED \| VERTEX-CENTERED	The value position (see Value position)	yes
TIME-DIMENSION	int	The dimension that holds the time (default is -1 which means no time dimension exists). ASAGI treats time dimension specially.	yes
VARIABLE	string	The variable name in the netCDf file. (default: z)	no
BLOCK-SIZE-X	int	The size of a block in dimension X. Use a negative value to set the block size equal to the total number cells in this dimension.	no
CACHE-SIZE	int	The size of the cache (in blocks) on each CPU.	no
CACHE-HAND-SPREAD	int	ASAGI uses the clock algorithm to approx. LRU. This parameter specifies the difference of the 2 hands in the clock. Lower values result in a faster algorithm but a worse approximation.	no

(*) If yes, the parameter can only be set for all levels at the same time. Set the parameter level in asagi::Grid::setParam() to 0 to change value.

Access counters

ASAGI supports several access counters to measure the throughput of the library and get information about effectiveness of the caches:

Name	Description
accesses	Total number of data accesses
numa_transfers	Number of blocks transfered between CPUs
mpi_transfers	Number of blocks transfered between processes
file_load	Number of blocks loaded from file (after initialization)
local_hits	Number values that where already in local NUMA domain
node_hits	Number values that where already on the local node
local_misses	Number of values that where not already in local memory

Remarks: If more than one thread is used and ASAGI is not compiled with THREADSAFE_COUNTER=ON, the counters might be inaccurate.