usra-sti / gdt-core
Gamma-ray Data Tools - Core Components
License: Apache License 2.0
I noticed that opening TTE files takes about 3x longer with GDT than with the older GBM data tools. The slowdown traces to a list(events) conversion happening inside the data_primitives.EventList class:
https://github.com/USRA-STI/gdt-core/blob/main/src/gdt/core/data_primitives.py#L505
This can be avoided by first creating an empty numpy array and then filling it with the TIME and PHA columns:
self._events = np.empty(times.size, dtype=[('TIME', times.dtype.type),
                                           ('PHA', channels.dtype.type)])
self._events['TIME'] = times
self._events['PHA'] = channels
Filling the empty array takes ~0.06 sec on a 2019 MacBook Pro, versus ~1 sec for the list conversion. I'll work on a pull request with this suggested change.
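For reference, a minimal standalone benchmark of the two construction strategies; the array size and the zipped-list construction below are stand-ins for the actual GDT code path, not the exact code in EventList:

import time
import numpy as np

times = np.sort(np.random.random(1_000_000))
channels = np.random.randint(0, 128, times.size).astype(np.int16)
dtype = [('TIME', times.dtype.type), ('PHA', channels.dtype.type)]

# Current approach: build a structured array from a Python list of tuples.
t0 = time.perf_counter()
slow = np.array(list(zip(times, channels)), dtype=dtype)
t1 = time.perf_counter()

# Proposed approach: pre-allocate and fill each column directly.
fast = np.empty(times.size, dtype=dtype)
fast['TIME'] = times
fast['PHA'] = channels
t2 = time.perf_counter()

print(f"list conversion: {t1 - t0:.3f} s, column fill: {t2 - t1:.3f} s")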
DataCollection is essentially a dict wrapper; in my opinion, it should behave like one. This would make the code more intuitive.
See #11
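As a sketch of what dict-like behavior could look like (an illustration, not the actual DataCollection implementation), the class could implement the Mapping protocol so that indexing, iteration, and membership tests all work as they do for a dict:

from collections.abc import Mapping

class DataCollectionSketch(Mapping):
    """Hypothetical Mapping-based collection keyed by item name."""

    def __init__(self):
        self._data = {}

    def include(self, item, name):
        self._data[name] = item

    def __getitem__(self, name):
        return self._data[name]

    def __iter__(self):
        return iter(self._data)

    def __len__(self):
        return len(self._data)

# 'name' in collection, collection[name], len(collection), and iteration
# then behave exactly as they would for a plain dict.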
The gdt-data script needs to be updated to use importlib.resources and importlib.metadata instead of the deprecated pkg_resources.
Lines 186 to 189 in 2707c23
The bare try/except there prevents debugging and has no obvious purpose. The try/except should be removed so that any exceptions are raised upon write.
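For illustration only (the code at the referenced lines may differ), the problematic pattern and the suggested change look like this:

# Anti-pattern: a bare except hides every failure from the user.
def write(self, filename):
    try:
        self.hdulist.writeto(filename)
    except:
        pass

# Suggested: write directly and let any exception propagate to the caller.
def write(self, filename):
    self.hdulist.writeto(filename)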
I've noticed an issue when visualizing localizations with EquatorialPlot, where the probability heatmap isn't rendered correctly when the centroid RA is close to 0 or 360 degrees. This can be illustrated using a local copy of glg_healpix_all_bn220130968_v01.fit:
from gdt.core.plot.sky import EquatorialPlot
from gdt.missions.fermi.gbm.localization import GbmHealPix
gbm_healpix = GbmHealPix.open("./glg_healpix_all_bn220130968_v01.fit")
gbm_healpix.centroid
(357.87735849056605, -50.480044262442945)
eqplot = EquatorialPlot()
eqplot.add_localization(gbm_healpix, gradient=True)
The following plot is generated: [figure omitted]
According to the GBM catalog, the correct skymap should look like: [figure omitted]
The issue appears to be caused by the transformation of the specified lon/lat arrays to SkyCoord in SkyMap.plot_heatmap():
gdt-core/src/gdt/core/plot/sky.py
Lines 456 to 467 in 91020a3
In the case of localizations, the lon_array specified to plot_heatmap() will contain values in [0, 360], and SkyCoord() will convert the last 360 value to 0. E.g.:
from astropy.coordinates import SkyCoord
SkyCoord(ra=[0, 180, 360], dec=[45, 45, 45], unit="deg")
<SkyCoord (ICRS): (ra, dec) in deg
[( 0., 45.), (180., 45.), ( 0., 45.)]>
This appears to cause the issue when subsequently rendering the heatmap. Note that it's only visible when centroids straddle RA=0/360 degrees; other localization centroids are rendered correctly. I've also noticed a similar issue with the contour lines: although it doesn't impact the visualization as much as the heatmaps, the lines are broken, and I assume this is also due to the use of SkyCoord.
I have a local workaround that overrides plot_heatmap() to exclude the call to SkyCoord(); the specified lon_array and lat_array are passed directly to SkyHeatmap, similar to the previous GBM Data Tools implementation:
from gdt.core.plot.plot import SkyHeatmap

class FixEquatorialPlot(EquatorialPlot):
    def plot_heatmap(self, heatmap, lon_array, lat_array, **kwargs):
        """Plot a heatmap on the sky.

        This is a workaround for the issue in the parent SkyPlot.plot_heatmap(),
        where the meshgrid results in a lon_array starting and ending with 0
        degrees. This causes the heatmap generated from a HEALPix localization
        to be populated at both RA=0 and RA=360 degrees, resulting in an
        unexpected output from the subsequent call to Matplotlib's pcolormesh().
        The old GBM Data Tools didn't generate a meshgrid, and just passed the
        specified coordinate arrays unchanged.

        Args:
            heatmap (np.array): A 2D array of values
            lon_array (np.array): The array of longitudinal gridpoints
            lat_array (np.array): The array of latitudinal gridpoints
            **kwargs: Options to pass to :class:`~gdt.plot.plot.SkyHeatmap`
        """
        heatmap = SkyHeatmap(lon_array, lat_array, heatmap, self.ax,
                             frame=self._frame, flipped=self._flipped, **kwargs)
        return heatmap
This results in a correct rendering of the heatmap, similar to that contained in the catalog:
fixeqplot = FixEquatorialPlot()
fixeqplot.add_localization(gbm_healpix, gradient=True)
I can create a PR with this fix, but I wanted to check first whether the SkyCoord usage was required for other reasons.
For pha.py, there is a FILENAME keyword set in the primary header that is not part of the OGIP standard.
Referencing Issue USRA-STI/gdt-fermi#27.
This appears to be the offending line:
gdt-core/src/gdt/core/spectra/functions.py
Line 259 in 4a5291c
This should affect additive and multiplicative SuperFunctions alike.
gdt-core/src/gdt/core/healpix.py
Line 106 in 018acb3
Currently trigtime is the only property saved when returning a HealPix-derived class from the convolve function.
In practice, this means a user trying to convolve a systematic error with a GBM map will receive a map with header, scpos, and quaternion information stripped away. Is this the intended behavior? Should users be expected to re-apply that information to the returned map?
It seems that the patch option only works for matplotlib<=3.6.3 for some of the plotting schemes in gdt-core.
The Pstat likelihood does not handle zero-count bins. While this limitation is noted in SpectralFitterPstat, it can be rectified by including the special-case calculation of the likelihood for zero-count bins: [equation omitted]
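For this family of Poisson statistics, the special case follows because the S_i ln S_i term vanishes when S_i = 0, leaving only the model term. Below is a minimal sketch, assuming counts are the observed counts per bin and model_counts are the predicted model-plus-known-background counts per bin; it illustrates the special-casing, not the actual GDT implementation:

import numpy as np

def pstat(counts, model_counts):
    """-2 ln(likelihood) for Poisson data, special-casing zero-count bins."""
    counts = np.asarray(counts, dtype=float)
    model_counts = np.asarray(model_counts, dtype=float)
    stat = np.empty_like(model_counts)

    zero = (counts == 0)
    # Zero-count bins: the S_i*ln(S_i) term vanishes, so only the model
    # prediction contributes.
    stat[zero] = 2.0 * model_counts[zero]
    # Non-zero bins: standard Poisson term (Stirling's approximation for S_i!).
    nz = ~zero
    stat[nz] = 2.0 * (model_counts[nz] - counts[nz]
                      + counts[nz] * np.log(counts[nz] / model_counts[nz]))
    return stat.sum()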
@BillCleveland-USRA @AdamGoldstein-USRA
Currently, when working with position history (poshist) information from the GBM mission, a user can do:
poshist = GbmPosHist.open(pos_file)
frames = poshist.get_spacecraft_frame()
t = Time(val, format=format)
frame = frames.at(t)
where a Time object is used to retrieve a spacecraft frame. In this case, the user does not need knowledge of the underlying time format used in the mission. They only need to create a valid time object.
However, this is incompatible with the time format in TTE files:
tte = GbmTte.open(tte_file)
frame = frames.at(tte.trigtime)  # <-- results in an error
because the tte class has no inherent knowledge of the underlying time format from the mission.
Would it be possible to require knowledge of the time format in the core TTE class? Inherited classes from missions would then be required to set this value, similar to how GbmPosHist knows the underlying time format needed to generate spacecraft frames. Functions like splice_time could then allow for either float or Time input with something like:
def __init__(self, data, trigtime, ..., time_format):
    # need to figure out how to handle class + format string
    self._format = time_format

def splice_time(self, tstart, tstop):
    if isinstance(tstart, Time):
        tstart = getattr(tstart, self._format)
This would allow users to provide Time values instead of needing to know the internal mission time formats themselves. You could also have a separate splice_time function for formatted input if you want to avoid slowing down the regular float method. The benefit here is that you avoid having to cast the underlying data to astropy Time objects.
There is also a need to return a Time value for tte.trigtime.
What do you think? Should I prototype this for a pull request?
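As a rough sketch of the proposed pattern (the class name and attributes below are hypothetical, and 'gps' merely stands in for whatever astropy Time format a mission subclass would declare):

from astropy.time import Time

class PhotonListSketch:
    # Mission subclasses would override this with their registered
    # astropy Time format name.
    _time_format = 'gps'

    def __init__(self, trigtime):
        self._trigtime = trigtime

    @property
    def trigtime(self):
        # Return an astropy Time so users never handle raw mission seconds.
        return Time(self._trigtime, format=self._time_format)

# frames.at(tte.trigtime) would then work, because trigtime is already
# a Time object rather than a bare float.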
A highly requested feature is an implementation of the Bayesian Blocks binning algorithm in the GDT.
This could be implemented using Astropy's bayesian_blocks function. Both binned and unbinned data would use fitness='events' and would accept the gamma and ncp_prior keywords.
For unbinned data, the GDT function would be little more than a wrapper around bayesian_blocks, since the input is an array of event times and the output is an array of bin edges.
For binned data, the GDT function is slightly more complex, because the number of counts in each new bin must be calculated as well as the exposure of each bin.
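For the unbinned case, the wrapper might look like the following sketch (the function name and keyword handling are assumptions):

import numpy as np
from astropy.stats import bayesian_blocks

def bin_by_bayesian_blocks(times, gamma=None, ncp_prior=None):
    """Return Bayesian Blocks bin edges for an array of event times."""
    kwargs = {}
    if gamma is not None:
        kwargs['gamma'] = gamma
    if ncp_prior is not None:
        kwargs['ncp_prior'] = ncp_prior
    # fitness='events' is the unbinned event-data fitness function.
    return bayesian_blocks(np.asarray(times), fitness='events', **kwargs)

The binned case would additionally histogram the original counts into the new edges and accumulate the exposure of each new bin.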
I’m currently trying to install GDT on my computer and I’ve run into an issue that I can’t figure out. I used the pip install command as it says on the docs page. However, when I run gdt-data init I receive the following error:
Traceback (most recent call last):
  File "/home/unholysun/Documents/GDT/GDT/bin/gdt-data", line 44, in <module>
    from gdt.core.heasarc import FileDownloader
  File "/home/unholysun/Documents/GDT/GDT/lib/python3.8/site-packages/gdt/core/heasarc.py", line 379, in <module>
    class FileDownloader(BaseFinder):
  File "/home/unholysun/Documents/GDT/GDT/lib/python3.8/site-packages/gdt/core/heasarc.py", line 402, in FileDownloader
    def bulk_download(self, urls: list[str], dest_dir: Union[str, Path], verbose: bool = True):
TypeError: 'type' object is not subscriptable
I’m not sure if this is user error or perhaps something specific to my setup. I’m running Ubuntu through WSL2 on Windows 11. I have all required packages from the requirements.txt file installed and a virtual environment set up.
from gdt.core.heasarc import BrowseCatalog
Error returned:
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "gdt-core/src/gdt/core/heasarc.py", line 379, in <module>
    class FileDownloader(BaseFinder):
  File "gdt-core/src/gdt/core/heasarc.py", line 402, in FileDownloader
    def bulk_download(self, urls: list[str], dest_dir: Union[str, Path], verbose: bool = True):
TypeError: 'type' object is not subscriptable
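Both tracebacks share the same root cause: the built-in generic annotation list[str] (PEP 585) is only valid at runtime on Python >= 3.9, and both environments appear to be on an older interpreter (the first traceback shows python3.8), where subscripting the list type raises exactly this TypeError. A sketch of a 3.8-compatible signature (not necessarily the fix the maintainers adopted):

from pathlib import Path
from typing import List, Union

class FileDownloader:
    # Mirror of the failing signature, rewritten with typing.List so it
    # also evaluates on Python 3.8. Alternatively, adding
    # `from __future__ import annotations` at the top of heasarc.py defers
    # annotation evaluation, so list[str] is never subscripted at runtime.
    def bulk_download(self, urls: List[str], dest_dir: Union[str, Path],
                      verbose: bool = True):
        ...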
When FitsFileContextManager._repr_html_() is called by Jupyter, it raises an AttributeError on a new object that doesn't have an HDUList defined.
This was reported as issue USRA-STI/gdt-fermi#15
The newest version of matplotlib doesn't like plt.cm.get_cmap(); it needs to be updated to plt.get_cmap() for anything newer than 3.9.0.
Solution
import numpy as np
import astropy.coordinates as a_coords
import astropy.units as u
from scipy.spatial.transform import Rotation
from gdt.core.coords import SpacecraftFrame

@a_coords.frame_transform_graph.transform(a_coords.FunctionTransform,
                                           a_coords.ICRS, SpacecraftFrame)
def icrs_to_spacecraft_mod(icrs_frame, sc_frame):
    """
    Modified version of 'icrs_to_spacecraft' that removes NaN values of 'el'.
    """
    xyz = icrs_frame.cartesian.xyz.value
    rot = Rotation.from_quat(sc_frame.quaternion)
    xyz_prime = rot.inv().apply(xyz.T)
    if xyz_prime.ndim == 1:
        xyz_prime = xyz_prime.reshape(1, -1)
    az = np.arctan2(xyz_prime[:, 1], xyz_prime[:, 0])
    mask = (az < 0.0)
    az[mask] += 2.0 * np.pi
    # np.clip keeps the arccos argument within [-1, 1], avoiding NaN values
    el = np.pi / 2.0 - np.arccos(np.clip(xyz_prime[:, 2], -1.0, 1.0))
    return type(sc_frame)(az=az * u.radian, el=el * u.radian,
                          quaternion=sc_frame.quaternion)
ExposureBins and TimeEnergyBins calculate "good segments," which are contiguous segments of data separated by bad time intervals (typically intervals with no data). This is determined by comparing the high edge of a bin to the low edge of the next bin: if they are not the same, the end of the good segment is declared and a new segment starts. The implementation in ExposureBins, for example, is
mask = (self.lo_edges[1:] != self.hi_edges[:-1])
There are cases when self.lo_edges[1:] and self.hi_edges[:-1] are slightly different, usually due to rounding/numerical precision limitations, especially in older files that use 32-bit floats. This causes an incorrect segmentation of the data. An improvement is to implement a tolerance:
mask = np.abs(self.lo_edges[1:] - self.hi_edges[:-1]) > tol
where tol is the allowed tolerance in seconds. As an example, tol=1e-4 is sufficient for BATSE continuous data.
The tol keyword could either be added as an option to the _calculate_good_segments() method and passed via an option in the initializer, or the initializer could set a private tolerance variable that _calculate_good_segments() uses. A sketch of the tolerance-based segmentation follows.
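A minimal sketch of the tolerance-based segmentation on bare edge arrays (the function name and return format are assumptions, not the GDT API):

import numpy as np

def good_segments(lo_edges, hi_edges, tol=1e-4):
    """Return (start, stop) times of contiguous segments, allowing
    edge mismatches up to `tol` seconds."""
    mask = np.abs(lo_edges[1:] - hi_edges[:-1]) > tol
    breaks = np.flatnonzero(mask) + 1          # first bin of each new segment
    starts = np.concatenate(([0], breaks))
    stops = np.concatenate((breaks, [lo_edges.size]))
    return [(lo_edges[i], hi_edges[j - 1]) for i, j in zip(starts, stops)]

# Example: a 1e-5 s edge mismatch no longer splits the data in two.
lo = np.array([0.0, 1.00001, 2.0])
hi = np.array([1.0, 2.0, 3.0])
print(good_segments(lo, hi))   # [(0.0, 3.0)]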