
mirar's People

Contributors

broulston, dependabot[bot], github-actions[bot], jamiesoon, robertdstein, saarahhall, sukishore12, virajkaram

mirar's Issues

Missing requirements

Extra python requirements were introduced in #39, but these are not yet reflected in requirements.txt/setup.py.

At a minimum, confluent_kafka is now required; I haven't investigated further yet.
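
As a minimal sketch, the fix could look something like this in setup.py (the layout and lack of a version pin are assumptions; the real constraint should match whatever #39 introduced):

    # setup.py (excerpt): hypothetical addition, pin as appropriate
    from setuptools import setup, find_packages

    setup(
        name="winterdrp",
        packages=find_packages(),
        install_requires=[
            # ... existing requirements ...
            "confluent-kafka",  # PyPI package providing the confluent_kafka module
        ],
    )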

Image not being closed properly in zogy

/Users/robertstein/Code/winterdrp/winterdrp/processors/zogy/py_zogy.py:53: ResourceWarning: unclosed file <_io.FileIO name='/Users/robertstein/Data/summer/20220815/subtract/SUMMER_20220816_042349_Camera0.resamp.resamp.fits.scaled' mode='rb' closefd=True>
  N = fits.open(Nf)[0].data
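
A context manager would avoid the leaked file handle; a minimal sketch of the fix (Nf as in the snippet above):

    # py_zogy.py sketch: open the file in a context manager so it is always closed
    from astropy.io import fits

    with fits.open(Nf) as hdul:
        N = hdul[0].data.copy()  # copy the array so it remains valid after the file closes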

Nan/zero in swarp

To quote @virajkaram:

"Swarp sets sets masked pixels to zero when it resamples, but the other processors only masks nans. This affects the subtractions"

Right now we mask zeros when loading raw images. Maybe we should make this self-contained and handle it within the Swarp processor instead.
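
One self-contained option would be to convert the zeros back to NaNs immediately after resampling, inside the Swarp processor itself; a minimal sketch (the function name and call site are illustrative):

    import numpy as np
    from astropy.io import fits

    def remask_swarp_zeros(resampled_path: str) -> None:
        """Replace the exact zeros Swarp writes for masked pixels with NaNs."""
        with fits.open(resampled_path, mode="update") as hdul:
            data = hdul[0].data
            data[data == 0.0] = np.nan  # restore the NaN masking convention used elsewhere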

Merge imsub back into wirc

The BasePipeline is already set up to enable different running modes, which can be selected via the command line. So everything in imsub should eventually be merged back into the wirc directory/pipeline.

Check quality of pipeline

Right now there is a unit test, so we check that the code is consistent up to photometric calibration. We do not yet check whether the reduction is consistently good or consistently bad.

Systematic Error Handling

Following standard Python practice:

All errors should be raised and then handled, rather than relying on passing around processing status numbers etc.

We need to systematically raise errors, handle them, and then be able to summarise them. In production, you would want a nightly email summary tracking which images were/were not successfully processed.

So far this has been partially addressed by #47 and #48, but there are still missing pieces.

Ideally, all errors would be raised by the code itself. In any case, we should track which errors were not raised by the code, so we can prioritise fixing them, leaving aside errors related to e.g. image issues which are understood and unavoidable.
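
As a sketch of the direction (the class and function names here are hypothetical, not the actual #47/#48 implementation): a base exception that known failure modes subclass, caught per image and collected for a nightly summary.

    # hypothetical sketch, not the actual implementation
    class ProcessorError(Exception):
        """Base class for errors deliberately raised by the pipeline."""


    def process_image(image, process_fn, error_log: list) -> None:
        """Run process_fn(image), recording known vs unexpected errors for a nightly summary."""
        try:
            process_fn(image)
        except ProcessorError as err:
            error_log.append((image, err, True))   # known error, raised by the code itself
        except Exception as err:
            error_log.append((image, err, False))  # unexpected error: prioritise fixing these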

Reorganise Database Processor

We want to reorganise the Database Processors into:

  • DBHandler
  • BaseDBImporter
  • BaseDBExporter
  • ImageDBImporter
  • ImageDBExporter
  • DataframeDBImporter
  • DataframeDBExporter

This will be needed for much downstream functionality, including Reference Image generation and Candidate naming.

This should be done on the db branch.
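
A rough scaffolding sketch of how that split might look (the inheritance shown is an assumption; the real db-branch classes may differ):

    # scaffolding only; the inheritance structure is assumed
    class DBHandler:
        """Shared connection and credential handling."""


    class BaseDBImporter(DBHandler):
        """Pulls rows from a database table into the pipeline."""


    class BaseDBExporter(DBHandler):
        """Writes pipeline products into a database table."""


    class ImageDBImporter(BaseDBImporter):
        """Imports image metadata."""


    class ImageDBExporter(BaseDBExporter):
        """Exports image metadata."""


    class DataframeDBImporter(BaseDBImporter):
        """Imports candidate dataframes."""


    class DataframeDBExporter(BaseDBExporter):
        """Exports candidate dataframes."""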

Test database

Much of the database creation is not tested. Let's change that!

Processing candidates to Fritz

If Fritz is down, we run into the issue that candidates cannot be processed/annotations cannot be updated. To ensure this doesn't cause the pipeline to break:

  • Once the candidates table is in the database, add an is_fritz_processed field. Query everything that hasn't been submitted to Fritz and feed it to the SendToFritz processor
  • If Fritz is down, set the is_fritz_processed field to False for that candidate, wait 30 seconds (Fritz goes down in roughly 30 second stretches), then continue
  • Successful Fritz processing/updating sets the is_fritz_processed field to True
  • Query is_fritz_processed again for False entries after going through all candidates
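
A rough sketch of that retry loop (the helper functions stand in for real database and Fritz calls):

    import time

    def submit_unprocessed_candidates(get_unprocessed, send_to_fritz, mark_processed):
        """Sketch of the retry loop; the three callables are placeholders."""
        for candidate in get_unprocessed():          # WHERE is_fritz_processed = false
            try:
                send_to_fritz(candidate)             # SendToFritz processor call
                mark_processed(candidate, True)      # set is_fritz_processed = True
            except Exception:
                mark_processed(candidate, False)     # leave it for the next pass
                time.sleep(30)                       # wait out a ~30 second Fritz outage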

Set up unit tests for summer

We now have a framework for running unit tests using data from a private Github repo. We can now adopt the mantra "test everything" without worrying about making data public (though that's my strong preference where possible). Anyway, we can actually run the summer unit tests with the CI as well as WIRC, so we should set that up.

Similar to #38, and similarly blocked by #22.

Slack Reports

It would be nice to let the monitor send slack reports...
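
For example, a minimal sketch using a Slack incoming webhook (the webhook URL and message content are placeholders):

    import requests

    def send_slack_report(webhook_url: str, text: str) -> None:
        """Post a short report to a Slack incoming webhook."""
        response = requests.post(webhook_url, json={"text": text}, timeout=10)
        response.raise_for_status()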

Integrate with Github Actions

We want to run the tests on Github Actions. However, we first need a way to get test data.

We could:

  • Get permission for publishing some limited set of test data (I vote for this in the spirit of open source)
  • Set up some download function, perhaps with secret github keys, to download the data

We also need to install sextractor etc. on Github, which may or may not be possible. Otherwise, Docker? This needs investigation.
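
For the download-function option above, a minimal sketch reading a token from the environment (the secret name, URL, and auth scheme are placeholders):

    import os
    import requests

    def download_test_data(url: str, destination: str) -> None:
        """Fetch test data using a token stored as a CI secret (sketch only)."""
        token = os.environ["TEST_DATA_TOKEN"]  # hypothetical secret name
        response = requests.get(
            url, headers={"Authorization": f"token {token}"}, timeout=60
        )
        response.raise_for_status()
        with open(destination, "wb") as f:
            f.write(response.content)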

Wiki Page Development

  • Include a broad overview of pipeline functioning (batching, etc.)
  • An overview of the processors
  • How to run and create unit tests
  • #19

Add magdifflim field to df

magdifflim is a field used by photometry creation (sections that need it are currently being skipped). Once added, the make_photometry method in SendToFritz.py can be updated.

Update SUMMER test ZP values

The Summer data reduction pipeline currently uses a 30 arcminute radius to query catalog sources for astrometry and photometric calibration. The field of view of the camera is ~15 arcmin on a side, so we really only need 7.5 arcmin radius searches, which makes the queries faster. This changes the zeropoints slightly (by 0.001), but results in the CI tests failing.

Set up unit tests with WIRC

We got the green light for setting up unit tests with WIRC data. This would involve making public the minimum number of WIRC images needed for testing the pipeline. We should identify which images are needed:

  • A target where we have an image to use as a reference for subtraction
  • A set of flats/darks
  • A block of dithers for a WIRC stack
  • Ideally use published data -> Select old target

Topic datetime for data processing

When sending avro packets to IPAC, we currently send the packets to a topic name based on the current UTC time. This allows for testing and, down the line, for reprocessing of data.

We need to review whether this is the right decision for the topic naming.
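
A minimal sketch of the scheme as described (the topic prefix and timestamp format are assumptions):

    from datetime import datetime, timezone

    def make_topic_name(prefix: str = "winter") -> str:
        """Build a per-run topic name from the current UTC time."""
        return f"{prefix}_{datetime.now(timezone.utc).strftime('%Y%m%d_%H%M%S')}"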

Unit tests broken

The unit tests which run successfully on main do NOT run on the imsub branch. We should not merge until we understand the discrepancy.

pycache files added to repo

Looks like a bunch of random pycache files that should be untracked were added to the repo with PR #84. Is it okay to delete them?

ImageRejector

I want a processor that works like an anti-ImageSelector. It should remove images if they have header keys matching particular values. I'll use it to eliminate focus images, AND to select images which have not been processed/entered into a database before.
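
A simplified sketch of what that could look like (the interface and batch structure here are assumptions, not the real winterdrp processor API):

    # simplified sketch; not the real winterdrp processor API
    class ImageRejector:
        """Drop images whose header value for a given key matches any listed value."""

        def __init__(self, *targets: tuple):
            self.targets = targets  # e.g. ("OBSTYPE", ["focus"])

        def apply(self, images: list, headers: list):
            keep = [
                i for i, header in enumerate(headers)
                if not any(header.get(key) in values for key, values in self.targets)
            ]
            return [images[i] for i in keep], [headers[i] for i in keep]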

Weird Problem with the Sextractor Module

When trying to reduce data, I am running into a weird error:

Error for processor winterdrp.processors.astromatic.sextractor.sextractor at 2022-08-30 12:10:28.784147 (local time): 
   File "/Users/robertstein/Code/winterdrp/winterdrp/processors/base_processor.py", line 127, in base_apply
    batch = self.apply(batch)
  File "/Users/robertstein/Code/winterdrp/winterdrp/processors/base_processor.py", line 199, in apply
    images, headers = self._apply_to_images(images, headers)
  File "/Users/robertstein/Code/winterdrp/winterdrp/processors/astromatic/sextractor/sextractor.py", line 179, in _apply_to_images
    header[sextractor_checkimg_keys[checkimg_type]] = checkimage_name[ind]
KeyError: 'NONE' 
  This error affected the following files: ['SUMMER_20220824_204552_Camera0.resamp.fits'] 
This error was not a known error raised by winterdrp. 

Beyond the typos, why would the code be trying to get an entry marked "None"?

Hardcoded paths need to be replaced

I think there are some paths which were hard-coded relative to the winterdrp directory, rather than as absolute paths. The upshot of this is that you can only run the code if you are in the winterdrp directory, rather than in any directory as expected.

The solution is to replace any relative path with one referencing the absolute path of the code directory, which can be obtained with e.g. pathlib or os.path.abspath(__file__).

The specific error I get is:

FileNotFoundError: [Errno 2] No such file or directory: 'winterdrp/pipelines/wirc_imsub/wirc_imsub_files/schema/candidates.sql'

but I suspect there are others.
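
A minimal sketch of the pathlib approach (the module layout is assumed):

    from pathlib import Path

    # absolute base directory of the winterdrp package, independent of the working directory
    # (assumes this snippet lives in a module inside the winterdrp package)
    winter_code_dir = Path(__file__).parent.resolve()

    candidates_schema = (
        winter_code_dir / "pipelines/wirc_imsub/wirc_imsub_files/schema/candidates.sql"
    )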

Less verbose log

After successfully running the code on a live night of ~300 images, we have logs of 4.6 MB. That feels too big to send as an email attachment every day (admittedly it was in debug mode). We should consider whether to reduce some of the text output (e.g. those jumbo astroquery tables).
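
One option would be to cap the verbosity of the chattiest third-party loggers while keeping our own at DEBUG; a minimal sketch:

    import logging

    # keep winterdrp at DEBUG, but silence jumbo third-party output (e.g. astroquery tables)
    logging.getLogger("winterdrp").setLevel(logging.DEBUG)
    logging.getLogger("astroquery").setLevel(logging.WARNING)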

Update docs

Should include:

  • Add install instructions
  • Explain unit tests
  • Links to contributing
  • Requirements
  • Author list

Database Duplicate

We need to add different options for handling the entry of duplicated data into a database. The likely options are fail, replace, or skip, with the default being fail.
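
For Postgres, the three behaviours map naturally onto the insert statement; a hedged sketch of how an exporter might build it (the function and column names are illustrative):

    # illustrative sketch of mapping a duplicate policy onto Postgres INSERT clauses
    def duplicate_clause(policy: str, key_column: str, columns: list) -> str:
        if policy == "fail":
            return ""  # plain INSERT: a duplicate key raises an error
        if policy == "skip":
            return f"ON CONFLICT ({key_column}) DO NOTHING"
        if policy == "replace":
            updates = ", ".join(f"{col} = EXCLUDED.{col}" for col in columns)
            return f"ON CONFLICT ({key_column}) DO UPDATE SET {updates}"
        raise ValueError(f"Unknown duplicate policy: {policy}")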

Create DatabaseQueryProcessor

We need a DB query processor that queries a database table and returns a dataframe. It would be used before the SendToFritz processor, for candidates that still need to be processed by Fritz (is_fritz_processed). #41
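
A minimal sketch of the query step with pandas (the connection handling, table, and column names are placeholders):

    import pandas as pd

    def query_unprocessed_candidates(connection) -> pd.DataFrame:
        """Return candidates not yet sent to Fritz as a dataframe."""
        return pd.read_sql(
            "SELECT * FROM candidates WHERE is_fritz_processed = false",
            connection,
        )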

Name creation for candidates

We need to query the database for a candidate name; if it does not exist, sequentially assign an id/name based on a predefined naming scheme. If it does exist, update it and/or include it in the prev_cand field.

Requires #16
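
One common scheme is a year prefix plus a sequential base-26 letter suffix; a hedged sketch (the prefix and format are assumptions, not a decided convention):

    def candidate_suffix(count: int, length: int = 3) -> str:
        """Map a running integer to letters: 0 -> 'aaa', 1 -> 'aab', 25 -> 'aaz', 26 -> 'aba'."""
        letters = ""
        while count > 0:
            letters = chr(ord("a") + count % 26) + letters
            count //= 26
        return letters.rjust(length, "a")


    def candidate_name(year: int, count: int, prefix: str = "WNTR") -> str:
        """e.g. candidate_name(2022, 0) -> 'WNTR22aaa'; prefix and format are assumptions."""
        return f"{prefix}{year % 100:02d}{candidate_suffix(count)}"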

MultiProcess Processor

I would love to have a MultiProcess Processor, which would split batches into different python processes. Basically, you could take 8 batches, make the flats first, then run each batch on N different CPUs for a factor-N speedup.
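
A minimal sketch of the idea with the standard library (the per-batch function stands in for the real processor chain):

    from multiprocessing import Pool

    def process_batches_in_parallel(batches: list, process_batch, n_cpu: int = 8):
        """Run a per-batch processing function across N worker processes."""
        with Pool(processes=n_cpu) as pool:
            return pool.map(process_batch, batches)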
