
icevision's Introduction


An Agnostic Computer Vision Framework




IceVision is the first agnostic computer vision framework to offer a curated collection of hundreds of high-quality pre-trained models from Torchvision, Open MMLab's MMDetection, Ultralytics' YOLOv5, Ross Wightman's EfficientDet and, soon, PyTorch Image Models. It orchestrates the end-to-end deep learning workflow, allowing you to train networks with easy-to-use, robust, high-performance libraries such as PyTorch-Lightning and Fastai.

IceVision Unique Features:

  • Data curation/cleaning with auto-fix

  • Access to an exploratory data analysis dashboard

  • Pluggable transforms for better model generalization

  • Access to hundreds of neural net models

  • Access to multiple training loop libraries

  • Multi-task training to efficiently combine object detection, segmentation, and classification models

Installation

pip install icevision[all]

For more installation options, check our docs.

Important: We currently only support Linux/MacOS.

Quick Example: How to train the Fridge Objects Dataset

[Open In Colab notebook and result screenshots]
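Since the Colab badge and screenshots do not render here, below is a condensed sketch of the kind of workflow the notebook walks through, loosely based on the IceVision getting-started tutorial. The icedata fridge helpers, the chosen model/backbone names and the hyperparameters are assumptions and may not match the current API exactly.

# Sketch only: names and signatures may differ between icevision/icedata versions.
from icevision.all import *
import icedata

# Download and parse the Fridge Objects dataset (helpers assumed from icedata)
data_dir = icedata.fridge.load_data()
parser = icedata.fridge.parser(data_dir)
train_records, valid_records = parser.parse()

# Transforms: augment for training, resize/pad for validation
train_tfms = tfms.A.Adapter([*tfms.A.aug_tfms(size=384, presize=512), tfms.A.Normalize()])
valid_tfms = tfms.A.Adapter([*tfms.A.resize_and_pad(384), tfms.A.Normalize()])
train_ds = Dataset(train_records, train_tfms)
valid_ds = Dataset(valid_records, valid_tfms)

# Pick a model type and backbone (torchvision Faster R-CNN used here as an example)
model_type = models.torchvision.faster_rcnn
backbone = model_type.backbones.resnet50_fpn(pretrained=True)
model = model_type.model(backbone=backbone, num_classes=len(parser.class_map))

# Model-specific dataloaders
train_dl = model_type.train_dl(train_ds, batch_size=8, num_workers=4, shuffle=True)
valid_dl = model_type.valid_dl(valid_ds, batch_size=8, num_workers=4, shuffle=False)

# Train with the fastai loop and track COCO mAP
metrics = [COCOMetric(metric_type=COCOMetricType.bbox)]
learn = model_type.fastai.learner(dls=[train_dl, valid_dl], model=model, metrics=metrics)
learn.fine_tune(10, 1e-4)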

Happy Learning!

If you need any assistance, feel free to:

Join our Forum

icevision's People

Contributors

2649, adamfarquhar, addono, ai-fast-track, aisensiy, alexandrebrown, boscacci, burntcarrot, dnth, drscotthawley, famosi, fcakyon, frapochetti, fstroth, hectorlop, jerbly, joowon-dm-snu, lee00286, lgvaz, matt-deboer, miwojc, nicjac, oke-aditya, paras-jain, partham16, potipot, ribenamaplesyrup, rsomani95, singhalpranav22, strickvl


icevision's Issues

Deprecate CategoryParser

This parser does not follow the structure of the other parsers and it's not very useful anyway.

Restructuring Model folder

Following the discussion on Slack and on issue #60, we arrived at a structure like the following:

-> backbones (use torchvision + custom)
-> layers (layers that are used in the backbones)
-> models
-----> model_name_folder (e.g. fasterrcnn)
----------> model.py (uses the backbones)
----------> dataloader.py (with minor edits for every model)

model.py includes train_step, validation_step, test_step.

train.py (code for training and inference with the model) would live in the examples folder.

Inheriting from rcnn to faster rcnn adds extra inter-code coupling which we might want to avoid. Let's have separate structures for rcnn, fast rcnn and faster rcnn; it would make debugging easier as well.

Also, I will raise a PR for contributing.MD and FAQs.MD (will check how to make .rst).

COCOMetric bug with transforms

Transforms that resize the image change the positions of bboxes and segmentation masks.

Currently COCOMetric will use the positions of the original records to calculate its metrics. There are three possible solutions:

  • Never scale validation images
  • Don't use pycocotools and write the metrics from scratch (good for the long term; pycocotools is causing a lot of minor issues)
  • Apply the transforms to the records passed to COCOMetric (see the sketch below)
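For the third option, the core step is rescaling the ground-truth boxes by the same ratios applied to the image. A minimal, framework-agnostic sketch of that idea (the list-of-boxes layout here is illustrative, not the record format):

def resize_bboxes(bboxes, orig_size, new_size):
    """Rescale [xmin, ymin, xmax, ymax] boxes to match a resized image."""
    orig_h, orig_w = orig_size
    new_h, new_w = new_size
    scale_x, scale_y = new_w / orig_w, new_h / orig_h
    return [
        [xmin * scale_x, ymin * scale_y, xmax * scale_x, ymax * scale_y]
        for xmin, ymin, xmax, ymax in bboxes
    ]

# Resizing 512x512 -> 256x256 halves every coordinate
print(resize_bboxes([[100, 100, 200, 300]], (512, 512), (256, 256)))
# -> [[50.0, 50.0, 100.0, 150.0]]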

transforms and iscrowd

If a transform removes an item from the image, the corresponding iscrowd entry also has to be removed.

Learner

Is a high level Learner class a good idea?

This class would behave similarly to fastai's Learner, but would differ from the Lightning workflow.

Maybe we can think of it as the high-level API for training models, while Lightning would be the mid-level one.

Trainer.fit more than once

We need the workflow to be able to do something like this:

model.freeze_to(-1)
trainer.fit(...)
model.freeze_to(0)
trainer.fit(...)

Trainer already correctly resumes training, but we need to reset the lr_scheduler.

Models and their configurations

🚀 Feature

Use PyTorch Lightning for defining and configuring the models.

  • It would also give us a standard API: define the model and use Lightning as the trainer, so the user doesn't need to edit and inherit a lot of code for backbone changes, architecture changes, num_classes, etc. They can simply edit the model (see the sketch below).
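A minimal sketch of that idea with plain PyTorch Lightning and a torchvision detection model; the class and its fields below are illustrative, not the library's API:

import torch
import pytorch_lightning as pl
from torchvision.models.detection import fasterrcnn_resnet50_fpn

class DetectionModule(pl.LightningModule):
    """Wraps a detection model so Lightning handles the training loop."""

    def __init__(self, num_classes, lr=1e-4):
        super().__init__()
        self.model = fasterrcnn_resnet50_fpn(num_classes=num_classes)
        self.lr = lr

    def training_step(self, batch, batch_idx):
        images, targets = batch
        # torchvision detection models return a dict of losses in train mode
        loss_dict = self.model(images, targets)
        return sum(loss_dict.values())

    def configure_optimizers(self):
        return torch.optim.Adam(self.model.parameters(), lr=self.lr)

Swapping the backbone or num_classes then only means editing the module; the Trainer call stays the same.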

Why torchvision uses FrozenBatchNorm?

What is FrozenBatchNorm?

Why is it used on models like FasterRCNN and MaskRCNN?

How does it impact fine-tuning? Because it's often a good idea to never freeze any batch norm layer while training (even if the other layers are frozen).
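For reference, torchvision ships FrozenBatchNorm2d, where the affine parameters and running statistics are plain buffers, so nothing is learned or updated during training. A small sketch of the difference (the comparison code is just illustrative):

import torch
from torchvision.ops import FrozenBatchNorm2d

bn = torch.nn.BatchNorm2d(16)   # affine params are trainable, running stats update in train mode
frozen = FrozenBatchNorm2d(16)  # weight/bias/running stats are buffers, never updated

print(any(p.requires_grad for p in bn.parameters()))  # True
print(list(frozen.parameters()))                      # [] -> nothing to train or update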

Integrate pytorch hub

Is your feature request related to a problem? Please describe.
Use models from pytorch hub

Describe the solution you'd like
Easily use models from hub, with minimal setup.
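A minimal sketch of what this builds on, calling torch.hub directly (the pytorch/vision entrypoint shown is standard; the pretrained kwarg name varies across torchvision versions):

import torch

# Downloads the weights on the first call, then loads from the local hub cache
model = torch.hub.load("pytorch/vision", "resnet50", pretrained=True)
model.eval()

with torch.no_grad():
    out = model(torch.randn(1, 3, 224, 224))
print(out.shape)  # torch.Size([1, 1000])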

Refactors show_pred

Currently show_pred is specific to RCNN models.

I think it should move out of visualize and into RCNNModel; I'm open to new ideas.

Lr schedule

Add an lr schedule to at least one example, person.ipynb.
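As an illustration of what the example could add, a one-cycle schedule in plain PyTorch; the model, optimizer and step counts below are placeholders:

import torch

model = torch.nn.Linear(10, 2)  # placeholder model
optimizer = torch.optim.SGD(model.parameters(), lr=1e-3)

epochs, steps_per_epoch = 5, 100
scheduler = torch.optim.lr_scheduler.OneCycleLR(
    optimizer, max_lr=1e-2, epochs=epochs, steps_per_epoch=steps_per_epoch
)

for epoch in range(epochs):
    for step in range(steps_per_epoch):
        optimizer.step()   # normally called after loss.backward()
        scheduler.step()   # advance the schedule once per batch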

Improve the example on the wheat dataset

📓 New example

What is the task?
Object detection

Is this example for a specific model?
FasterRCNN

Is this example for a specific dataset?
wheat


Don't remove
Main issue for examples: #39

Pickle records

Option to pickle records so we don't have to parse all the data every time.

This option should be transparent to the user; we can expose it through an optional argument passed to DataParser.

A good question is always where to store this data. Do we store it relative to the current file? In /tmp? Or in a .mantisshrimp folder in the home directory?

Storing relative to the current file is always annoying when using version control; we have to explicitly keep it out of the repo.

Example

COCOParser(data, source, use_cached=True)
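A minimal sketch of the caching idea (the parse_with_cache helper and the path choice are hypothetical, not the actual API):

import pickle
from pathlib import Path

def parse_with_cache(parser, cache_file):
    """Load parsed records from a pickle if present, otherwise parse and save them."""
    cache_file = Path(cache_file)
    if cache_file.exists():
        with cache_file.open("rb") as f:
            return pickle.load(f)
    records = parser.parse()
    cache_file.parent.mkdir(parents=True, exist_ok=True)
    with cache_file.open("wb") as f:
        pickle.dump(records, f)
    return records

# e.g. cache under the home directory rather than next to the code:
# records = parse_with_cache(COCOParser(data, source), Path.home() / ".mantisshrimp" / "records.pkl")

Caching under the home directory also sidesteps the version-control problem mentioned above.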

Learn.fit multiple times

Calling fit multiple times should start where the last call ended.

Need to take care of steps in the trainer

Validation loss feeds network twice

The modifications previously made in layers.ipynb were affecting the model performance during evaluation. We need to better understand what is happening in roi_heads.forward before modifying the method.

For now it's okay to feed the model twice to get the loss and then the predictions.

Getting the validation loss by using model.train also disregards other important effects like Dropout and BatchNorm.

dataloader method on models

It's good that each model knows how to create its own dataloader, but I don't like the fact that we need to instantiate the model to have access to the dataloader.

Previously we were using staticmethod; that got removed because we could not call super.

I think it's a good idea to bring staticmethod back, and instead of calling super we can just call the appropriate function.

Rework of Item

Currently we have an Item class that handles all use cases. This introduces a lot of complexity because we have to keep checking for Nones.

Because each model is specific to a single task, we could instead use specific items for each task. Something like:

class MaskBBoxItem(Item):
    ...

Or even specific to each model, like:

class FasterRCNNItem(Item):
    ...

If we go specific to each model, we can insert item2training_sample in the class (see the sketch below).
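A rough, self-contained sketch of the model-specific variant (the fields and method are hypothetical):

from dataclasses import dataclass, field
from typing import List

@dataclass
class FasterRCNNItem:
    """Holds only what Faster R-CNN needs, so no None-checking is required."""
    image_path: str
    labels: List[int] = field(default_factory=list)
    bboxes: List[List[float]] = field(default_factory=list)  # [xmin, ymin, xmax, ymax]

    def item2training_sample(self):
        # load the image and build the (image_tensor, target_dict) pair the model expects
        ...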

Tutorials and Examples

📒 Tutorials

Tutorials are in .ipynb format, explaining each step of the process; really detailed, not production-like.

Core

Object detection

Segmentation

Keypoints

📓 Examples

Examples are in .py format, more production oriented: ready to be run with arguments from the command line and easy to integrate with wandb sweeps and the like.

Object detection

Segmentation

Keypoints


Is there a new tutorial or example you would like to add? Comment below and we'll talk about it!

Once we agree, create a Tutorial or Example request issue (use the template) and I'll edit this post with your new cool example!

Remove fastcore dependencies

The main question is: do we want to keep utils functions like L, lmap, ifnotnone and the like?

While these functions are really helpful to those who are already used to them, they raise the barrier for new contributors.

Implement layer groups

For fine-tuning, differential learning rates.

It would be good to have something like fastai's freeze_to (see the sketch below).
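A minimal sketch of both pieces in plain PyTorch, layer groups with differential learning rates plus a fastai-style freeze_to; the grouping and the helper below are hypothetical:

import torch
from torchvision.models import resnet18

model = resnet18()

# Layer groups: early backbone, late backbone, head (the stem is omitted for brevity)
groups = [
    list(model.layer1.parameters()) + list(model.layer2.parameters()),
    list(model.layer3.parameters()) + list(model.layer4.parameters()),
    list(model.fc.parameters()),
]

# Differential learning rates: lower lr for earlier groups
optimizer = torch.optim.SGD(
    [{"params": params, "lr": lr} for params, lr in zip(groups, [1e-4, 1e-3, 1e-2])]
)

def freeze_to(group_idx):
    """Freeze every group before group_idx (negative indices count from the end)."""
    cut = group_idx % len(groups)
    for i, params in enumerate(groups):
        for p in params:
            p.requires_grad = i >= cut

freeze_to(-1)  # train only the head
freeze_to(0)   # unfreeze everything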

Integrate models from pytorch hub

🚀 Feature

Is your feature request related to a problem? Please describe.
Use models from pytorch hub

Describe the solution you'd like
Use models available on hub with minimal setup

Rework of dataloader

It might be a good idea to embed the dataloader inside the model, because each dataloader is specific to a model anyway; this would also more closely follow the Lightning guidelines.

class MantisRCNN(MantisModule):
    @staticmethod
    def dataloader(dataset, **dataloader_kwargs):
        # Build the model-specific DataLoader (collate_fn, etc.) from `dataset`
        # using the regular PyTorch DataLoader kwargs, then return it
        dataloader = ...
        return dataloader

It would also be logical to bring item2training_sample inside the model

class MantisRCNN(MantisModule):
    @staticmethod
    def item2training_sample(item):
        # convert item to training sample
        ...

Images with no annotations

The torchvision model will throw an error if images with no annotations are passed to it. I think we need to preemptively remove these images (see the sketch below).

Note that a transform (like a random crop or zoom) can remove all the items from an image, potentially leaving it with no objects.

Related to this and this
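A minimal sketch of the preemptive filter (the dict-style record layout is hypothetical):

def drop_empty_records(records):
    """Keep only records that still contain at least one bounding box."""
    kept = [r for r in records if len(r.get("bboxes", [])) > 0]
    dropped = len(records) - len(kept)
    if dropped:
        print(f"Dropped {dropped} images with no annotations")
    return kept

# records = drop_empty_records(records)

The same check could run again after transforms, since cropping can empty an image that originally had boxes.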

Integrate detr

🚀 Feature

DETR is an amazing new approach to object detection just launched by Facebook. Pretrained weights are available on hub (see the sketch at the end of this issue), so naturally this issue is a bit related to #38.

Describe the solution you'd like
Let's divide this task into three separate parts:

  • Model inference
  • Train from scratch: Should be easier to implement and is supported in the original code
  • Fine tuning: Not officially tested in the original code, will be a bit harder to implement.

Additional context
No other library supports this yet, let's goooo!! 🚀 🚀 🚀
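For the inference part, a sketch that leans on torch.hub and the entrypoints published in facebookresearch/detr (assumed to be available as shown):

import torch

# Load the pretrained DETR ResNet-50 model straight from the official hub config
model = torch.hub.load("facebookresearch/detr", "detr_resnet50", pretrained=True)
model.eval()

with torch.no_grad():
    outputs = model(torch.randn(1, 3, 800, 800))
print(outputs["pred_logits"].shape, outputs["pred_boxes"].shape)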
