[To TensorFlow Team] Feedback on TensorFlow Object Detection API - Kaggle

innat · February 17, 2022, 6:14pm

TensorFlow - Help Protect the Great Barrier Reef

The recent Kaggle competition has finished, and most of the top solutions are in PyTorch.

Below is a post from a concerned TF user. I'm bringing it here so that it reaches the right people (the TF teams) as feedback from end users.

In addition to this event, a few months ago there was a segmentation competition on Kaggle, and sadly, not a single solution or discussion used TF (AFAIK).

(PS. Some effective steps need to be taken, IMHO. I’m optimistic about Keras-CV, but I also understand that it will not be as easy as it looks.)

Bhack · February 17, 2022, 6:46pm

Thank you for sharing this. It is very interesting, especially as it comes from a very popular (and Google-owned) platform like Kaggle.

IMHO it also seems to be partially connected to our thread at:
https://tensorflow-prod.ospodiscourse.com/t/keras-cv-keras-nlp-keras-applications-models-garden/7276

And to the trends mentioned in:
https://tensorflow-prod.ospodiscourse.com/t/which-models-would-you-like-to-see-on-tensorflow-hub/111/14

/cc @thea @Joana

Bhack · February 17, 2022, 6:49pm

Additional Query

At the end of the TensorFlow - Help Protect the Great Barrier Reef competition, Kaggle staff made a concise summary of the solutions; find it HERE.

The common framework is PyTorch and the common model is YOLO-V5, which doesn’t have a published paper (AFAIK). In tensorflow/models, so far YOLO-V4 is there, with NO SUPPORT.

DISCLAIMER: this YOLO implementation is still under development. No support will be provided during the development phase.

Now that the competition has ended and the solution is YOLO-V5 with the PyTorch framework, I’m wondering how it is going to be used in Google research for the COTS project with CSIRO.

To scale up video-based surveying systems, Australia’s national science agency, CSIRO has teamed up with Google to develop innovative machine learning technology that can analyze large image datasets accurately, efficiently, and in near real-time.

Bhack · February 18, 2022, 10:28pm

We already had similar feedback:

github.com/tensorflow/models

adding Yolov5 model in tensorflow.

opened 02:02AM - 20 Oct 21 UTC

Mrinal18

type:feature models:official

# Prerequisites

Please answer the following question for yourself before submitting an issue.

- [ ] I checked to make sure that this feature has not been requested already.

## 1. The entire URL of the file you are using

https://github.com/Mrinal18/YOLOv5_tensorflow#readme

## 2. Describe the feature you request

YOLOv5 model implementation in tensorflow

## 3. Additional context

Add any other context about the feature request here.

## 4. Are you willing to contribute it? (Yes or No)

Yes

I share some ideas:

- More reference implementations of third-party papers in TF. We need to attract third-party paper authors.
- Find a way to expose in TF the many “non third party” research works that are currently done in JAX. I like framework diversity/competition, but this is one more barrier to having these works available in TF.
- A clear collection of reusable model components, as a library, to incentivize and speed up community contributions without reinventing the wheel in multiple repositories.
- Scale community contribution by promoting long-term, stable contributors to code ownership and subcomponent reviews.
- Incentivize the collection of TF datasets from dataset paper authors and from Kaggle competitions, so that we don’t need to write tedious data feeding/processing scripts every time.
- More fine-tunable TF Hub models.
- Extra: GitHub Actions jobs on GKE, or on any other Google Cloud resource, to run training jobs for community model contributions approved by maintainers.

innat · February 25, 2022, 10:44am

Very thorough.

Bhack · February 25, 2022, 11:16am

There are some interesting (but known) points in this report.

But I found that some important stats are missing.
As they are both OSS projects/ecosystems, how much are external contributors (not Meta/Google) contributing to these repositories?

I think that, in the long run, diversity and inclusivity in contributions could really help the ecosystem's sustainability, vibrancy, and health. It could also help to minimize bias about where to invest and how to schedule the always “not infinite” resources.

Building this is much harder than just releasing a set of libraries.

lgusm · February 25, 2022, 12:39pm

Thanks @innat for the feedback!
This is very important and I agree that it can be improved!

innat · May 18, 2022, 7:35am

I’ve watched a demonstration promo video from Google regarding the Great Barrier Reef, link below.

In that video, Megha Malpani, a product manager at Google AI/ML, talks about the Kaggle competition for this project. She also states that the TensorFlow 2 Model Garden library was the foundation of their codebase (video timestamp: 2.35)!

Now, this really shocked me. It was the YOLO-V5 model written in PyTorch that produced the high-performing detection results. At the beginning of the competition, a starter notebook with TensorFlow Model Garden was provided, but not only did it give low performance, people also quickly abandoned it.

The promo video also states (video timestamp: 2.18) that they used the Kaggle competition results to glean insights into what did and didn’t work for a particular task. The fun fact is that the results from the Kaggle competition are only YOLO-V5, which is currently dominating several object detection competitions on Kaggle.

I am wondering how the TensorFlow team will reform their Model Garden codebase to solve this task for CSIRO. As far as I know, the YOLO-V5 model won’t be included in Model Garden anytime soon. Or what?

update

An example on the way.

github.com/tensorflow/models

COTS: Add tutorial sections

tensorflow:master ← MarkDaoust:COTS

opened 05:58PM - 16 May 22 UTC

MarkDaoust

+1066 -436

# Add tutorial sections to the COTS notebook.

Add more descriptive text and example code explaining how this works. Break up the code into smaller chunks.

Mainly:

* Add a walkthrough of how to decode the detection results.
* Show the detection results on a pair of images to motivate the "Tracker" code.
* Upgrade the optical flow code to run on a batches of detections.
* Demonstrate the optical flow aspect before getting into the tracker.

Here is a [copy with the outputs rendered](https://colab.sandbox.google.com/drive/1I2TDSUJxXl_1BmI8OaZsHKhNP_E2iZ2k?resourcekey=0-qJjR9zYIvSFYhKOtb87d1A#scrollTo=GUQ1x137ysLD)

## Change type

- [x] Documentation update

## Tests

Spot checked the final output video, Detection IDs and frame numbers all seem identical.

## Checklist

- [x] I have signed the [Contributor License Agreement](https://github.com/tensorflow/models/wiki/Contributor-License-Agreements).
- [x] I have read [guidelines for pull request](https://github.com/tensorflow/models/wiki/Submitting-a-pull-request).
- [x] My code follows the [coding guidelines](https://github.com/tensorflow/models/wiki/Coding-guidelines).
- [x] I have performed a self [code review](https://github.com/tensorflow/models/wiki/Code-review) of my own code.
- [x] I have commented my code, particularly in hard-to-understand areas.
- [x] I have made corresponding changes to the documentation.
- [x] My changes generate no new warnings.
- [x] I have added tests that prove my fix is effective or that my feature works.