[New Model] RWKV by JanFidor · Pull Request #1902 · unit8co/darts

JanFidor · 2023-07-17T19:53:35Z

Fixes #1817.

Quick summary

For now, the implementation closely follows what was described in the paper. The implementation in the official RWKV repo has quite a few improvements that weren't discussed in the paper, but I first wanted to get at least a workable model.

Roadmap

There are still a lot of things to be done, but I wanted to put up a PR as a quick update on how everything's going and a simple roadmap for the future.

codecov-commenter · 2023-07-17T20:29:36Z

Codecov Report

❌ Patch coverage is 23.78049% with 125 lines in your changes missing coverage. Please review.
✅ Project coverage is 92.97%. Comparing base (a5560cc) to head (7a1f0ee).
⚠️ Report is 379 commits behind head on master.

Files with missing lines	Patch %	Lines
darts/models/forecasting/rwkv_model.py	23.31%	125 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##           master    #1902      +/-   ##
==========================================
- Coverage   93.95%   92.97%   -0.98%     
==========================================
  Files         125      126       +1     
  Lines       11773    11923     +150     
==========================================
+ Hits        11061    11086      +25     
- Misses        712      837     +125


dennisbader · 2023-07-21T14:57:40Z

Hi @JanFidor, and thanks for this PR. Just to let you know that we're wrapping up the last few things for the release in 1-2 weeks. Once that's done we'll come back to this and review 🚀

gdevos010 · 2023-08-29T16:22:25Z

@JanFidor Were you able to benchmark this model?

JanFidor · 2023-08-30T21:45:49Z

@gdevos010 just some basic ones; I still have to play around with parameter initializations. On SunspotsDataset I noticed that NLinear and Transformer showed noticeable MAPE changes depending on output_chunk_length (swinging between roughly 60 and 200), while RWKV consistently performed around 100. I also threw in the ETTh1 dataset, with input_chunk_length 720 and output_chunk_length 336. RWKV had terrible MAPE there. Not sure if the architecture was at fault or if it was caused by underfitting. I'll try to make a more comprehensive benchmark next week.
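
For readers unfamiliar with the scale of the numbers above: MAPE is a percentage, so a value of 100 means the forecast is off by 100% of the actual value on average. The snippet below is a minimal pure-Python sketch of the metric, not the darts implementation. The function name `mape` and the sample values are illustrative assumptions and are not taken from the benchmark in this thread.

```python
def mape(actual, predicted):
    """Mean absolute percentage error, returned in percent."""
    if len(actual) != len(predicted):
        raise ValueError("series must have the same length")
    if any(a == 0 for a in actual):
        # Division by the actual value makes MAPE undefined at zero.
        raise ValueError("MAPE is undefined when an actual value is zero")
    total = sum(abs((a - p) / a) for a, p in zip(actual, predicted))
    return 100.0 * total / len(actual)


# Made-up values for illustration only (not from the benchmark above).
actual = [10.0, 20.0, 40.0]

# Every point is off by 10% of its actual value.
print(mape(actual, [11.0, 18.0, 44.0]))

# An all-zero forecast is off by 100% of every actual value.
print(mape(actual, [0.0, 0.0, 0.0]))
```

Because every error is divided by the actual value, scores depend heavily on how close the actual values are to zero. Comparing models on a second metric alongside MAPE may make a benchmark like the one described easier to interpret.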

add rwkv model implementation

68bf5bb

JanFidor requested a review from dennisbader as a code owner July 17, 2023 19:53

Merge branch 'master' into feature/rwkv-model

7a1f0ee

JanFidor mentioned this pull request Jul 20, 2023

Refactor Transformer model #601

Open

JanFidor wants to merge 2 commits into unit8co:master from JanFidor:feature/rwkv-model
