51 lines
1.2 KiB
Markdown
51 lines
1.2 KiB
Markdown
# UPDATES LOG
|
|
|
|
## 2022.11.02
|
|
Add "ADD" OPS, now support resnet/mbnet v2 experimental~
|
|
|
|
## 2022.08.30
|
|
take time to optimize O0&O1. now new O0 almost as fast as old O1~
|
|
fast RVV fp16 model in 97ms now!!!
|
|
|
|
## 2022.08.29
|
|
too busy recently... Slightly optimize for O1.
|
|
prepare add kws example.
|
|
|
|
## 2022.08.22
|
|
Add auto test script.
|
|
Add experimental FP8 1.4.3/1.5.2 model, run simple mnist model successed.
|
|
Rename arch_O0 to arch_cpu, Move O1,O2 to TM_OPT_LEVEL setting, start write tm_layer_O1.c
|
|
|
|
## 2022.08.17
|
|
Add Support for RV64V's FP16 model, use RVV1.0 inntrinsic
|
|
Optimize reduce sum with RVV's vfredosum_vs_f32m1_f32m1
|
|
speed up TM_ARCH_RV64V infer mbnet v1 0.25:
|
|
|model type|before|after|speedup|
|
|
|--|--|--|--|
|
|
|INT8| 212ms| 178ms|19%|
|
|
|FP32| 190ms| 182ms|4%|
|
|
|FP16| 178ms| 161ms|11%|
|
|
|
|
tested infer time duty: sbuf:dot:sum ~ 9:5:2
|
|
need opt sbuf time
|
|
|
|
## 2022.08.08
|
|
First commit to github, support INT8/FP32 model,
|
|
support arch:
|
|
```
|
|
arch_O0,
|
|
arch_ARM_SIMD, arch_ARM_NEON,arch_ARM_MVEI,
|
|
arch_RV32P, arch_RV64V
|
|
```
|
|
support OPS:
|
|
```
|
|
Conv2D,DWConv2D,(with relu/relu6,stride)
|
|
FC,GAP,SOFTMAX
|
|
```
|
|
support demos:
|
|
```
|
|
layer_test
|
|
mnist
|
|
mnist port for Arduino ATmega328
|
|
mbnet(mobilenet v1)
|
|
``` |