Commit Graph

885 Commits (a37658daff841f670d557b2ec2aee09ca8feec75)

Author SHA1 Message Date
liym27 b10ecd9d3a
[inplace] Add ShareHolderWith for class Variable and SharePlaceholderWith in VarBase.detach() to share the same Tensor/SelectedRows (#29267)
5 years ago
Chen Weihang 9ad800ebb2
Support type promote for basic math ops (quantum required) (#29265)
5 years ago
Zhen Wang be3777a50a
Add pure fp16 training with master weights. (#27712)
5 years ago
chentianyu03 8f45d14263
add complex64 and complex128 type; add +-*/@ and slice opreator for c… (#29199)
5 years ago
Zhou Wei c0a991c874
accumulate gradient for leaf tensor with previous graph and expose leaf tensor concept (#28429)
5 years ago
liym27 865a45984f
Check whether there is any inplace operation affecting gradient calculation. (#27901)
5 years ago
ShenLiang e2d01eb650
Support dynamic graph distributed (#28997)
5 years ago
Leo Chen 770395cb93
Split train_mode and has_grad for tracer (#29064)
5 years ago
Zhou Wei 8ca0a8a859
fix tensor detach to zero copy (#27921)
5 years ago
Chen Weihang 768dab441e
polish two api doc detail, test=document_fix (#28971)
5 years ago
gongweibao 1dad8ceaab
Fix gpu memory allocation bug. (#28703)
5 years ago
Zhou Wei 3b0dd5f620
fix bug that to_tensor not support paddle.Place (#28717)
5 years ago
Leo Chen 3d09929b1f
Add check for non-dispensable input (#28666)
5 years ago
Zhou Wei bf6e7cba7a
updata 2.0 API english doc (#28525)
5 years ago
Wilber 1bf4836580
[Inference] Add TryShrinkMemory interface. (#28409)
5 years ago
石晓伟 c41fd033e5
check op_version_registry in CI test, test=develop (#28402)
5 years ago
Leo Chen 8b2436a776
Add broadcast_shape api (#28257)
5 years ago
石晓伟 21a63f6f90
enhance the op_version_registry, test=develop (#28347)
5 years ago
Shang Zhizhou ea851796e5
TensorRT中ernie模型推理性能优化,支持变长输入 (#28367)
5 years ago
Wilber 6f0f45f69c
copy_to_cpu support uint8 (#28372)
5 years ago
wangguanzhong 5262b02585
add generate_proposals_v2 op (#28214)
5 years ago
石晓伟 d9b5f1261c
update the version of pybind, test=develop (#28284)
5 years ago
wangguanzhong 1c385e26f9
add op_function_generator for box_coder (#28303)
5 years ago
Guanghua Yu e8f2614da5
Enhance multiclass_nms op to support LoD for dygraph mode (#28276)
5 years ago
wangxinxin08 41d26a8287
update matrix nms op to api 2.0 (#28265)
5 years ago
Zhang Ting fdc06f2158
add Fuse bn add act pass (#28196)
5 years ago
Chen Weihang 813b2ade34
Enrich the python error types of paddle & polish format (#28124)
5 years ago
Zhou Wei fb7f85291b
fix print tensor place,add cpu/cuda/pin_memory API for Tensor (#28200)
5 years ago
Wilber f935ca8a50
[lite-xpu-subgraph] Fix xpu compile and test xpu ci. (#27932)
5 years ago
chentianyu03 05fd49e974
change paddle.fluid.layers.reduce_sum to paddle.sum in sample codes (#27998)
5 years ago
tangwei12 202bfab1be
Feature/large scale kv save base/delta (#27470)
5 years ago
Zhou Wei bf412f4665
add tensor clone (#27953)
5 years ago
guofei 6bbb6e7f45
Implement the function of OutScaleForTraining/OutScaleForInference in dygraph (#26601)
5 years ago
chentianyu03 d05058d268
Remove and reorganize the alias of APIs (#27717)
5 years ago
Leo Chen 9a2a4b5f65
Support setting xpu place in dygraph mode (#27909)
5 years ago
Leo Chen 049696bf67
Refine the format of printing tensor (#27673)
5 years ago
joanna.wozna.intel ddcd1b5381
Add bfloat16 resnet50 test (#27755)
5 years ago
Wilber 9005c5a260
Lite subgraph support arm cpu. (#27827)
5 years ago
yongqiangma e8a5aefbbd
update CUDAPlace doc. test=document_fix (#27711)
5 years ago
zhupengyang 659d04df2c
hsigmoid -> hsigmoid_loss/HSigmoidLoss; refine docs (#27745)
5 years ago
石晓伟 0d27591642
save operator version infomation to program desc, test=develop (#27668)
5 years ago
joanna.wozna.intel 0cd4907eba
Add avx512 core instructions check (#27732)
5 years ago
Zhang Ting d2369dd91f
modify docs of CPUPlace and CUDAPinnedPlace, test=document_fix (#27587)
5 years ago
Chen Weihang b14ecb8632
Polish api BuildStrategy/ExecutionStrategy doc & code example (#27662)
5 years ago
lilong12 bbc2add703
Initialize gloo for low level collective apis (#27672)
5 years ago
arlesniak 0ecf441af1
Add support for mkldnn ops types selection with FLAGS in dygraph (#27482)
5 years ago
lilong12 36c0410223
Revert "Initialize gloo for low level collective apis (#27356)", test=document_fix (#27665)
5 years ago
wanghuancoder c68a0313a5
add paddle.fluid._cuda_synchronize (#27595)
5 years ago
liym27 074a71bd25
Support assignment to a Variable in dynamic mode but not deal with backward. (#27471)
5 years ago
lilong12 fa73e4a284
Initialize gloo for low level collective apis (#27356)
5 years ago
Li Fuchen 1501a80f74
add support to float64 input of warpctc op. (#27399)
5 years ago
joanna.wozna.intel b0ee1405f7
Add conv2d bfloat16 support (#27325)
5 years ago
Zhou Wei 1e1ae5c54d
Make the Bind Method of Tensor more automatic (#27270)
5 years ago
Leo Chen aba759ba16
[Feature] Enhance inplace addto strategy for gradient accumulation in static graph (#27112)
5 years ago
huangxu96 02606d45ef
Quant op dev (#25932)
5 years ago
Wilber f827665ae6
[Pass Compatible] Bind python compatible. (#27262)
5 years ago
joanna.wozna.intel 1483ea2304
Add bfloat16 passes (#26999)
5 years ago
Zhen Wang d708b21074
Update amp_check_finite_and_scale_op and add an updating_loss_scaling op for static graph amp training. (#26240)
5 years ago
wangguanzhong a28ae86e11
Enhance ops to support LoD as input for dygraph detection models. (#25316)
5 years ago
Wilber 632125415c
Refine python inference api (#26958)
5 years ago
yaoxuefeng 7f3e6ca596
add cuda generator (#26786)
5 years ago
joanna.wozna.intel 95e1434bb2
Add bfloat16 data type (#25402)
5 years ago
joanna.wozna.intel 0627a319b0
Restore "Add mkldnn bfloat16 option to C-API " (#26882)
5 years ago
石晓伟 ced6e87eee
Revert "Add mkldnn bfloat16 option to C-API (#26676)" (#26854)
5 years ago
arlesniak 885c61f086
Add use of global flag 'use_mkldnn' to layer_helper (#26497)
5 years ago
yaoxuefeng a47d92d868
fleet add save with whitelist test=develop (#23376)
5 years ago
Wilber 68e0560c2f
refine paddle inference api (#26774)
5 years ago
Leo Chen 844583c8fd
Refine paddle.manual_seed (#26496)
5 years ago
joanna.wozna.intel 02083bda40
Add mkldnn bfloat16 option to C-API (#26676)
5 years ago
Zhen Wang f9066e6a6f
Update the demo code and the doc of varbase.backward. (#26506)
5 years ago
lilong12 1c68138327
[api 2.0] add collective op for cpu using gloo and paddle.distributed.* apis (#26552)
5 years ago
Zhang Ting 0a895bc0df
improve unique op (#26537)
5 years ago
wanghuancoder c1f5df5269
optimized transformation form tensor to numpy (#26447)
5 years ago
wanghuancoder 422a162019
api2.0 paddle.nn.Bilinear and paddle.nn.functional.bilinear (#26399)
5 years ago
wanghuancoder 6e823cfec3
add op_function_generator.exe retry in windows, test=develop (#26591)
5 years ago
wangchaochaohu ebf9b2125e
add paddle.gather for API2.0 (#26455)
5 years ago
QingshuChen 138ecf24aa
support Baidu Kunlun AI Accelerator (#25959)
5 years ago
ceci3 56890dc729
Add SyncBatchNorm (#26032)
5 years ago
Leo Chen 049ac56c08
Print user-friendly error message in core.ops [part 2] (#26377)
5 years ago
yaoxuefeng 23261ff44b
add cpu random Generator (#26013)
5 years ago
Leo Chen 672578a797
Print user-friendly error message in core.ops (#26261)
5 years ago
wangchaochaohu 0b81d76310
[API2.0] add op for cudnn version query test=develop (#26180)
5 years ago
wangchaochaohu bb11cbc250
[API2.0] add Device api (set_device and get_device)(#26103)
5 years ago
Zhou Wei 6de463d3d1
expose and unify the Tensor concepts to the user (#25978)
5 years ago
Zhou Wei 20147ace3f
fix_copy_if_different (#25868)
5 years ago
Leo Chen 2d95280e1f
Feature/Enable Auto-Mixed-Precision in dynamic graph (#24903)
5 years ago
Chen Weihang 838e36e9ed
Fix loaded variable suffix repeat error (#26169)
5 years ago
Jack Zhou dea41da715
add nll loss API for the paddlepaddle api2.0
5 years ago
Chen Weihang 3c8daa9b89
Add pin memory control for BufferedReader (#26026)
5 years ago
Leo Chen 751305ecf0
Add flags to control call stack of error message (#25997)
5 years ago
Thunderbrook 0cb60c700d
add heter ps mode (#25682)
5 years ago
tangwei12 caa90a6510
Integrated Trainer of Parameter Server (API add `fluid.contrib.layers.sparse_embedding` only) (#22957)
5 years ago
Zhou Wei b484a59c39
fix copy file random fail on windows (#25731)
5 years ago
Pei Yang 55b6205ddf
add set_mkldnn_cache_capacity python api(#25524)
5 years ago
Zhen Wang cea5086853
Fix the double grad bug for the star gan. (#25655)
5 years ago
石晓伟 7206417259
supports xpu runtime, test=develop (#25554)
5 years ago
Pei Yang 43f9f180e5
Add api to clear intermediate tensors in AnalysisPredictor (#25069)
5 years ago
Zhen Wang bb45af02ac
add the c++ part of Imperative QAT. test=develop (#25446)
5 years ago
ceci3 52be62c5ae
fix instance norm in dy (#24717)
5 years ago
gongweibao 80f1c50738
Fix typo in interface. (#24779)
5 years ago