Commit Graph

14980 Commits (cbdb8a17b1e7ab27fd7429c4a94db898d9a05b24)

Author SHA1 Message Date
dongdaxiang 433301fbc2 remove glog in shell.h
7 years ago
dongdaxiang 9e51ad4a65 fix io and fs compile on mac
7 years ago
dongdaxiang 6eca88ac76 fix io and fs compile on mac
7 years ago
dongdaxiang 2708108a08 fix fleet_wrapper compile on windows
7 years ago
dongdaxiang 4ce35815fb fix windows GLOG problem
7 years ago
dongdaxiang e3107a6ae0 fix windows compile problem
7 years ago
dongdaxiang 398004ece0 disable sys/wait.h to fix windows compile problem, include scope in lodtensor_printer
7 years ago
dongdaxiang d4514949bf remove local random engine in fleet with rand_r()
7 years ago
dongdaxiang e82969eeb0 remove getdelim in windows
7 years ago
dongdaxiang 45eb6f0765 run pre-commit check files and fix code style problem
7 years ago
dongdaxiang d87ba58c14 refine document of python API, make device_worker and trainer's API private
7 years ago
dongdaxiang 5687f234bf fix trainer_desc.proto error
7 years ago
dongdaxiang b95b80bc76 add doc string for executor and update API.spec
7 years ago
dongdaxiang 6be9f719e2 make string_helper dependency work
7 years ago
xjqbest e95cafd9a7 fix code style & add dataset testcase
7 years ago
dongdaxiang 39362a8415 move root_scope->DropKids() into Finalize() so that we do not have to drop all the kids
7 years ago
dongdaxiang ba15d6b164 move root_scope->DropKids() into Finalize() so that we do not have to drop all the kids
7 years ago
xjqbest be74de2c61 fix code style & fix register bug & add release_memory
7 years ago
dongdaxiang a0b59773af fix code style
7 years ago
dongdaxiang f39b323ed7 remove trainer_library in CMakeLists
7 years ago
dongdaxiang 365be5d559 support win32 flag in io.cc shell.cc, fix code style problem in fleet_wrapper, fix lodtensor_printer_test problem
7 years ago
dongdaxiang dc8cf36e4b add more example on datagenerator
7 years ago
dongdaxiang 6bf796df14 refine print fetch list
7 years ago
xjqbest 589467f24c fix bug
7 years ago
xjqbest b7940c2918 fix bug of gen_worker_desc and set_filelist, add some doc
7 years ago
dongdaxiang 68d7bf3de5 add fetch var function
7 years ago
xjqbest a34fe6248f add some doc
7 years ago
xujiaqi01 f5c6a14b54 fix runtime error
7 years ago
xujiaqi01 a5b1a0e12b support multi dataset && add init model && fix bug
7 years ago
dongdaxiang 3c65cc1bbd add document for role_maker and fleet parameter, data_generator
7 years ago
dongdaxiang f6c9232a3d fix dataset float32 type problem
7 years ago
dongdaxiang 73b1f396d7 add data_generator into paddle.fluid.incubate.data_generator, add op run log in hogwild_device_worker and downpour_device_worker
7 years ago
dongdaxiang 73544e8b8d add training speed log
7 years ago
dongdaxiang 9419de521f add IO percent for multi_trainer
7 years ago
dongdaxiang 6af697adb0 add trainfileswithprofiler for downpour worker
7 years ago
dongdaxiang 2644b88685 add comment for MPI Symetric role maker
7 years ago
dongdaxiang cf45c54340 add distributed optimizer factory
7 years ago
dongdaxiang b7a202aa38 add distributed optimizer factory
7 years ago
xujiaqi01 70a5d4f797 fix error
7 years ago
xujiaqi01 d25389fefd add some log && fix error
7 years ago
dongdaxiang f612877797 add incubate for unified API
7 years ago
dongdaxiang 317eb0aad3 add incubate for unified API
7 years ago
xujiaqi01 39449ba0b9 fix bug && add DestroyReaders in trainer
7 years ago
dongdaxiang e657c127a8 hide opt_info in distirbuted optimizer
7 years ago
xujiaqi01 ecfc7df913 add dataset factory && fix style
7 years ago
dongdaxiang 328f11b8b6 refactor downpour optimization
7 years ago
xujiaqi01 3cea00bd52 store memory data in Dataset && fix bug
7 years ago
dongdaxiang ff87698a44 refactor downpour optimization
7 years ago
dongdaxiang b66f0074b6 fix data reading bugs in api, add VLOG(3) log for setup
7 years ago
dongdaxiang b415ec27e8 make Dataset* as an argument
7 years ago
xjqbest dd67ad08a2 modify c++ and python dataset related code & fix bug
7 years ago
dongdaxiang cc4def6ba5 fix some conflict for compilation
7 years ago
heqiaozhi 9bca1926c1 refactor & fix bug
7 years ago
xjqbest 2e9a836c6f add DataSet and InMemoryDataFeed, support load data into memory and shuffle data
7 years ago
dongdaxiang 2486389793 add RunFromDataset in executor
7 years ago
dongdaxiang e36bbcc871 fix some typo and CMakefile.txt
7 years ago
xjqbest 824b84d185 add DataSet and InMemoryDataFeed, support load data into memory and shuffle data
7 years ago
dongdaxiang 08c25995a2 add run from dataset in executor.
7 years ago
dongdaxiang c28bbdf8ba add dataset_generator.py
7 years ago
dongdaxiang be757096da add pybind for fleet
7 years ago
dongdaxiang 687cb79dbb add pipe command io interface
7 years ago
dongdaxiang 1fe54416c9 move fs.cc and shell.cc into paddle/fluid/framework/io
7 years ago
dongdaxiang 53fbab5d33 add fs_local_open example
7 years ago
dongdaxiang afaf937010 add fs_local_open example
7 years ago
dongdaxiang cf1360643f add printer for fetch variable
7 years ago
dongdaxiang d65cb13ad5 add pslib flag on fleet_wrapper CMakefile
7 years ago
dongdaxiang 6de9ebc65c refine VLOG in fleet_wrapper.h
7 years ago
dongdaxiang 97d5cd30f0 make pull dense worker work
7 years ago
dongdaxiang 39014b9f9f fix class register problem
7 years ago
dongdaxiang f0dd1201cc fix destructor problem
7 years ago
dongdaxiang f2bde9c241 fix destructor problem
7 years ago
dongdaxiang 54f047a126 fix ngraph compile option
7 years ago
dongdaxiang dd1dc9bcf0 add common.h.in back
7 years ago
dongdaxiang 378037c535 make s_instance_ private to ensure singleton
7 years ago
dongdaxiang a446d26e8a add todo for asynce executor
7 years ago
dongdaxiang c165012031 refine device_worker and trainer code
7 years ago
dongdaxiang 8a335b50be add downpour device_worker pb configuration
7 years ago
dongdaxiang 24a8001142 make -DWITH_PSLIB=ON compilable
7 years ago
dongdaxiang 67b1d6d721 add dist_multi_trainer for distributed training, add trainer_factory and device_worker_factory so that we can easily extend new training mode, add pull dense worker which is a singleton for parameter fetching
7 years ago
dongdaxiang 855bf579d2 add dist_multi_trainer for distributed training, add trainer_factory and device_worker_factory so that we can easily extend new training mode, add pull dense worker which is a singleton for parameter fetching
7 years ago
lujun d4f63d82ac Merge pull request #16475 from junjun315/fix-doc-multiplex
7 years ago
Qiao Longfei d8974e6da0 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor-communicator
7 years ago
Shixiaowei02 bddb2cd315 resolve conflicts with the develop branch test=develop
7 years ago
lidanqing 0d656996bf fix some bugs of unzip and reading val list
7 years ago
chengduo 1096746cbf Fuse Adam And SGD ops (#15933)
7 years ago
Jacek Czaja 2632327429 [MKL-DNN] Tensor modifications revert (#16462)
7 years ago
Zeng Jinle 4143a1c216 Merge pull request #16491 from sneaxiy/feature/advance_gc
7 years ago
chengduo 2265d091e6 Fix threaded executor bug (#16508)
7 years ago
sneaxiy 2c836ff914 check default grad maker
7 years ago
nhzlx d065b5bf2b Anakin ssd support
7 years ago
zhoukunsheng beb4a86d13 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rank
7 years ago
zhoukunsheng b06e9b773d test=develop
7 years ago
Zeng Jinle 69cb9792ea Merge pull request #16506 from sneaxiy/revert-16424-fix_allocator_bug
7 years ago
lidanqing b46e467abc add wget and unzip part and change data_dir
7 years ago
zhoukunsheng 2f9e562100 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into linspace
7 years ago
dengkaipeng 2078f4207f fix API.spec. test=develop
7 years ago
zhoukunsheng 082822d417 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rsqrt
7 years ago
zhoukunsheng c47f3cc7fe test=develop
7 years ago
lidanqing 894aa9b235 change script file name and data_dir location
7 years ago
lidanqing 57f51e5b08 preprocess with PIL the full val dataset and save binary
7 years ago
dengkaipeng 8160a66193 fix doc priority. test=develop
7 years ago
chengduo ed61d67c73 Fix the interface of Pass::Apply (#16484)
7 years ago
dengkaipeng 193185b840 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into shift
7 years ago
Zeng Jinle 5f1c92a81c Merge pull request #16450 from zhhsplendid/del-redundant-op-var-reg
7 years ago
Zeng Jinle 2aa18e2bda Merge pull request #16496 from sneaxiy/fix_gc_bug
7 years ago
Sylwester Fraczek 8ece7a9708 fixed url to dataset
7 years ago
sneaxiy 5656fa9f7c fix travis ci
7 years ago
Zeng Jinle 174d0d0b90 Revert "Fix allocator bug"
7 years ago
gongweibao eb83abeac3 Add DGC(Deep Gradient Compression) interface. (#15841)
7 years ago
Qiao Longfei 34890fd3b1 fix gpu build for lookup_table_op test=develop
7 years ago
Sylwester Fraczek fe21578a44 create test for quantized resnet50
7 years ago
Michał Gallus 2d8b7b3a76 Refine default MKL-DNN Pass order (#16490)
7 years ago
Wojciech Uss 09dfc7a2aa C-API quantization core 2 (#16396)
7 years ago
Jiabin Yang e41d581304 test=develop, fix space_to_depth_doc (#16293)
7 years ago
sneaxiy 4c8254e3bf revert some loop op revision
7 years ago
Zeng Jinle 644e8af4cf Merge pull request #16424 from sneaxiy/fix_allocator_bug
7 years ago
sneaxiy c4c6205268 fix gc bug
7 years ago
zhoukunsheng 874b5d8362 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into linspace
7 years ago
zhoukunsheng 83c7bca13f Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into all_any
7 years ago
zhoukunsheng a55111b869 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into zeros_like
7 years ago
zhoukunsheng 848ec97ab3 test=develop
7 years ago
sneaxiy 16f0994728 Merge develop
7 years ago
sneaxiy 63651c1968 fix grad desc maker
7 years ago
Yihua Xu 57dc3c1943 Disable compare for Issue#16316 (#16466)
7 years ago
Zeng Jinle c7c6eeb44e Merge pull request #16409 from sneaxiy/feature/advance_gc
7 years ago
Qiao Longfei 33be014535 fix distribute compile problem test=develop
7 years ago
Jiabin Yang 54a73578a8 Feature/install check (#16044)
7 years ago
Qiao Longfei b542639dc0 code clean test=develop
7 years ago
wopeizl c300b1ba69 Tensor index (#16223)
7 years ago
Jiabin Yang 0d9d25d40f Feature/refactor layers to Layers (#16337)
7 years ago
dengkaipeng 3e352388eb fix format. test=develop
7 years ago
dengkaipeng eb2123e12d fix doc and jit. test=develop
7 years ago
liuwei1031 8d22bc17a4 Memory optimize (#16410)
7 years ago
Xin Pan f8c279b11c Merge pull request #16454 from panyx0718/imperative2
7 years ago
Zhaolong Xing fa1796a30a Merge pull request #16330 from NHZlX/merge_anakin_branch_to_dev
7 years ago
lujun 3f8b2f5ff5 fix multiplex doc, test=develop
7 years ago
sneaxiy a0f4fefb60 delete source file no_need_buffer_vars_inference.cc
7 years ago
Qiao Longfei 392e97aae5 fix cpplint test=develop
7 years ago
Qiao Longfei 37f6b9ab7a fix build test=develop
7 years ago
tensor-tang 1eff834e97 update jitkernel doc (#16327)
7 years ago
Qiao Longfei 30618409db Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor-communicator
7 years ago
Yiqun Liu 98802e1f75 Optimize the implementation of while_op again, for cases when is_test is true. (#16359)
7 years ago
lujun c34b24ede7 Merge pull request #16425 from junjun315/checkpoint-hotfix
7 years ago
Wu Yi 9ffd5eecef test fix fetch bar place for ce (#16406)
7 years ago
sneaxiy 318072c26b add comments of allocator design
7 years ago
chengduo 4f2278f032 Add doc for CPUPlace CUDAPlace CUDAPinPlace (#16442)
7 years ago
dengkaipeng 1ef30c230d fix API.spec. test=develop
7 years ago
nhzlx 953bdde058 Merge branch 'develop' of https://github.com/paddlepaddle/paddle into HEAD
7 years ago
Tao Luo e0a3a49096 Merge pull request #16438 from wojtuss/wojtuss/move-cpu-quantize-passes
7 years ago
gongweibao ec6519e806 Fix allreducedep bug (#16443)
7 years ago
sneaxiy 78fb3a62e0 fix env variable settting bug
7 years ago
Qiao Longfei b65adf7f65 add communicator_send_wait_times
7 years ago
sneaxiy 2d92b6be98 merge develop
7 years ago
Jiabin Yang f735102eab add layer norm to Layers, add transformer test in imperative mode (#16092)
7 years ago
Xin Pan fd24ab47ab polish
7 years ago
Xin Pan 1f89249a95 update DeepCF model
7 years ago
sneaxiy a7d0ac50b8 Merge develop
7 years ago
sneaxiy 7000ec85d9 fix some op grad maker
7 years ago
nhzlx 45b3766fdf fix comments
7 years ago
zhaoyuchen2018 cdb315e9d8 Merge branch 'develop' into docrefine
7 years ago
zhhsplendid 3909108cae Add SpectralNormGradOpDescMaker
7 years ago
dengkaipeng ceb31d30f0 fix formax. test=develop
7 years ago
dengkaipeng cfef382a85 fix format. test=develop
7 years ago
Zeng Jinle 4cc9809cae Merge pull request #15799 from sneaxiy/feature/decoupled_reader
7 years ago
whs e9bec9369b [slim] Add quantization strategy and distillation strategy. (#16408)
7 years ago
dengkaipeng d54005a7f4 fix unittest. test=develop
7 years ago
liuwei1031 de3b70a101 fix cdn issue, test=develop (#16423)
7 years ago
dengkaipeng 90bd038d35 fix format. test=develop
7 years ago
Qiao Longfei 63acbe7a65 fix bug
7 years ago
zhoukunsheng d3d31a5894 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into all_any
7 years ago
zhoukunsheng 664c342ca0 test=develop
7 years ago
lujun bc4d1c7246 fix mix input type error, test=develop
7 years ago
Qiao Longfei 0ff1e64fab fix a bug
7 years ago
zhoukunsheng 43060084a4 test=develop
7 years ago
Qiao Longfei 0997cf8f65 add more check
7 years ago
sneaxiy f8ed2c229e try to fix ci error
7 years ago
zhoukunsheng 8e9ebebcef test=develop
7 years ago
lujun 18aa59493e fix mix input type error, test=develop
7 years ago
Wojciech Uss 46677fb080 Move cpu_quantize_* passes into mkldnn subfolder
7 years ago
dengkaipeng cfda1fdea7 add attr scale. test=develop
7 years ago
sneaxiy c20db6357b split PR
7 years ago
Zeng Jinle c64d959343 Merge pull request #16295 from zhhsplendid/zhenghuihuang-dev-2
7 years ago
lujun 1b6a2a09e8 fix mix input type error, test=develop
7 years ago
nhzlx a1d11bb175 fix ci bug: cudnn handler in multi card
7 years ago
Qiao Longfei 93464b25ac update async_sparse_param_update_recorder
7 years ago
Qiao Longfei 542b52fac3 fix trainer_id
7 years ago
Qiao Longfei be0c482304 update trainer_id
7 years ago
sneaxiy 2f54d9f995 Merge develop
7 years ago
Qiao Longfei c60f312d1b add trick
7 years ago
Qiao Longfei 103c9bb376 update rpc_client
7 years ago
sneaxiy c75a880386 fix windows bug
7 years ago
sneaxiy 072d95d8f6 Merge develop
7 years ago
sneaxiy a93a9eef8f add op registry type
7 years ago
dengkaipeng f45aced59b add jit test. develop=test
7 years ago
Qiao Longfei b7661d7e56 add some log
7 years ago
Qiao Longfei e8fe5186a1 complete parameter_recv
7 years ago
Qiao Longfei d5c7898201 complete pserver side update
7 years ago
Qiao Longfei de65398cb8 update transpiler and listen and serv op
7 years ago
whs 2e5831f0dc [slim] Refine framework of slim and add filter pruning strategy (#16226)
7 years ago
whs 18779b5b8f [Operator] Add range op. (#15431)
7 years ago
Qiao Longfei 25e2b41729 add AsyncSparseParamUpdateRecorder test
7 years ago
Qiao Longfei c6e82785aa init async_sparse_param_update_recorder
7 years ago
phlrain 7dc4a7f4f8 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_var_name_in_opt_2
7 years ago
Zhen Wang ec11135d54 Merge pull request #16341 from wzzju/add_channel_wise_in_quant_pass
7 years ago
xiaolil1 e235882c18 Enable MKL-DNN INT8 Concat Kernel. (#16156)
7 years ago
Qiyang Min 171df5b56b Merge pull request #16303 from junjun315/checkpoint
7 years ago
Hongyu Liu e3bca9011c Merge pull request #16357 from phlrain/fix_concat_check
7 years ago
Hongyu Liu e5478ab5c8 Merge pull request #16346 from phlrain/add_floordiv_and_mod
7 years ago
chengduo a6a3b2fbbc [Speed]Refine ParallelExecutor (#16190)
7 years ago
nhzlx 3df7b98a0f Merge branch 'develop' of https://github.com/paddlepaddle/paddle into HEAD
7 years ago
nhzlx f3a2e4b3d8 1. Add ANAKIN_ROOT compile option
7 years ago
phlrain 77a08750e9 add var name in optimizer; test=develop
7 years ago
chengduo 33965527fd Add unit test for fuse all reduce (#16354)
7 years ago
Hongyu Liu 18a0f6d97a Merge pull request #16351 from phlrain/fix_topk_shape_check
7 years ago
Hongyu Liu 15444430b0 Merge pull request #16348 from phlrain/fix_squeeze_check
7 years ago
phlrain 5dc9b51994 fix time; test=develop
7 years ago
phlrain 686b8935fe Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_floordiv_and_mod
7 years ago
phlrain 18d107c27a add floordiv and mod op; test=develop
7 years ago
phlrain ff112813de Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_concat_check
7 years ago
phlrain 8274d9d733 fix concat shape check; test=develop
7 years ago
Hongyu Liu 0d779f15f6 Merge pull request #16261 from phlrain/fix_sequence_pad_2
7 years ago
Hongyu Liu 8c81d9949e Merge pull request #16347 from phlrain/fix_matmul_check
7 years ago
phlrain 4e3b5e727d Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_concat_check
7 years ago
sneaxiy 953214ad97 add more unittest
7 years ago
Kaipeng Deng 5670530ca7 Revert "not use PERSISTENT in batch_norm. test=develop" (#16333)
7 years ago
qingqing01 d2b938ef5a Refine gradient proto maker and python API for affine_channel_op (#16340)
7 years ago
phlrain b3d0336603 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_concat_check
7 years ago
phlrain 249546bf45 fix concat shape check; test=develop
7 years ago
Tao Luo 294cdf6f48 Merge pull request #16177 from fc500110/remove_visualizer
7 years ago
Hongyu Liu b3b7e352ee Merge pull request #16259 from phlrain/fix_lod_reset
7 years ago
phlrain 3b46680913 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_squeeze_check
7 years ago
phlrain 6a5545aa22 fix squeeze shape check; test=develop
7 years ago
luotao1 056599a738 add expected_kernel_cache_pass
7 years ago
phlrain d96f498bcf Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_topk_shape_check
7 years ago
chenjiawen a859bfc31d test=develop
7 years ago
phlrain e7fb344c99 fix squeeze op shape check; test=develop
7 years ago
phlrain 190cfd692c fix squeeze shape check; test=develop
7 years ago
chenjiawen be6e78b7c4 test=develop;add approval RD check api
7 years ago
phlrain 0e40298949 fix matmul shape check; test=develop
7 years ago
phlrain 56c2d384c7 add elementwise floordiv, mod; test=develop
7 years ago
Wu Yi b7baeed7bb fix win gpu build test=develop (#16334)
7 years ago
dengkaipeng 51536f7f52 StrideASum. test=develop
7 years ago
liuwei1031 df5d19aa9d temoprarily disable the code of use kCUDNN, test=develop (#16205)
7 years ago
ruri 09e05a110b Merge pull request #16217 from ceci3/doc
7 years ago
zhhsplendid 124f1df481 Add flags for init and re-alloc gpu
7 years ago
Zhen Wang 8965819fbb rewrite the cuda kernels of channel_wise_quant_op and channe_wise_dequant_op. test=develop
7 years ago
Wojciech Uss cbe2dbf0db Add enabling quantization (#16326)
7 years ago
lujun 09442fb27e checkpoint pr be moved here, test=develop
7 years ago
Tao Luo 9a05859179 Merge pull request #16322 from wojtuss/wojtuss/fix_cpu_quantize_pass
7 years ago
qingqing01 8caa785e83 Enhance affine_channel_op infer-shape check (#16317)
7 years ago
flame 08838f3909 Fix save inference model bug (#16242)
7 years ago
Kaipeng Deng 957ea995fc Merge pull request #16243 from heavengate/batch_norm_not_persistent
7 years ago
nhzlx 4f4daa4b66 cherry-pick from feature/anakin-engine: add data type for zero copy #16313
7 years ago
nhzlx 07dcf2856c git cherry-pick from feature/anakin-engine: update anakin subgraph #16278
7 years ago
nhzlx c407dfa3cb cherry-pick from feature/anakin-engine: refine paddle-anakin to new interface. #16276
7 years ago
nhzlx a25331bc26 cherry-pick from feature/anakin-engine: deal the changing shape when using anakin #16189
7 years ago
nhzlx c79f06d3d8 cherry-pick from feature/anakin-engine: add batch interface for pd-anakin #16178
7 years ago
nhzlx 69d37f81d7 cherry-pick from feature/anakin-engine: refine anakin subgraph. #16157
7 years ago
nhzlx a1d200a5de cherry-pick from feature/anakin-engine: Anakin support facebox #16111
7 years ago
flame a32d420043 cherry-pick from feature/anakin-engine: batch norm (#16110)
7 years ago
flame 0945b97f07 cherry-pick feature/anakin-engine: add anakin softmax/transpose/batch_norm/flatten/reshape op (#16020)
7 years ago
nhzlx b21770a2aa cherry-pick from feature/anakin-engine: Add subgraph fuse support and anakin engine #16018
7 years ago
luotao1 bfdab00e5b Merge branch 'develop' into core_opt_choose_kernel
7 years ago
nhzlx 084310f536 paddle-anakin: concat, split, pool2d converter#16003
7 years ago
flame be523baad2 Add anakin conv2d/relu/sigmoid/tanh converter (#15997)
7 years ago
Yan Chunwei d0ce6a9044 fix anakin converter registry (#15993)
7 years ago
Tao Luo a5124ee0bb Merge pull request #16301 from luotao1/runtime_context_pass
7 years ago
lujun 622fe6a56b checkpoint pr be moved here, test=develop
7 years ago
baojun 2de263a5d9 Add softmax_with_cross_entropy_op to ngraph engine (#16304)
7 years ago
luotao1 6c6a39222b Merge branch 'core_opt_choose_kernel' of https://github.com/Xreki/Paddle into core_opt_choose_kernel
7 years ago
sneaxiy bb166a1e10 fix API.spec
7 years ago
ruri a3b8028d46 Merge pull request #16202 from shippingwang/add_sqrt_doc
7 years ago
phlrain dd080b17c3 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_sequence_pad_2
7 years ago
phlrain 1580be5d6c fix sequence pad; test=develop
7 years ago
zhoukunsheng 2c3e4c1888 test=develop
7 years ago
dengkaipeng aba2713ffc fix comment. test=develop
7 years ago
chengduo f26ba5bddd Fuse AllReduce (#15921)
7 years ago
Zeng Jinle d0ef682552 Merge pull request #16274 from sneaxiy/fix_grad_maker
7 years ago
dengkaipeng 93701dba50 add jit kernel for softmax axis. test=develop
7 years ago
baojun 804afc51db Minor ngraph fix (#16270)
7 years ago
Tao Luo 9195c3bb03 Merge pull request #16280 from luotao1/cos_sim_infershape
7 years ago
Wojciech Uss 104a9f1e27 fix pattern maching conv2d with(out) ResidualData
7 years ago
Wu Yi 6382b62f6b Collective ops (#15572)
7 years ago
lujun bed0ecf3d2 checkpoint pr be moved here, test=develop
7 years ago
sneaxiy 023a3a3d62 fix op grad maker
7 years ago
luotao1 82af8031d9 add runtime_context_cache_pass
7 years ago
Zhen Wang ec88b6cc5a add channel wise quantization in ir pass.
7 years ago
Tao Luo b9fc80a133 Merge pull request #16287 from PaddlePaddle/revert-16002-runtime_context
7 years ago
whs 18911b6eea [enhence] Make step_input of dynamic_rnn support custom lod level. (#15972)
7 years ago
zhhsplendid 22715487dc add allocator flags
7 years ago
luotao1 c05af910bc refine cos_sim infershape
7 years ago
Hongyu Liu d3acf68044 Merge pull request #16258 from phlrain/fix_concat_1
7 years ago
Tao Luo 7d2740db83 Revert "cache runtime_context"
7 years ago
sneaxiy fd23262e0c merge develop, fix conflict
7 years ago
ceci3 c79cdf258e Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into doc
7 years ago
ceci3 9d2ccec0f2 test=develop, fix doc
7 years ago
tensor-tang ead558b7f6 Merge pull request #16256 from tensor-tang/refine/seqenum
7 years ago
ceci3 27f7a72641 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into doc
7 years ago
Qiyang Min c7f1f3ed0c Merge pull request #16214 from velconia/imperative_infer_var_type
7 years ago
Zeng Jinle f8df9eb32e fix api doc (#16201)
7 years ago
sneaxiy 3a09693f5c change API name
7 years ago
Jacek Czaja 13816dd4ac [MKL-DNN] Fix to crash of Transformer when mkldnn is to be used (#16233)
7 years ago
Yibing Liu 7e20e7691e Fix the bug in fp16 backward kernel (#16269)
7 years ago
shippingwang 97c6051822 add api.spec, test=develop
7 years ago
Wojciech Uss af03008890 Add cpu_quantize_placement_pass for C-API quantization (#16265)
7 years ago
Tao Luo dbb92ee4b1 Merge pull request #16002 from luotao1/runtime_context
7 years ago
shippingwang 4f42504eef Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_sqrt_doc
7 years ago
dengkaipeng 6c64182709 refine softmax kernel. test=develop
7 years ago
dengkaipeng 412b7cbdf1 fix format. test=develop
7 years ago
dengkaipeng 8b88960dce fix doc. test=develop
7 years ago
dengkaipeng 2ddd23dac8 fix format. test=develop
7 years ago
dengkaipeng 3e4f3434e6 fix API.spec. test=develop
7 years ago
dengkaipeng 365e6cfd15 add mkldnn support. test=develop
7 years ago
dengkaipeng 217db27337 add mkldnn support. test=develop
7 years ago
dengkaipeng 6cb66721d2 add cudnn support. test=develop
7 years ago
dengkaipeng 518325f1e7 add softmax_axis CPU kernel. test=develop
7 years ago
sneaxiy e893cbd286 add auto increment best fit allocator
7 years ago
minqiyang 565b19b7a5 fix set data type bug
7 years ago
minqiyang 8364688c30 Fix py_func_op's problem
7 years ago
sneaxiy 161b8ddcaa Merge develop
7 years ago
Zeng Jinle 6429d2a887 Merge pull request #16188 from sneaxiy/fix_const_cast
7 years ago
minqiyang b40e41fbd1 Polish code style
7 years ago
Hongyu Liu 9874530ee8 Update lod_reset_op.cc
7 years ago
xiaolil1 e818fa1004 Enable INT8 transpose kernel for MobileNet-SSD improvement. (#16159)
7 years ago
Hongyu Liu 58c69a9eb9 Update lod_reset_op.cc
7 years ago
Xin Pan 374abcf361 Merge pull request #16247 from panyx0718/imperative
7 years ago
Hongyu Liu 96c54cd9b1 Update lod_reset_op.cc
7 years ago
Tao Luo c072998ac1 Merge pull request #16219 from luotao1/fc_infershape
7 years ago
phlrain e46a94a4a2 fix lod_rest; test=develop
7 years ago
phlrain 10ececb213 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_sequence_pad_2
7 years ago
phlrain 802b33489a remove resize then seq num == 1; test=develop
7 years ago
phlrain 152beec540 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_lod_reset
7 years ago
tangwei12 8ea4218ce1 update load persistables for increment, test=develop (#15576)
7 years ago
phlrain a21fdde218 fix lod reset op;test=develop
7 years ago
phlrain dcba2e7236 fix conncat; test=develop
7 years ago
phlrain 955fad7a90 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into develop
7 years ago
phlrain a7fe3b508e fix concat; test=develop
7 years ago
tensor-tang 50931dee1d refine seq enum op
7 years ago
Qiyang Min 8e4ad008fb Merge pull request #16198 from velconia/imperative_train_speed
7 years ago
minqiyang 36dce65bb3 Take DataType and VarType apart
7 years ago
Xin Pan 3e9319f3ab add more imperative layer tests.
7 years ago
luotao1 d9f0e7252a refine with comments
7 years ago
Qiao Longfei 039d783db5 change communicator_recv_wait_ms to communicator_max_send_grad_num_before_recv
7 years ago
luotao1 6fa52f83ba Merge branch 'develop' into fc_infershape
7 years ago
luotao1 cc0ae1f1a1 refine with comments
7 years ago
dengkaipeng 9d435973d2 not use PERSISTENT in batch_norm. test=develop
7 years ago
luotao1 a275fd6e0c Merge branch 'develop' into runtime_context
7 years ago
Wojciech Uss 2579ade45f Add cpu_quantize_pass for C-API quantization (#16127)
7 years ago
Xin Pan 7458114b5b Merge pull request #16228 from panyx0718/imperative
7 years ago
chengduo efca4de78e Fix cross_entropy bug (#16236)
7 years ago