203 Commits (0d61653c8ea74f4e4fe86c090c117cf05f96d0f3)

Author SHA1 Message Date
WangXi 0d61653c8e Revert to run distributed unittest serially, test=develop (#21924)
6 years ago
songyouwei 8439384e25 Move sequence op unittest files to a separate folder sequence (#21842)
6 years ago
WangXi 52a61edaad rename test_dist_mnist & run dist parallel, test=develop (#21874)
6 years ago
gongweibao 855ed5fb3d revert parallel test=develop (#21856)
6 years ago
WangXi 0fe16539ef Fix dgc & launch tests in cpu ci (#21759)
6 years ago
gongweibao 549f24b5f1 run dist tests parallel(#21751)
6 years ago
juncaipeng 7c38612347 disable op test of kldiv_loss (#21749)
6 years ago
tangwei12 9ad940fdfe memory leak for cpu (#21174)
6 years ago
lilong12 0bc8bdf724 set dim[0] to -1 if dim[0] < 0 during compiling for c_allgather op (#21402)
6 years ago
hutuxian 47a82e38e3 Support data_norm gpu kernel (#21325)
6 years ago
Zeng Jinle cdb3d27985 Fix warn of gcc8 (#21205)
6 years ago
Dong Daxiang a6747a6ef1 add launch_ps module so that we can launch a parameter server trainin… (#20936)
6 years ago
Aurelius84 aacd16dbb4 add pyramid_hash_op (#20698)
6 years ago
Tao Luo efbdad0596 make search_compute support avx default (#20779)
6 years ago
WangXi 250e72d254 Fix DGC algorithm flow to make it the same as paper (#20758)
6 years ago
gongweibao 8088395a84 Set unique port to every distribute test to avoid potential port conflicts (#20759)
6 years ago
gongweibao 78431dc7bc Add detail log on parallel dygraph unit test when failed. (#20660)
6 years ago
gongweibao 89c4b3ddcf Add bash_test_modules function to capture the timeout or failed context. (#20197)
6 years ago
chengduo 0313b98ae0 temporally disable test_parallel_executor_fetch_feed in Windows CI (#20288)
6 years ago
Chengmo 728ec1b43d Add GEO-SGD distribute training algorithm (#20018)
6 years ago
gongweibao d4bca8118a change dist tests to serial test=develop (#20051)
6 years ago
gongweibao afc40a59eb Add `RUN_SERIAL` attribute to `exclusive` test. (#20026)
6 years ago
ShenLiang 5920d69df5 Avoid treating broadcast as initialization operation (#19857)
6 years ago
Zeng Jinle 0436efd6a3 Unify DataLoader APIs (#19305)
6 years ago
Zeng Jinle cee0079a4d fix readers bug, test=develop (#19868)
6 years ago
Thunderbrook bda92434db remove trainer desc test in windows temporarily (#19753)
6 years ago
Zeng Jinle bb4f8dee83 add logs to left var memory size, test=develop (#19722)
6 years ago
chengduo 2c30e64b2f increase timelimit test_pe_serexnext (#19702)
6 years ago
chengduo 5c4eb394e9 Make test_pe_seresnext serial (#19634)
6 years ago
Jiabin Yang e9233d1c1e Refactor dygraph (#19107)
6 years ago
hutuxian 66ad68ed7b Update UT test_boxps (#19599)
6 years ago
Tao Luo 49523ea189 replace PADDLE_ASSERT with PADDLE_ASSERT_MSG (#19586)
6 years ago
gongweibao 57f0f0f2dc Delete pserver complete file before executor running. (#19468)
6 years ago
ShenLiang 85914f7a88 add gather_nd op and unit test (#19366)
6 years ago
chengduo e340df013e Support feed single persistable variable to PE (#19417)
6 years ago
Yi Liu efb05ba258 supports multiple NCCL communicators preserved in NCCLCommContext (#19407)
6 years ago
chengduo 6a1632318d Split test_parallel_executor_seresnext to three unit test (#19239)
6 years ago
Aurelius84 78a3d837f8 Add match_matrix_tensor op (#18525)
6 years ago
Zeng Jinle 5b6673c44d merge develop to solve conflict, also fix API doc, test=develop (#18823)
6 years ago
gongweibao fd4b15a2f6 Unset unittests http_proxy env to avoid timeout. (#19269)
6 years ago
gongweibao 29d8781240 Polish fleet API to support cuda collective mode and nccl2 mode. (#18966)
6 years ago
Kevin e681d65515 Add var_conv_2d op (#18518)
6 years ago
Dong Daxiang c97ea53c3e make listen and server as exclusive run (#18990)
6 years ago
Dong Daxiang 2bb296dfe9 make dist unit test exclusive run (#18865)
6 years ago
chengduo ecd2bdada6 add CPUInplaceTestWithFuseOptimizationOps (#18867)
6 years ago
Zeng Jinle 9a8a7a1ddc fix affine_channel no_need buffer bug, test=develop (#18844)
6 years ago
Zeng Jinle a802da650b Feature/mem opt pass refactor (#18735)
6 years ago
guru4elephant 2efb282c86 split test_dist_se_resnext.py into 4 testcases (#18743)
6 years ago
Bob Zhu 220eef602e Extend Matmul to support matrix multiplication with multiple heads (#18570)
6 years ago
guru4elephant ebf9797ec3 split different comm method for mnist distributed training (#18715)
6 years ago