Megatest: Changes On Branch a3957fea2df00152

Changes In Branch redir-logs Through [a3957fea2d] Excluding Merge-Ins

This is equivalent to a diff from c2ba631f76 to a3957fea2d

2016-06-21
09:57		Switch to default-log-port check-in: f52cd44a6e user: mrwellan tags: redir-logs
04:06		Merging first phase of redir-logs into v1.61 check-in: 1f31d511c0 user: matt tags: v1.61
03:57		Merged filters-fix into redir-logs check-in: a3957fea2d user: matt tags: redir-logs
2016-06-20
18:20		Filter mostly fixed and added unit test for filter Closed-Leaf check-in: 1d19de5e2c user: mrwellan tags: filters-fix
2016-06-16
03:19		Added param for overriding port to debug:print and debug:print-info check-in: 7b4d2dba0e user: matt tags: redir-logs
2016-05-18
15:51		Merged with the latest 1.61/02 changes check-in: 3f21429f4f user: ritikaag tags: mtdboard
11:21		Forced cleanup db on changing versions check-in: c2ba631f76 user: ritikaag tags: v1.61
2016-05-16
17:13		Split show/hide to two buttons check-in: cf1f6d704a user: mrwellan tags: v1.61

Modified Makefile from [9afa174d56] to [1879ee0391].

Modified api.scm from [d17ed0b31f] to [56474b0795].

Modified archive.scm from [fc2c9e1ed0] to [042f92d6a8].

Modified client.scm from [ecbc2f1355] to [7cc2ebaa72].

Modified common.scm from [421424f2aa] to [a4b65729f6].

Modified common_records.scm from [01df09dfe2] to [8f8884da26].

Modified configf.scm from [805ef5eee8] to [eda94e2a4e].

Modified dashboard-tests.scm from [8ae6a513d2] to [60e020f283].

Modified dashboard.scm from [dfc3e5bd3f] to [98198ca4ab].

Modified datashare.scm from [578f007a04] to [0d1c92be09].

Modified db.scm from [b405cf0e93] to [be67a32d7b].

Modified dcommon.scm from [a93a40dfa1] to [e69a6c9935].

Modified env.scm from [15c6fe90f1] to [210b4269b0].

Modified ezsteps.scm from [18ab86f9c8] to [5c8ddee2c6].

Modified fs-transport.scm from [d187681c70] to [a9715248c2].

Modified http-transport.scm from [bb0436ebfa] to [77fd489de8].

Modified items.scm from [7ee30bc78a] to [346fccf127].

Modified keys.scm from [b0a1fb8bc8] to [9fb4a14b55].

Modified launch.scm from [24c723f779] to [558aeef368].

Modified lock-queue.scm from [1e70529cd9] to [d3482e4682].

Modified megatest.scm from [d7706449e8] to [ca4fb834f0].

Modified mt.scm from [4179497d10] to [a3313978a4].

Modified multi-dboard.scm from [3d4abbfc1d] to [5a321f3867].

Modified newdashboard.scm from [d467002de9] to [c721783846].

Modified nmsg-transport.scm from [c28712df60] to [3c791622d1].

Modified portlogger.scm from [f3f2be6883] to [977d54de64].

Modified process.scm from [7162768cf7] to [146c66de8d].

Modified rmt.scm from [e1950b4244] to [28f40eaf71].

Modified rpc-transport.scm from [1e1f685d67] to [a5276f2534].

Modified runconfig.scm from [7fa3564888] to [72b8d22e5c].

Modified runs.scm from [7cb6946e92] to [2ce0f7497c].

Modified server.scm from [109c6639f4] to [de00ab93f9].

Modified sharedat.scm from [2c59e32b03] to [6aa7fab140].

Modified spublish.scm from [9e76c7e82b] to [3eddab8b1d].

Modified sretrieve.scm from [915dd04401] to [4335bf3320].

Modified synchash.scm from [1596fbcb93] to [1b03ebb537].

Modified tasks.scm from [2559bee69c] to [7b05deb102].

Modified tdb.scm from [8d8250539c] to [7319c89ae9].

Modified tests.scm from [741f407659] to [86baf587d1].

Modified tests/unittests/basicserver.scm from [f2f7d0aa9d] to [85fa769c5b].

Modified tests/unittests/tests.scm from [15fd3688ae] to [936d866cb6].

︙
  (test-groups (make-hash-table)) ;; these two (disk and test groups) could be combined nicely
  (bup-exe (or (configf:lookup configdat "archive" "bup") "bup"))
  (compress (or (configf:lookup configdat "archive" "compress") "9"))
  (linktree (configf:lookup configdat "setup" "linktree")))
  (if (not archive-dir) ;; no archive disk found, this is fatal
      (begin
-       (debug:print 0 "FATAL: No archive disks found. Please add disks with at least " min-space " MB space to the [archive-disks] section of megatest.config")
-       (debug:print 0 " use [archive] minspace to specify minimum available space")
-       (debug:print 0 " disks: " (string-intersperse (map cadr (archive:get-archive-disks)) "\n "))
+       (debug:print 0 #f "FATAL: No archive disks found. Please add disks with at least " min-space " MB space to the [archive-disks] section of megatest.config")
+       (debug:print 0 #f " use [archive] minspace to specify minimum available space")
+       (debug:print 0 #f " disks: " (string-intersperse (map cadr (archive:get-archive-disks)) "\n "))
        (exit 1))
-     (debug:print-info 0 "Using path " archive-dir " for archiving"))
+     (debug:print-info 0 #f "Using path " archive-dir " for archiving"))

  ;; from the test info bin the path to the test by stem
  ;;
  (for-each
   (lambda (test-dat)
     (let* ((item-path (db:test-get-item-path test-dat))
            (test-name (db:test-get-testname test-dat))
︙
        (substring test-physical-path 0 partial-path-index)
        #f)))
  (cond
   (toplevel/children
-   (debug:print 0 "WARNING: cannot archive " test-name " with id " test-id " as it is a toplevel test with children"))
+   (debug:print 0 #f "WARNING: cannot archive " test-name " with id " test-id " as it is a toplevel test with children"))
   ((not (file-exists? test-path))
-   (debug:print 0 "WARNING: Cannot archive " test-name "/" item-path " as path " test-path " does not exist"))
+   (debug:print 0 #f "WARNING: Cannot archive " test-name "/" item-path " as path " test-path " does not exist"))
   (else
-   (debug:print 0
+   (debug:print 0 #f
                 "From test-dat=" test-dat " derived the following:\n"
                 "test-partial-path = " test-partial-path "\n"
                 "test-path = " test-path "\n"
                 "test-physical-path = " test-physical-path "\n"
                 "partial-path-index = " partial-path-index "\n"
                 "test-base = " test-base)
    (hash-table-set! disk-groups test-base (cons test-physical-path (hash-table-ref/default disk-groups test-base '())))
    (hash-table-set! test-groups test-base (cons test-dat (hash-table-ref/default test-groups test-base '())))
    test-path))))
   tests)
  ;; for each disk-group
  (for-each
   (lambda (disk-group)
-    (debug:print 0 "Processing disk-group " disk-group)
+    (debug:print 0 #f "Processing disk-group " disk-group)
     (let* ((test-paths (hash-table-ref disk-groups disk-group))
            ;; ((string-intersperse (map cadr (rmt:get-key-val-pairs 1)) "-")
            (bup-init-params (list "-d" archive-dir "init"))
            (bup-index-params (append (list "-d" archive-dir "index") test-paths))
            (bup-save-params (append (list "-d" archive-dir "save" ;; (conc "--strip-path=" linktree)
                                           (conc "-" compress) ;; or (conc "--compress=" compress)
                                           "-n" (conc (common:get-testsuite-name) "-" run-id)
                                           (conc "--strip-path=" disk-group))
                                     test-paths))
            (print-prefix #f)) ;; "Running: ")) ;; change to #f to turn off printing
       (if (not (file-exists? archive-dir))
           (create-directory archive-dir #t))
       (if (not (file-exists? (conc archive-dir "/HEAD")))
           (begin
             ;; replace this with jobrunner stuff enventually
-            (debug:print-info 0 "Init bup in " archive-dir)
+            (debug:print-info 0 #f "Init bup in " archive-dir)
             ;; (mutex-lock! bup-mutex)
             (run-n-wait bup-exe params: bup-init-params print-cmd: print-prefix)
             ;; (mutex-unlock! bup-mutex)
             ))
-      (debug:print-info 0 "Indexing data to be archived")
+      (debug:print-info 0 #f "Indexing data to be archived")
       ;; (mutex-lock! bup-mutex)
       (run-n-wait bup-exe params: bup-index-params print-cmd: print-prefix)
-      (debug:print-info 0 "Archiving data with bup")
+      (debug:print-info 0 #f "Archiving data with bup")
       (run-n-wait bup-exe params: bup-save-params print-cmd: print-prefix)
       ;; (mutex-unlock! bup-mutex)
       (for-each
        (lambda (test-dat)
          (let ((test-id (db:test-get-id test-dat))
                (run-id (db:test-get-run_id test-dat)))
            (rmt:test-set-archive-block-id run-id test-id archive-id)
︙
  ;;
  (if (and (not toplevel/children) ;; special handling needed for toplevel with children
           prev-test-physical-path
           (file-exists? prev-test-physical-path)) ;; what to do? abort or clean up or link it in?
      (let* ((base (pathname-directory prev-test-physical-path))
             (dirn (pathname-file prev-test-physical-path))
             (newn (conc base "/." dirn)))
-       (debug:print 0 "ERROR: the old directory " prev-test-physical-path ", still exists! Moving it to " newn)
+       (debug:print 0 #f "ERROR: the old directory " prev-test-physical-path ", still exists! Moving it to " newn)
        (rename-file prev-test-physical-path newn)))

  (if (and archive-path ;; no point in proceeding if there is no actual archive
           (not toplevel/children))
      (begin
        ;; CREATE WORK AREA
        ;; test-src-path == #f ==> don't copy in data from tests directory
︙
  ;; bup -d /tmp/matt/adisk1/2015_q1/fullrun_e1a40/ restore -C /tmp/seeme fullrun-30/latest/ubuntu/nfs/none/w02.1.20.54_b/
  ;; DO BUP RESTORE
  (let* ((new-test-dat (rmt:get-test-info-by-id run-id test-id))
         (new-test-path (if (vector? new-test-dat )
                            (db:test-get-rundir new-test-dat)
                            (begin
-                             (debug:print 0 "ERROR: unable to get data for run-id=" run-id ", test-id=" test-id)
+                             (debug:print 0 #f "ERROR: unable to get data for run-id=" run-id ", test-id=" test-id)
                              (exit 1))))
         ;; new-test-path won't work - must use best-disk instead? Nope, new-test-path but tack on /..
         (bup-restore-params (list "-d" archive-path "restore" "-C" (conc new-test-path "/..") archive-internal-path)))
-   (debug:print-info 0 "Restoring archived data to " new-test-physical-path " from archive in " archive-path " ... " archive-internal-path)
+   (debug:print-info 0 #f "Restoring archived data to " new-test-physical-path " from archive in " archive-path " ... " archive-internal-path)
    ;; (mutex-lock! bup-mutex)
    (run-n-wait bup-exe params: bup-restore-params print-cmd: #f)
    ;; (mutex-unlock! bup-mutex)
    (mt:test-set-state-status-by-id run-id test-id "COMPLETED" #f #f)))
- (debug:print 0 "ERROR: No archive path in the record for run-id=" run-id " test-id=" test-id))))
+ (debug:print 0 #f "ERROR: No archive path in the record for run-id=" run-id " test-id=" test-id))))
  (filter vector? tests))))

︙
  ;; (old-exit code)))

  (define getenv get-environment-variable)
  (define (safe-setenv key val)
    (if (and (string? val)(string? key))
        (handle-exceptions
         exn
-        (debug:print 0 "ERROR: bad value for setenv, key=" key ", value=" val)
+        (debug:print 0 #f "ERROR: bad value for setenv, key=" key ", value=" val)
         (setenv key val))
-       (debug:print 0 "ERROR: bad value for setenv, key=" key ", value=" val)))
+       (debug:print 0 #f "ERROR: bad value for setenv, key=" key ", value=" val)))

  (define home (getenv "HOME"))
  (define user (getenv "USER"))

  ;; GLOBAL GLETCHES
  (define db-keys #f)
︙
  (define (common:set-last-run-version)
    (rmt:set-var "MEGATEST_VERSION" (common:version-signature)))

  (define (common:version-changed?)
    (not (equal? (common:get-last-run-version)
                 (common:version-signature))))

- (define (common:exit-on-version-changed)
-   (if (common:version-changed?)
-       (begin
-         (debug:print 0
-                      "ERROR: Version mismatch!\n"
-                      " expected: " (common:version-signature) "\n"
-                      " got: " (common:get-last-run-version) "\n"
-                      " to switch versions you can run: \"megatest -cleanup-db\"")
-         ;; megatest -cleanup-db IS NOT correcting the dbver. Let's force it for now.
-         ;; Matt: please review this!
-         (db:multi-db-sync
-          #f
-          'killservers
-          'dejunk
-          'new2old)
-         (rmt:set-var "MEGATEST_VERSION" (common:version-signature))
-         (exit 1))))
+ ;; Move me elsewhere ...
+ ;;
+ (define (common:cleanup-db)
+   (db:multi-db-sync
+    #f ;; do all run-ids
+    ;; 'new2old
+    'killservers
+    'dejunk
+    ;; 'adj-testids
+    ;; 'old2new
+    'new2old)
+   (if (common:version-changed?)
+       (common:set-last-run-version)))
+
+ (define (common:exit-on-version-changed)
+   (if (common:version-changed?)
+       (let ((mtconf (conc (get-environment-variable "MT_RUN_AREA_HOME") "/megatest.config")))
+         (debug:print 0 #f
+                      "ERROR: Version mismatch!\n"
+                      " expected: " (common:version-signature) "\n"
+                      " got: " (common:get-last-run-version))
+         (if (and (file-exists? mtconf)
+                  (eq? (current-user-id)(file-owner mtconf))) ;; safe to run -cleanup-db
+             (begin
+               (debug:print 0 #f " I see you are the owner of megatest.config, attempting to cleanup and reset to new version")
+               (handle-exceptions
+                exn
+                (begin
+                  (debug:print 0 #f "Failed to switch versions.")
+                  (debug:print 0 #f " message: " ((condition-property-accessor 'exn 'message) exn))
+                  (print-call-chain (current-error-port))
+                  (exit 1))
+                (common:cleanup-db)))
+             (begin
+               (debug:print 0 #f " to switch versions you can run: \"megatest -cleanup-db\"")
+               (exit 1))))))

  ;;======================================================================
  ;; S P A R S E A R R A Y S
  ;;======================================================================

  (define (make-sparse-array)
    (let ((a (make-sparse-vector)))
︙
  (define (common:read-encoded-string instr)
    (handle-exceptions
     exn
     (handle-exceptions
      exn
      (begin
-       (debug:print 0 "ERROR: received bad encoded string \"" instr "\", message: " ((condition-property-accessor 'exn 'message) exn))
+       (debug:print 0 #f "ERROR: received bad encoded string \"" instr "\", message: " ((condition-property-accessor 'exn 'message) exn))
        (print-call-chain (current-error-port))
        #f)
      (read (open-input-string (base64:base64-decode instr))))
     (read (open-input-string (z3:decode-buffer (base64:base64-decode instr))))))

  ;; dot-locking egg seems not to work, using this for now
  ;; if lock is older than expire-time then remove it and try again
︙
  (define (std-exit-procedure)
    (let ((no-hurry (if time-to-exit ;; hurry up
                        #f
                        (begin
                          (set! time-to-exit #t)
                          #t))))
-     (debug:print-info 4 "starting exit process, finalizing databases.")
+     (debug:print-info 4 #f "starting exit process, finalizing databases.")
      (if (and no-hurry (debug:debug-mode 18))
          (rmt:print-db-stats))
      (let ((th1 (make-thread (lambda () ;; thread for cleaning up, give it five seconds
                                (let ((run-ids (hash-table-keys db-local-sync)))
                                  (if (and (not (null? run-ids))
                                           (or (common:legacy-sync-recommended)
                                               (configf:lookup configdat "setup" "megatest-db")))
︙
  (let ((db (cdr task-db)))
    (if (sqlite3:database? db)
        (begin
          (sqlite3:interrupt! db)
          (sqlite3:finalize! db #t)
          (vector-set! task-db 0 #f)))))) "Cleanup db exit thread"))
  (th2 (make-thread (lambda ()
-                     (debug:print 4 "Attempting clean exit. Please be patient and wait a few seconds...")
+                     (debug:print 4 #f "Attempting clean exit. Please be patient and wait a few seconds...")
                      (if no-hurry
                          (thread-sleep! 5) ;; give the clean up few seconds to do it's stuff
                          (thread-sleep! 2))
-                     (debug:print 4 " ... done")
+                     (debug:print 4 #f " ... done")
                      )
                    "clean exit")))
  (thread-start! th1)
  (thread-start! th2)
  (thread-join! th1))))

  (define (std-signal-handler signum)
    ;; (signal-mask! signum)
    (set! time-to-exit #t)
-   (debug:print 0 "ERROR: Received signal " signum " exiting promptly")
+   (debug:print 0 #f "ERROR: Received signal " signum " exiting promptly")
    ;; (std-exit-procedure) ;; shouldn't need this since we are exiting and it will be called anyway
    (exit))

  (set-signal-handler! signal/int std-signal-handler) ;; ^C
  (set-signal-handler! signal/term std-signal-handler)
  ;; (set-signal-handler! signal/stop std-signal-handler) ;; ^Z NO, do NOT handle ^Z!
︙
  (else #f)))

  (define (any->number-if-possible val)
    (let ((num (any->number val)))
      (if num num val)))

  (define (patt-list-match item patts)
-   (debug:print-info 8 "patt-list-match item=" item " patts=" patts)
+   (debug:print-info 8 #f "patt-list-match item=" item " patts=" patts)
    (if (and item patts) ;; here we are filtering for matches with item patterns
        (let ((res #f)) ;; look through all the item-patts if defined, format is patt1,patt2,patt3 ... wildcard is %
          (for-each
           (lambda (patt)
             (let ((modpatt (string-substitute "%" ".*" patt #t)))
-              (debug:print-info 10 "patt " patt " modpatt " modpatt)
+              (debug:print-info 10 #f "patt " patt " modpatt " modpatt)
               (if (string-match (regexp modpatt) item)
                   (set! res #t))))
           (string-split patts ","))
          res)
        #t))

  ;; (map print (map car (hash-table->alist (read-config "runconfigs.config" #f #t))))
︙
  (let* ((rtestpatt (if rconf (runconfigs-get rconf "TESTPATT") #f))
         (args-testpatt (or (args:get-arg "-testpatt") (args:get-arg "-runtests") "%"))
         (testpatt (or (and (equal? args-testpatt "%") rtestpatt) args-testpatt)))
-   (if rtestpatt (debug:print-info 0 "TESTPATT from runconfigs: " rtestpatt))
+   (if rtestpatt (debug:print-info 0 #f "TESTPATT from runconfigs: " rtestpatt))
    testpatt))

  (define (common:get-linktree)
    (or (getenv "MT_LINKTREE")
        (if configdat
            (configf:lookup configdat "setup" "linktree"))))
︙
              #f)))
    (if valid
        (if split
            tlist
            target)
        (if target
            (begin
-             (debug:print 0 "ERROR: Invalid target, spaces or blanks not allowed \"" target "\", target should be: " (string-intersperse keys "/") ", have " tlist " for elements")
+             (debug:print 0 #f "ERROR: Invalid target, spaces or blanks not allowed \"" target "\", target should be: " (string-intersperse keys "/") ", have " tlist " for elements")
              #f)
            #f))))

  ;;======================================================================
  ;; M I S C L I S T S
  ;;======================================================================
︙
  (value (caddr hed))
  (existing-rowdat (assoc rowkey rownames))
  (existing-coldat (assoc colkey colnames))
  (curr-rownum (if existing-rowdat rownum (+ rownum 1)))
  (curr-colnum (if existing-coldat colnum (+ colnum 1)))
  (new-rownames (if existing-rowdat rownames (cons (list rowkey curr-rownum) rownames)))
  (new-colnames (if existing-coldat colnames (cons (list colkey curr-colnum) colnames))))
- ;; (debug:print-info 0 "Processing record: " hed )
+ ;; (debug:print-info 0 #f "Processing record: " hed )
  (if proc (proc curr-rownum curr-colnum rowkey colkey value))
  (if (null? tal)
      (list new-rownames new-colnames)
      (loop (car tal)
            (cdr tal)
            new-rownames
            new-colnames
︙
  ;; make "nice-path" available in config files and the repl
  (define nice-path common:nice-path)

  (define (common:read-link-f path)
    (handle-exceptions
     exn
     (begin
-      (debug:print 0 "ERROR: command \"/bin/readlink -f " path "\" failed.")
+      (debug:print 0 #f "ERROR: command \"/bin/readlink -f " path "\" failed.")
       path) ;; just give up
     (with-input-from-pipe
      (conc "/bin/readlink -f " path)
      (lambda ()
        (read-line)))))

  (define (get-cpu-load)
︙
  (first (car loadavg))
  (next (cadr loadavg))
  (adjload (* maxload numcpus))
  (loadjmp (- first next)))
  (cond
   ((and (> first adjload)
         (> count 0))
-   (debug:print-info 0 "waiting " waitdelay " seconds due to load " first " exceeding max of " adjload (if msg msg ""))
+   (debug:print-info 0 #f "waiting " waitdelay " seconds due to load " first " exceeding max of " adjload (if msg msg ""))
    (thread-sleep! waitdelay)
    (common:wait-for-cpuload maxload numcpus waitdelay count: (- count 1)))
   ((and (> loadjmp numcpus)
         (> count 0))
-   (debug:print-info 0 "waiting " waitdelay " seconds due to load jump " loadjmp " > numcpus " numcpus (if msg msg ""))
+   (debug:print-info 0 #f "waiting " waitdelay " seconds due to load jump " loadjmp " > numcpus " numcpus (if msg msg ""))
    (thread-sleep! waitdelay)
    (common:wait-for-cpuload maxload numcpus waitdelay count: (- count 1))))))

  (define (common:get-num-cpus)
    (with-input-from-file "/proc/cpuinfo"
      (lambda ()
        (let loop ((numcpu 0)
︙
  (let* ((spacedat (common:check-db-dir-space))
         (is-ok (car spacedat))
         (dbspace (cadr spacedat))
         (required (caddr spacedat))
         (dbdir (cadddr spacedat)))
    (if (not is-ok)
        (begin
-         (debug:print 0 "ERROR: Insufficient space in " dbdir ", require " required ", have " dbspace ", exiting now.")
+         (debug:print 0 #f "ERROR: Insufficient space in " dbdir ", require " required ", have " dbspace ", exiting now.")
          (exit 1)))))

  ;; paths is list of lists ((name path) ... )
  ;;
  (define (common:get-disk-with-most-free-space disks minsize)
    (let ((best #f)
          (bestsize 0))
      (for-each
       (lambda (disk-num)
         (let* ((dirpath (cadr (assoc disk-num disks)))
                (freespc (cond
                          ((not (directory? dirpath))
                           (if (common:low-noise-print 300 "disks not a dir " disk-num)
-                              (debug:print 0 "WARNING: disk " disk-num " at path \"" dirpath "\" is not a directory - ignoring it."))
+                              (debug:print 0 #f "WARNING: disk " disk-num " at path \"" dirpath "\" is not a directory - ignoring it."))
                           -1)
                          ((not (file-write-access? dirpath))
                           (if (common:low-noise-print 300 "disks not writeable " disk-num)
-                              (debug:print 0 "WARNING: disk " disk-num " at path \"" dirpath "\" is not writeable - ignoring it."))
+                              (debug:print 0 #f "WARNING: disk " disk-num " at path \"" dirpath "\" is not writeable - ignoring it."))
                           -1)
                          ((not (eq? (string-ref dirpath 0) #\/))
                           (if (common:low-noise-print 300 "disks not a proper path " disk-num)
-                              (debug:print 0 "WARNING: disk " disk-num " at path \"" dirpath "\" is not a fully qualified path - ignoring it."))
+                              (debug:print 0 #f "WARNING: disk " disk-num " at path \"" dirpath "\" is not a fully qualified path - ignoring it."))
                           -1)
                          (else
                           (get-df dirpath)))))
           (if (> freespc bestsize)
               (begin
                 (set! best (cons disk-num dirpath))
                 (set! bestsize freespc)))))
︙
  fallback-launcher
  (let loop ((hed (car launchers))
             (tal (cdr launchers)))
    (let ((patt (car hed))
          (host-type (cadr hed)))
      (if (tests:match patt testname itempath)
          (begin
-           (debug:print-info 2 "Have flexi-launcher match for " testname "/" itempath " = " host-type)
+           (debug:print-info 2 #f "Have flexi-launcher match for " testname "/" itempath " = " host-type)
            (let ((launcher (configf:lookup configdat "host-types" host-type)))
              (if launcher
                  launcher
                  (begin
-                   (debug:print-info 0 "WARNING: no launcher found for host-type " host-type)
+                   (debug:print-info 0 #f "WARNING: no launcher found for host-type " host-type)
                    (if (null? tal)
                        fallback-launcher
                        (loop (car tal)(cdr tal)))))))
          ;; no match, try again
          (if (null? tal)
              fallback-launcher
              (loop (car tal)(cdr tal))))))))
  fallback-launcher)))

︙
  (list key val metadata)
  (list key val))))))

  (define (config:eval-string-in-environment str)
    (handle-exceptions
     exn
     (begin
-      (debug:print 0 "ERROR: problem evaluating \"" str "\" in the shell environment")
+      (debug:print 0 #f "ERROR: problem evaluating \"" str "\" in the shell environment")
       #f)
     (let ((cmdres (process:cmd-run->list (conc "echo " str))))
       (if (null? cmdres) ""
           (caar cmdres)))))

  ;;======================================================================
  ;; Make the regexp's needed globally available
︙
  ((runconfigs-get) (conc "(lambda (ht)(runconfigs-get ht \"" cmd "\"))"))
  ((rget) (conc "(lambda (ht)(runconfigs-get ht \"" cmd "\"))"))
  (else "(lambda (ht)(print \"ERROR\") \"ERROR\")"))))
  ;; (print "fullcmd=" fullcmd)
  (handle-exceptions
   exn
   (begin
-    (debug:print 0 "WARNING: failed to process config input \"" l "\"")
-    (debug:print 0 " message: " ((condition-property-accessor 'exn 'message) exn))
+    (debug:print 0 #f "WARNING: failed to process config input \"" l "\"")
+    (debug:print 0 #f " message: " ((condition-property-accessor 'exn 'message) exn))
     ;; (print "exn=" (condition->list exn))
     (set! result (conc "#{( " cmdtype ") " cmd"}")))
   (if (or allow-system
           (not (member cmdtype '("system" "shell"))))
       (with-input-from-string fullcmd
         (lambda ()
           (set! result ((eval (read)) ht))))
       (set! result (conc "#{(" cmdtype ") " cmd "}"))))
  (case cmdsym
    ((system shell scheme)
     (let ((delta (- (current-seconds) start-time)))
       (if (> delta 2)
-          (debug:print-info 0 "for line \"" l "\"\n command: " cmd " took " delta " seconds to run with output:\n " result)
-          (debug:print-info 9 "for line \"" l "\"\n command: " cmd " took " delta " seconds to run with output:\n " result)))))
+          (debug:print-info 0 #f "for line \"" l "\"\n command: " cmd " took " delta " seconds to run with output:\n " result)
+          (debug:print-info 9 #f "for line \"" l "\"\n command: " cmd " took " delta " seconds to run with output:\n " result)))))
  (loop (conc prestr result poststr)))
  res))
  res)))

  ;; Run a shell command and return the output as a string
  (define (shell cmd)
    (let* ((output (process:cmd-run->list cmd))
           (res (car output))
           (status (cadr output)))
      (if (equal? status 0)
          (let ((outres (string-intersperse res "\n")))
-           (debug:print-info 4 "shell result:\n" outres)
+           (debug:print-info 4 #f "shell result:\n" outres)
            outres)
          (begin
            (with-output-to-port (current-error-port)
              (lambda ()
                (print "ERROR: " cmd " returned bad exit code " status)))
            ""))))
︙
  ;; adds to ht if given (must be #f otherwise)
  ;; envion-patt is a regex spec that identifies sections that will be eval'd
  ;; in the environment on the fly
  ;; sections: #f => get all, else list of sections to gather
  ;; post-section-procs alist of section-pattern => proc, where: (proc section-name next-section-name ht curr-path)
  ;;
  (define (read-config path ht allow-system #!key (environ-patt #f)(curr-section #f)(sections #f)(settings (make-hash-table))(keep-filenames #f)(post-section-procs '()))
-   (debug:print-info 5 "read-config " path " allow-system " allow-system " environ-patt " environ-patt " curr-section: " curr-section " sections: " sections " pwd: " (current-directory))
-   (debug:print 9 "START: " path)
+   (debug:print-info 5 #f "read-config " path " allow-system " allow-system " environ-patt " environ-patt " curr-section: " curr-section " sections: " sections " pwd: " (current-directory))
+   (debug:print 9 #f "START: " path)
    (if (not (file-exists? path))
        (begin
-         (debug:print-info 1 "read-config - file not found " path " current path: " (current-directory))
+         (debug:print-info 1 #f "read-config - file not found " path " current path: " (current-directory))
          ;; WARNING: This is a risky change but really, we should not return an empty hash table if no file read?
          #f) ;; (if (not ht)(make-hash-table) ht))
        (let ((inp (open-input-file path))
              (res (if (not ht)(make-hash-table) ht))
              (metapath (if (or (debug:debug-mode 9)
                                keep-filenames)
                            path #f)))
          (let loop ((inl (configf:read-line inp res (calc-allow-system allow-system curr-section sections) settings)) ;; (read-line inp))
                     (curr-section-name (if curr-section curr-section "default"))
                     (var-flag #f);; turn on for key-var-pr and cont-ln-rx, turn off elsewhere
                     (lead #f))
-           (debug:print-info 8 "curr-section-name: " curr-section-name " var-flag: " var-flag "\n inl: \"" inl "\"")
+           (debug:print-info 8 #f "curr-section-name: " curr-section-name " var-flag: " var-flag "\n inl: \"" inl "\"")
            (if (eof-object? inl)
                (begin
                  (close-input-port inp)
                  (hash-table-delete! res "") ;; we are using "" as a dumping ground and must remove it before returning the ht
-                 (debug:print 9 "END: " path)
+                 (debug:print 9 #f "END: " path)
                  res)
                (regex-case
                 inl
                 (configf:comment-rx _ (loop (configf:read-line inp res (calc-allow-system allow-system curr-section-name sections) settings) curr-section-name #f #f))
                 (configf:blank-l-rx _ (loop (configf:read-line inp res (calc-allow-system allow-system curr-section-name sections) settings) curr-section-name #f #f))
                 (configf:settings ( x setting val ) (begin
                                                       (hash-table-set! settings setting val)
                                                       (loop (configf:read-line inp res (calc-allow-system allow-system curr-section-name sections) settings) curr-section-name #f #f)))
                 (configf:include-rx ( x include-file ) (let* ((curr-conf-dir (pathname-directory path))
                                                               (full-conf (if (absolute-pathname? include-file)
                                                                              include-file
                                                                              (common:nice-path
                                                                               (conc (if curr-conf-dir curr-conf-dir ".") "/" include-file)))))
                                                          (if (file-exists? full-conf)
                                                              (begin
                                                                ;; (push-directory conf-dir)
-                                                               (debug:print 9 "Including: " full-conf)
+                                                               (debug:print 9 #f "Including: " full-conf)
                                                                (read-config full-conf res allow-system environ-patt: environ-patt curr-section: curr-section-name sections: sections settings: settings keep-filenames: keep-filenames)
                                                                ;; (pop-directory)
                                                                (loop (configf:read-line inp res (calc-allow-system allow-system curr-section-name sections) settings) curr-section-name #f #f))
                                                              (begin
-                                                               (debug:print '(2 9) "INFO: include file " include-file " not found (called from " path ")")
-                                                               (debug:print 2 " " full-conf)
+                                                               (debug:print '(2 9) #f "INFO: include file " include-file " not found (called from " path ")")
+                                                               (debug:print 2 #f " " full-conf)
                                                                (loop (configf:read-line inp res (calc-allow-system allow-system curr-section-name sections) settings) curr-section-name #f #f)))))
                 (configf:section-rx ( x section-name ) (begin
                                                          ;; call post-section-procs
                                                          (for-each
                                                           (lambda (dat)
                                                             (let ((patt (car dat))
                                                                   (proc (cdr dat)))
︙
249 250 251 252 253 254 255 ~~256~~ 257 258 ~~259~~ 260 261 ~~262 263~~ 264 265 266 267 268 269 270 271 272 273 274 275 276 277 278 ~~279~~ 280 281 282 283 284 285 286 287 288 ~~289~~ 290 ~~291~~ 292 293 294 295 296 297 298 299 300 301 302 303 304 305 306 307 308 309 ~~310~~ 311 312 313 314 315 316 317 318 319 320 321 322 ~~323~~ 324 325 326 327 328 329 330	249 250 251 252 253 254 255 256 257 258 259 260 261 262 263 264 265 266 267 268 269 270 271 272 273 274 275 276 277 278 279 280 281 282 283 284 285 286 287 288 289 290 291 292 293 294 295 296 297 298 299 300 301 302 303 304 305 306 307 308 309 310 311 312 313 314 315 316 317 318 319 320 321 322 323 324 325 326 327 328 329 330	- + - + - - + + - + - + - + - + - +	(let ((alist (hash-table-ref/default res curr-section-name '())) (val-proc (lambda () (let* ((start-time (current-seconds)) (cmdres (process:cmd-run->list cmd)) (delta (- (current-seconds) start-time)) (status (cadr cmdres)) (res (car cmdres))) ~~(debug:print-info 4 "" inl "\n => " (string-intersperse res "\n"))~~ (debug:print-info 4 #f "" inl "\n => " (string-intersperse res "\n")) (if (not (eq? status 0)) (begin ~~(debug:print 0 "ERROR: problem with " inl ", return code " status~~ (debug:print 0 #f "ERROR: problem with " inl ", return code " status " output: " cmdres))) (if (> delta 2) ~~(debug:print-info 0 "for line \"" inl "\"\n command: " cmd " took " delta " seconds to run with output:\n " res) (debug:print-info 9 "for line \"" inl "\"\n command: " cmd " took " delta " seconds to run with output:\n " res))~~ (debug:print-info 0 #f "for line \"" inl "\"\n command: " cmd " took " delta " seconds to run with output:\n " res) (debug:print-info 9 #f "for line \"" inl "\"\n command: " cmd " took " delta " seconds to run with output:\n " res)) (if (null? res) "" (string-intersperse res " ")))))) (hash-table-set! 
res curr-section-name (config:assoc-safe-add alist key (case (calc-allow-system allow-system curr-section-name sections) ((return-procs) val-proc) ((return-string) cmd) (else (val-proc))) metadata: metapath)) (loop (configf:read-line inp res (calc-allow-system allow-system curr-section-name sections) settings) curr-section-name #f #f)) (loop (configf:read-line inp res (calc-allow-system allow-system curr-section-name sections) settings) curr-section-name #f #f))) (configf:key-no-val ( x key val) (let* ((alist (hash-table-ref/default res curr-section-name '())) (fval (or (if (string? val) val #f) ""))) ;; fval should be either "" or " " (one or more spaces) ~~(debug:print 10 " setting: [" curr-section-name "] " key " = #t")~~ (debug:print 10 #f " setting: [" curr-section-name "] " key " = #t") (safe-setenv key fval) (hash-table-set! res curr-section-name (config:assoc-safe-add alist key fval metadata: metapath)) (loop (configf:read-line inp res (calc-allow-system allow-system curr-section-name sections) settings) curr-section-name key #f))) (configf:key-val-pr ( x key unk1 val unk2 ) (let* ((alist (hash-table-ref/default res curr-section-name '())) (envar (and environ-patt (string-search (regexp environ-patt) curr-section-name))) (realval (if envar (config:eval-string-in-environment val) val))) ~~(debug:print-info 6 "read-config env setting, envar: " envar " realval: " realval " val: " val " key: " key " curr-section-name: " curr-section-name)~~ (debug:print-info 6 #f "read-config env setting, envar: " envar " realval: " realval " val: " val " key: " key " curr-section-name: " curr-section-name) (if envar (safe-setenv key realval)) ~~(debug:print 10 " setting: [" curr-section-name "] " key " = " val)~~ (debug:print 10 #f " setting: [" curr-section-name "] " key " = " val) (hash-table-set! 
res curr-section-name (config:assoc-safe-add alist key realval metadata: metapath)) (loop (configf:read-line inp res (calc-allow-system allow-system curr-section-name sections) settings) curr-section-name key #f))) ;; if a continued line (configf:cont-ln-rx ( x whsp val ) (let ((alist (hash-table-ref/default res curr-section-name '()))) (if var-flag ;; if set to a string then we have a continued var (let ((newval (conc (config-lookup res curr-section-name var-flag) "\n" ;; trim lead from the incoming whsp to support some indenting. (if lead (string-substitute (regexp lead) "" whsp) "") val))) ;; (print "val: " val "\nnewval: \"" newval "\"\nvarflag: " var-flag) (hash-table-set! res curr-section-name (config:assoc-safe-add alist var-flag newval metadata: metapath)) (loop (configf:read-line inp res (calc-allow-system allow-system curr-section-name sections) settings) curr-section-name var-flag (if lead lead whsp))) (loop (configf:read-line inp res (calc-allow-system allow-system curr-section-name sections) settings) curr-section-name #f #f)))) ~~(else (debug:print 0 "ERROR: problem parsing " path ",\n \"" inl "\"")~~ (else (debug:print 0 #f "ERROR: problem parsing " path ",\n \"" inl "\"") (set! 
var-flag #f) (loop (configf:read-line inp res (calc-allow-system allow-system curr-section-name sections) settings) curr-section-name #f #f)))))))) ;; pathenvvar will set the named var to the path of the config (define (find-and-read-config fname #!key (environ-patt #f)(given-toppath #f)(pathenvvar #f)) (let* ((curr-dir (current-directory)) (configinfo (find-config fname toppath: given-toppath)) (toppath (car configinfo)) (configfile (cadr configinfo)) (set-fields (lambda (curr-section next-section ht path) (let ((field-names (if ht (keys:config-get-fields ht) '())) (target (or (getenv "MT_TARGET")(args:get-arg "-reqtarg")(args:get-arg "-target")))) ~~(debug:print-info 9 "set-fields with field-names=" field-names " target=" target " curr-section=" curr-section " next-section=" next-section " path=" path " ht=" ht)~~ (debug:print-info 9 #f "set-fields with field-names=" field-names " target=" target " curr-section=" curr-section " next-section=" next-section " path=" path " ht=" ht) (if (not (null? field-names))(keys:target-set-args field-names target #f)))))) (if toppath (change-directory toppath)) (if (and toppath pathenvvar)(setenv pathenvvar toppath)) (let ((configdat (if configfile (read-config configfile #f #t environ-patt: environ-patt post-section-procs: (list (cons "^fields$" set-fields)) #f)))) (if toppath (change-directory curr-dir)) (list configdat toppath configfile fname))))
︙
465 466 467 468 469 470 471 ~~472~~ 473 ~~474~~ 475 476 477 478 479 480 481	465 466 467 468 469 470 471 472 473 474 475 476 477 478 479 480 481	- + - +	(set! res (append res (list hed)))) ((not newval) ;; key has been removed (set! new #f)) ((not (equal? newval val)) (hash-table-set! sechash key newval) (set! new (conc key " " newval))) (else ~~(debug:print 0 "ERROR: problem parsing line number " lnum "\"" hed "\"")))))~~ (debug:print 0 #f "ERROR: problem parsing line number " lnum "\"" hed "\""))))) (else ~~(debug:print 0 "ERROR: Problem parsing line num " lnum " :\n " hed )))~~ (debug:print 0 #f "ERROR: Problem parsing line num " lnum " :\n " hed ))) (if (not (null? tal)) (loop (car tal)(cdr tal)(if new (append res (list new)) res)(+ lnum 1))) ;; drop to here when done processing, res contains modified list of lines (set! fdat res))) ;; step 4: Append new values to the section (for-each
︙

︙
233 234 235 236 237 238 239 ~~240~~ 241 242 243 244 245 246 247	233 234 235 236 237 238 239 240 241 242 243 244 245 246 247	- +	))))) ;; if there is a submegatest create a button to launch dashboard in that area ;; (define (submegatest-panel dbstruct keydat testdat runname testconfig) (let* ((subarea (configf:lookup testconfig "setup" "submegatest")) (area-exists (and subarea (file-exists? subarea)))) ~~;; (debug:print-info 0 "Megatest subarea=" subarea ", area-exists=" area-exists)~~ ;; (debug:print-info 0 #f "Megatest subarea=" subarea ", area-exists=" area-exists) (if subarea (iup:frame #:title "Megatest Run Info" ; #:expand "YES" (iup:button "Launch Dashboard" #:action (lambda (obj) (system (conc "cd " subarea ";env -i PATH=$PATH DISPLAY=$DISPLAY HOME=$HOME USER=$USER dashboard &")))))
︙
422 423 424 425 426 427 428 ~~429~~ 430 431 432 433 434 435 436	422 423 424 425 426 427 428 429 430 431 432 433 434 435 436	- +	local: #t)) (testdat (rmt:get-test-info-by-id run-id test-id)) ;; (db:get-test-info-by-id dbstruct run-id test-id)) (db-mod-time 0) ;; (file-modification-time db-path)) (last-update 0) ;; (current-seconds)) (request-update #t)) (if (not testdat) (begin ~~(debug:print 2 "ERROR: No test data found for test " test-id ", exiting")~~ (debug:print 2 #f "ERROR: No test data found for test " test-id ", exiting") (exit 1)) (let* (;; (run-id (if testdat (db:test-get-run_id testdat) #f)) (test-registry (tests:get-all)) (keydat (if testdat (rmt:get-key-val-pairs run-id) #f)) (rundat (if testdat (rmt:get-run-info run-id) #f)) (runname (if testdat (db:get-value-by-header (db:get-rows rundat) (db:get-header rundat)
︙
509 510 511 512 513 514 515 ~~516~~ 517 ~~518~~ 519 520 521 522 523 524 525 526 ~~527~~ 528 529 530 531 532 533 534	509 510 511 512 513 514 515 516 517 518 519 520 521 522 523 524 525 526 527 528 529 530 531 532 533 534	- + - + - +	(> (current-milliseconds)(+ last-update 250))) ;; every half seconds if db touched (> (current-milliseconds)(+ last-update 10000)) ;; force update even 10 seconds request-update)) (newtestdat (if need-update ;; NOTE: BUG HIDER, try to eliminate this exception handler (handle-exceptions exn ~~(debug:print-info 0 "test db access issue in examine test for run-id " run-id ", test-id " test-id ": " ((condition-property-accessor 'exn 'message) exn))~~ (debug:print-info 0 #f "test db access issue in examine test for run-id " run-id ", test-id " test-id ": " ((condition-property-accessor 'exn 'message) exn)) (rmt:get-test-info-by-id run-id test-id ))))) ~~;; (debug:print-info 0 "need-update= " need-update " curr-mod-time = " curr-mod-time)~~ ;; (debug:print-info 0 #f "need-update= " need-update " curr-mod-time = " curr-mod-time) (cond ((and need-update newtestdat) (set! testdat newtestdat) (set! teststeps (tests:get-compressed-steps run-id test-id)) (set! logfile (conc (db:test-get-rundir testdat) "/" (db:test-get-final_logf testdat))) (set! rundir ;; (filedb:get-path fdb (db:test-get-rundir testdat)) ;; ) (set! testfullname (db:test-get-fullname testdat)) ~~;; (debug:print 0 "INFO: teststeps=" (intersperse teststeps "\n "))~~ ;; (debug:print 0 #f "INFO: teststeps=" (intersperse teststeps "\n ")) ;; I don't see why this was implemented this way. Please comment it ... ;; (if (eq? curr-mod-time db-mod-time) ;; do only once if same ;; (set! db-mod-time (+ curr-mod-time 1)) ;; (set! db-mod-time curr-mod-time)) (if (not (eq? curr-mod-time db-mod-time))
︙
577 578 579 580 581 582 583 ~~584~~ 585 586 587 588 589 590 591 592 593 594 595 596 597 598 599 600 ~~601~~ 602 603 604 605 606 607 608	577 578 579 580 581 582 583 584 585 586 587 588 589 590 591 592 593 594 595 596 597 598 599 600 601 602 603 604 605 606 607 608	- + - +	lbl)) (store-button store-label) (command-proc (lambda (command-text-box) (let* ((cmd (iup:attribute command-text-box "VALUE")) (fullcmd (conc (dtests:get-pre-command) cmd (dtests:get-post-command)))) ~~(debug:print-info 02 "Running command: " fullcmd)~~ (debug:print-info 02 #f "Running command: " fullcmd) (common:without-vars fullcmd "MT_.")))) (command-text-box (iup:textbox #:expand "HORIZONTAL" #:font "Courier New, -10" #:action (lambda (obj cnum val) ;; (print "cnum=" cnum) (if (eq? cnum 13) (command-prox obj))) )) (command-launch-button (iup:button "Execute!" #:action (lambda (x) (command-proc command-text-box)))) ;; (lambda (x) ;; (let ((cmd (iup:attribute command-text-box "VALUE")) ;; (fullcmd (conc (dtests:get-pre-command) ;; cmd ;; (dtests:get-post-command)))) ~~;; (debug:print-info 02 "Running command: " fullcmd)~~ ;; (debug:print-info 02 #f "Running command: " fullcmd) ;; (common:without-vars fullcmd "MT_.*"))))) (kill-jobs (lambda (x) (iup:attribute-set! command-text-box "VALUE" (conc "megatest -target " keystring " -runname " runname " -set-state-status KILLREQ,n/a -testpatt %/% " " -state RUNNING"))))
︙

︙
372 373 374 375 376 377 378 ~~379~~ 380 381 382 383 384 385 386 387 388 389 390 391 392 ~~393~~ ~~394~~ 395 396 397 398 399 400 401 402 403 404 405 406 ~~407~~ 408 409 410 411 412 413 414 415 416 417 418 419 420 421 422 ~~423~~ 424 425 426 427 428 429 430	372 373 374 375 376 377 378 379 380 381 382 383 384 385 386 387 388 389 390 391 392 393 394 395 396 397 398 399 400 401 402 403 404 405 406 407 408 409 410 411 412 413 414 415 416 417 418 419 420 421 422 423 424 425 426 427 428 429	- + - + - - + - +	(append tmptests prev-tests)) (lambda (a b) (eq? (db:test-get-id a)(db:test-get-id b))))))) (if (eq? tests-sort-reverse 3) ;; +event_time (sort newdat dboard:compare-tests) newdat)))) (vector-set! prev-dat 3 (- (current-seconds) 2)) ;; go back two seconds in time to ensure all changes are captured. ~~;; (debug:print 0 "(dboard:get-tests-for-run-duplicate: filters-changed=" (d:alldat-filters-changed data) " last-update=" last-update " got " (length tmptests) " test records for run " run-id)~~ ;; (debug:print 0 #f "(dboard:get-tests-for-run-duplicate: filters-changed=" (d:alldat-filters-changed data) " last-update=" last-update " got " (length tmptests) " test records for run " run-id) tests)) ;; create a virtual table of all the tests ;; keypatts: ( (KEY1 "abc%def")(KEY2 "%") ) (define (update-rundat data runnamepatt numruns testnamepatt keypatts) (let* ((referenced-run-ids '()) (allruns (if (d:alldat-useserver data) (rmt:get-runs runnamepatt numruns (d:alldat-start-run-offset data) keypatts) (db:get-runs (d:alldat-dblocal data) runnamepatt numruns ;; (+ numruns 1) ;; (/ numruns 2)) (d:alldat-start-run-offset data) keypatts))) (header (db:get-header allruns)) (runs (db:get-rows allruns)) (result '()) ~~(maxtests 0)~~ (maxtests 0)) ~~)~~ ;; ;; trim runs to only those that are changing often here ;; (for-each (lambda (run) (let* ((run-id (db:get-value-by-header run header "id")) (key-vals (if (d:alldat-useserver data) (rmt:get-key-vals run-id) (db:get-key-vals 
(d:alldat-dblocal data) run-id))) (tests (dboard:get-tests-for-run-duplicate data run-id run testnamepatt key-vals))) ;; NOTE: bubble-up also sets the global (d:alldat-item-test-names data) ;; (tests (bubble-up tmptests priority: bubble-type)) ;; NOTE: 11/01/2013 This routine is NOT getting called excessively. ~~;; (debug:print 0 "Getting data for run " run-id " with key-vals=" key-vals)~~ ;; (debug:print 0 #f "Getting data for run " run-id " with key-vals=" key-vals) ;; Not sure this is needed? (if (not (null? tests)) (begin (set! referenced-run-ids (cons run-id referenced-run-ids)) (if (> (length tests) maxtests) (set! maxtests (length tests))) (if (or (not (d:alldat-hide-empty-runs data)) ;; this reduces the data burden when set (not (null? tests))) (let ((dstruct (vector run tests key-vals (- (current-seconds) 10)))) (hash-table-set! (d:alldat-allruns-by-id data) run-id dstruct) (set! result (cons dstruct result)))))))) runs) (d:alldat-header-set! data header) (d:alldat-allruns-set! data result) ~~(debug:print-info 6 "(d:alldat-allruns data) has " (length (d:alldat-allruns data)) " runs")~~ (debug:print-info 6 #f "(d:alldat-allruns data) has " (length (d:alldat-allruns data)) " runs") maxtests)) (define collapsed (make-hash-table)) ; (define row-lookup (make-hash-table)) ;; testname => (rownum lableobj) (define (toggle-hide lnum) ; fulltestname) (let* ((btn (vector-ref (dboard:uidat-get-lftcol uidat) lnum))
︙
1226 1227 1228 1229 1230 1231 1232 ~~1233~~ 1234 1235 1236 1237 1238 1239 1240	1225 1226 1227 1228 1229 1230 1231 1232 1233 1234 1235 1236 1237 1238 1239	- +	#f #f "id,testname,item_path,state,status" (if (d:alldat-filters-changed data) 0 last-update) dashboard-mode)) '()))) ;; get 'em all ~~(debug:print 0 "dboard:get-tests-dat: got " (length tdat) " test records for run " run-id)~~ (debug:print 0 #f "dboard:get-tests-dat: got " (length tdat) " test records for run " run-id) (sort tdat (lambda (a b) (let* ((aval (vector-ref a 2)) (bval (vector-ref b 2)) (anum (string->number aval)) (bnum (string->number bval))) (if (and anum bnum) (< anum bnum)
︙
1253 1254 1255 1256 1257 1258 1259 ~~1260~~ 1261 1262 1263 1264 1265 1266 1267	1252 1253 1254 1255 1256 1257 1258 1259 1260 1261 1262 1263 1264 1265 1266	- +	;; (print "obj: " obj ", id: " id ", state: " state) (let* ((run-path (tree:node->path obj id)) (run-id (tree-path->run-id ddata (cdr run-path)))) (if (number? run-id) (begin (d:data-curr-run-id-set! ddata run-id) (dashboard:update-run-summary-tab)) ~~(debug:print 0 "ERROR: tree-path->run-id returned non-number " run-id)))~~ (debug:print 0 #f "ERROR: tree-path->run-id returned non-number " run-id))) ;; (print "path: " (tree:node->path obj id) " run-id: " run-id) ))) (cell-lookup (make-hash-table)) (run-matrix (iup:matrix #:expand "YES" #:click-cb (lambda (obj lin col status)
︙
1400 1401 1402 1403 1404 1405 1406 ~~1407~~ 1408 1409 1410 1411 1412 1413 1414	1399 1400 1401 1402 1403 1404 1405 1406 1407 1408 1409 1410 1411 1412 1413	- +	;; (print "obj: " obj ", id: " id ", state: " state) (let* ((run-path (tree:node->path obj id)) (run-id (tree-path->run-id ddata (cdr run-path)))) (if (number? run-id) (begin (d:data-curr-run-id-set! ddata run-id) (dashboard:update-new-view-tab)) ~~(debug:print 0 "ERROR: tree-path->run-id returned non-number " run-id)))~~ (debug:print 0 #f "ERROR: tree-path->run-id returned non-number " run-id))) ;; (print "path: " (tree:node->path obj id) " run-id: " run-id) ))) (cell-lookup (make-hash-table)) (run-matrix (iup:matrix #:expand "YES" #:click-cb (lambda (obj lin col status)
︙
1602 1603 1604 1605 1606 1607 1608 ~~1609~~ 1610 1611 1612 1613 1614 1615 1616	1601 1602 1603 1604 1605 1606 1607 1608 1609 1610 1611 1612 1613 1614 1615	- +	;; (iup:attribute-set! obj "TITLE" (if (d:alldat-hide-not-hide data) "HideTests" "NotHide")) (iup:attribute-set! hide "BGCOLOR" sel-color) (iup:attribute-set! show "BGCOLOR" nonsel-color) (mark-for-update)))) (set! show (iup:button "Show" #:expand "YES" #:action (lambda (obj) ~~(d:alldat-hide-not-hide-set! data (not (d:alldat-hide-not-hide data)))~~ (d:alldat-hide-not-hide-set! data #f) ;; (not (d:alldat-hide-not-hide data))) (iup:attribute-set! show "BGCOLOR" sel-color) (iup:attribute-set! hide "BGCOLOR" nonsel-color) (mark-for-update)))) (iup:attribute-set! hide "BGCOLOR" sel-color) (iup:attribute-set! show "BGCOLOR" nonsel-color) ;; (d:alldat-hide-not-hide-button-set! data hideit) ;; never used, can eliminate ... (iup:vbox
︙
1644 1645 1646 1647 1648 1649 1650 ~~1651~~ 1652 1653 1654 1655 1656 1657 1658	1643 1644 1645 1646 1647 1648 1649 1650 1651 1652 1653 1654 1655 1656 1657	- +	(map cadr common:std-states))) ;; '("RUNNING" "COMPLETED" "INCOMPLETE" "LAUNCHED" "NOT_STARTED" "KILLED" "DELETED"))) (iup:valuator #:valuechanged_cb (lambda (obj) (let ((val (inexact->exact (round (/ (string->number (iup:attribute obj "VALUE")) 10)))) (oldmax (string->number (iup:attribute obj "MAX"))) (maxruns (d:alldat-tot-runs data))) (d:alldat-start-run-offset-set! data val) (mark-for-update) ~~(debug:print 6 "(d:alldat-start-run-offset data) " (d:alldat-start-run-offset data) " maxruns: " maxruns ", val: " val " oldmax: " oldmax)~~ (debug:print 6 #f "(d:alldat-start-run-offset data) " (d:alldat-start-run-offset data) " maxruns: " maxruns ", val: " val " oldmax: " oldmax) (iup:attribute-set! obj "MAX" (* maxruns 10)))) #:expand "HORIZONTAL" #:max (* 10 (length (d:alldat-allruns data))) #:min 0 #:step 0.01))) ;(iup:button "inc rows" #:action (lambda (obj)(d:alldat-num-tests-set! data (+ (d:alldat-num-tests data) 1)))) ;(iup:button "dec rows" #:action (lambda (obj)(d:alldat-num-tests-set! data (if (> (d:alldat-num-tests data) 0)(- (d:alldat-num-tests data) 1) 0))))
︙
1696 1697 1698 1699 1700 1701 1702 ~~1703~~ 1704 1705 1706 1707 1708 1709 1710	1695 1696 1697 1698 1699 1700 1701 1702 1703 1704 1705 1706 1707 1708 1709	- +	(set! lftlst (append lftlst (list (iup:hbox #:expand "HORIZONTAL" (iup:valuator #:valuechanged_cb (lambda (obj) (let ((val (string->number (iup:attribute obj "VALUE"))) (oldmax (string->number (iup:attribute obj "MAX"))) (newmax (* 10 (length alltestnamelst)))) (d:alldat-please-update-set! data #t) (d:alldat-start-test-offset-set! alldat (inexact->exact (round (/ val 10)))) ~~(debug:print 6 "(d:alldat-start-test-offset alldat) " (d:alldat-start-test-offset alldat) " val: " val " newmax: " newmax " oldmax: " oldmax)~~ (debug:print 6 #f "(d:alldat-start-test-offset alldat) " (d:alldat-start-test-offset alldat) " val: " val " newmax: " newmax " oldmax: " oldmax) (if (< val 10) (iup:attribute-set! obj "MAX" newmax)) )) #:expand "VERTICAL" #:orientation "VERTICAL" #:min 0 #:step 0.01)
︙
1843 1844 1845 1846 1847 1848 1849 ~~1850~~ 1851 1852 1853 1854 1855 1856 1857	1842 1843 1844 1845 1846 1847 1848 1849 1850 1851 1852 1853 1854 1855 1856	- +	;; Force creation of the db in case it isn't already there. (tasks:open-db) (define (dashboard:get-youngest-run-db-mod-time) (handle-exceptions exn (begin ~~(debug:print 0 "WARNING: error in accessing databases in get-youngest-run-db-mod-time: " ((condition-property-accessor 'exn 'message) exn))~~ (debug:print 0 #f "WARNING: error in accessing databases in get-youngest-run-db-mod-time: " ((condition-property-accessor 'exn 'message) exn)) (current-seconds)) ;; something went wrong - just print an error and return current-seconds (apply max (map (lambda (filen) (file-modification-time filen)) (glob (conc (d:alldat-dbdir alldat) "/.db")))))) (define (dashboard:run-update x) (let ((modtime (dashboard:get-youngest-run-db-mod-time)) ;; (file-modification-time (d:alldat-dbfpath alldat)))
︙
1928 1929 1930 1931 1932 1933 1934 ~~1935~~ 1936 1937 1938 1939 1940 1941 1942	1927 1928 1929 1930 1931 1932 1933 1934 1935 1936 1937 1938 1939 1940 1941	- +	(run-id (car dat)) (test-id (cadr dat))) (if (and (number? run-id) (number? test-id) (>= test-id 0)) (examine-test run-id test-id) (begin ~~(debug:print 3 "INFO: tried to open test with invalid run-id,test-id. " (args:get-arg "-test"))~~ (debug:print 3 #f "INFO: tried to open test with invalid run-id,test-id. " (args:get-arg "-test")) (exit 1))))) ((args:get-arg "-guimonitor") (gui-monitor (d:alldat-dblocal data))) (else (set! uidat (make-dashboard-buttons data ;; (d:alldat-dblocal data) (d:alldat-numruns data) (d:alldat-num-tests data)
︙

︙
184 185 186 187 188 189 190 ~~191~~ 192 193 194 195 196 197 198	184 185 186 187 188 189 190 191 192 193 194 195 196 197 198	- +	(time-b (db:get-value-by-header record-b header "event_time"))) (> time-a time-b))) )) (runid-to-col (hash-table-ref cachedata "runid-to-col")) (testname-to-row (hash-table-ref cachedata "testname-to-row")) (colnum 1) (rownum 0)) ;; rownum = 0 is the header ~~;; (debug:print 0 "test-ids " test-ids ", tests-detail-changes " tests-detail-changes)~~ ;; (debug:print 0 #f "test-ids " test-ids ", tests-detail-changes " tests-detail-changes) ;; tests related stuff ;; (all-testnames (delete-duplicates (map db:test-get-testname test-changes)))) ;; Given a run-id and testname/item_path calculate a cell R:C ;; NOTE: Also build the test tree browser and look up table
︙
260 261 262 263 264 265 266 ~~267~~ 268 269 270 271 272 273 274 275 276 277 278 279 280 ~~281~~ 282 283 284 285 286 287 288 289 290 291 292 293 294 295 296 297 ~~298 299~~ 300 301 302 303 304 305 306	260 261 262 263 264 265 266 267 268 269 270 271 272 273 274 275 276 277 278 279 280 281 282 283 284 285 286 287 288 289 290 291 292 293 294 295 296 297 298 299 300 301 302 303 304 305 306	- + - + - - + +	(tb (dboard:data-get-tests-tree data))) (print "INFONOTE: run-path: " run-path) (tree:add-node (dboard:data-get-tests-tree data) "Runs" test-path userdata: (conc "test-id: " test-id)) (let ((node-num (tree:find-node tb (cons "Runs" test-path))) (color (car (gutils:get-color-for-state-status state status)))) ~~(debug:print 0 "node-num: " node-num ", color: " color)~~ (debug:print 0 #f "node-num: " node-num ", color: " color) (iup:attribute-set! tb (conc "COLOR" node-num) color)) (hash-table-set! (dboard:data-get-path-test-ids data) test-path test-id) (if (not rownum) (let ((rownums (hash-table-values testname-to-row))) (set! rownum (if (null? rownums) 1 (+ 1 (apply max rownums)))) (hash-table-set! testname-to-row fullname rownum) ;; create the label (iup:attribute-set! (dboard:data-get-runs-matrix data) (conc rownum ":" 0) dispname) )) ;; set the cell text and color ~~;; (debug:print 2 "rownum:colnum=" rownum ":" colnum ", state=" status)~~ ;; (debug:print 2 #f "rownum:colnum=" rownum ":" colnum ", state=" status) (iup:attribute-set! (dboard:data-get-runs-matrix data) (conc rownum ":" colnum) (if (member state '("ARCHIVED" "COMPLETED")) status state)) (iup:attribute-set! (dboard:data-get-runs-matrix data) (conc "BGCOLOR" rownum ":" colnum) (car (gutils:get-color-for-state-status state status))) )) tests))) run-ids) (let ((updater (hash-table-ref/default (dboard:data-get-updaters data) window-id #f))) (if updater (updater (hash-table-ref/default data get-details-sig #f)))) (iup:attribute-set! 
(dboard:data-get-runs-matrix data) "REDRAW" "ALL") ~~;; (debug:print 2 "run-changes: " run-changes) ;; (debug:print 2 "test-changes: " test-changes)~~ ;; (debug:print 2 #f "run-changes: " run-changes) ;; (debug:print 2 #f "test-changes: " test-changes) (list run-changes all-test-changes))) ;;====================================================================== ;; TESTS DATA ;;====================================================================== ;; Produce a list of lists ready for common:sparse-list-generate-index
︙
683 684 685 686 687 688 689 ~~690 691~~ 692 693 694 695 696 697 698	683 684 685 686 687 688 689 690 691 692 693 694 695 696 697 698	- - + +	(waitons (vector-ref test-record 2))) (for-each (lambda (waiton) (let* ((waiton-box-info (hash-table-ref/default tests-hash waiton #f)) (waiton-center (dcommon:get-box-center (or waiton-box-info test-box-info)))) (dcommon:draw-arrow cnv test-box-center waiton-center))) waitons) ~~;; (debug:print 0 "test-box-info=" test-box-info) ;; (debug:print 0 "test-record=" test-record)~~ ;; (debug:print 0 #f "test-box-info=" test-box-info) ;; (debug:print 0 #f "test-record=" test-record) )) (define (dcommon:estimate-scale sizex sizey originx originy nodes) ;; (print "sizex: " sizex " sizey: " sizey " originx: " originx " originy: " originy " nodes: " nodes) (let* ((maxx 1) (maxy 1)) (for-each
︙
900 901 902 903 904 905 906 ~~907~~ 908 909 910 911 ~~912~~ 913 914 915 916 917 918 919 920 921	900 901 902 903 904 905 906 907 908 909 910 911 912 913 914 915 916 917 918 919 920 921	- + - +	(loop (car tal)(cdr tal)(+ rownum 1) 1)))))) (if (> max-row 0) (begin ;; we are going to speculatively clear rows until we find a row that is already cleared (let loop ((rownum (+ max-row 1)) (colnum 0) (deleted #f)) ~~;; (debug:print-info 0 "cleaning " rownum ":" colnum)~~ ;; (debug:print-info 0 #f "cleaning " rownum ":" colnum) (let* ((next-row (if (eq? colnum max-col) (+ rownum 1) rownum)) (next-col (if (eq? colnum max-col) 1 (+ colnum 1))) (mtrx-rc (conc rownum ":" colnum)) (curr-val (iup:attribute steps-matrix mtrx-rc))) ~~;; (debug:print-info 0 "cleaning " rownum ":" colnum " currval= " curr-val)~~ ;; (debug:print-info 0 #f "cleaning " rownum ":" colnum " currval= " curr-val) (if (and (string? curr-val) (not (equal? curr-val ""))) (begin (iup:attribute-set! steps-matrix mtrx-rc "") (loop next-row next-col #t)) (if (eq? colnum max-col) ;; not done, didn't get a full blank row (if deleted (loop next-row next-col #f)) ;; exit on this not met (loop next-row next-col deleted))))) (iup:attribute-set! steps-matrix "REDRAW" "ALL")))))

︙
39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56	39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56	- + - +	(test-name (db:test-get-testname testdat)) (kill-job #f)) ;; for future use (on re-factoring with launch.scm code (let loop ((count 5)) (if (file-exists? test-run-dir) (push-directory test-run-dir) (if (> count 0) (begin ~~(debug:print 0 "WARNING: ezsteps attempting to run but test run directory " test-run-dir " is not there. Waiting and trying again " count " more times")~~ (debug:print 0 #f "WARNING: ezsteps attempting to run but test run directory " test-run-dir " is not there. Waiting and trying again " count " more times") (sleep 3) (loop (- count 1)))))) ~~(debug:print-info 0 "Running in directory " test-run-dir)~~ (debug:print-info 0 #f "Running in directory " test-run-dir) (if (not (file-exists? ".ezsteps"))(create-directory ".ezsteps")) ;; if ezsteps was defined then we are sure to have at least one step but check anyway (if (not (> (length ezstepslst) 0)) (message-window "ERROR: You can only re-run steps defined via ezsteps") (begin (let loop ((ezstep (car ezstepslst)) (tal (cdr ezstepslst))
︙
70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92	70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92	- + - +	(if (and start-step-name (not runflag)) (if (equal? stepname start-step-name) (set! runflag #t) ;; and continue (if (not (null? tal)) (loop (car tal)(cdr tal) stepname #f)))) ~~(debug:print 4 "ezsteps:\n stepname: " stepname " stepinfo: " stepinfo " stepparts: " stepparts~~ (debug:print 4 #f "ezsteps:\n stepname: " stepname " stepinfo: " stepinfo " stepparts: " stepparts " stepparms: " stepparms " stepcmd: " stepcmd) (if (file-exists? (conc stepname ".logpro"))(set! logpro-used #t)) ;; call the command using mt_ezstep (set! script (conc "mt_ezstep " stepname " " (if prevstep prevstep "-") " " stepcmd)) ~~(debug:print 4 "script: " script)~~ (debug:print 4 #f "script: " script) (rmt:teststep-set-status! run-id test-id stepname "start" "-" #f #f) ;; now launch (let ((pid (process-run script))) (let processloop ((i 0)) (let-values (((pid-val exit-status exit-code)(process-wait pid #t))) (mutex-lock! run-mutex) (vector-set! exit-info 0 pid)
︙
113 114 115 116 117 118 119 ~~120~~ 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 ~~140~~ 141 142 143 144 145 146 147	113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147	- + - +	((eq? rollup-status 0) 'pass) (else 'fail))) (next-status (cond ((eq? overall-status 'pass) this-step-status) ((eq? overall-status 'warn) (if (eq? this-step-status 'fail) 'fail 'warn)) (else 'fail)))) ~~(debug:print 4 "Exit value received: " (vector-ref exit-info 2) " logpro-used: " logpro-used~~ (debug:print 4 #f "Exit value received: " (vector-ref exit-info 2) " logpro-used: " logpro-used " this-step-status: " this-step-status " overall-status: " overall-status " next-status: " next-status " rollup-status: " rollup-status) (case next-status ((warn) (set! rollup-status 2) ;; NB// test-set-status! does rdb calls under the hood (tests:test-set-status! test-id "RUNNING" "WARN" (if (eq? this-step-status 'warn) "Logpro warning found" #f) #f)) ((pass) (tests:test-set-status! test-id "RUNNING" "PASS" #f #f)) (else ;; 'fail (set! rollup-status 1) ;; force fail (tests:test-set-status! test-id "RUNNING" "FAIL" (conc "Failed at step " stepname) #f) )))) (if (and (steprun-good? logpro-used (vector-ref exit-info 2)) (not (null? tal))) (if (not run-one) ;; if we got here we completed the step, if run-one is true, stop (loop (car tal) (cdr tal) stepname runflag)))) ~~(debug:print 4 "WARNING: a prior step failed, stopping at " ezstep)))~~ (debug:print 4 #f "WARNING: a prior step failed, stopping at " ezstep))) ;; Once done with step/steps update the test record ;; (let* ((item-path (db:test-get-item-path testdat)) ;; (item-list->path itemdat)) (testinfo (rmt:get-testinfo-by-id run-id test-id))) ;; refresh the testdat, call it iteminfo in case need prev/curr ;; Am I completed? (if (equal? (db:test-get-state testinfo) "RUNNING") ;; (not (equal? (db:test-get-state testinfo) "COMPLETED"))
︙
155 156 157 158 159 160 161 ~~162~~ 163 164 165 166 167 168 169 170 171 172 173 174 175	155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175	- +	;; if the current status is AUTO the defer to the calculated value (i.e. leave this AUTO) (if (equal? (db:test-get-status testinfo) "AUTO") "AUTO" "PASS")) ((eq? rollup-status 1) "FAIL") ((eq? rollup-status 2) ;; if the current status is AUTO the defer to the calculated value but qualify (i.e. make this AUTO-WARN) (if (equal? (db:test-get-status testinfo) "AUTO") "AUTO-WARN" "WARN")) (else "FAIL")))) ;; (db:test-get-status testinfo))) ~~(debug:print-info 2 "Test NOT logged as COMPLETED, (state=" (db:test-get-state testinfo) "), updating result, rollup-status is " rollup-status)~~ (debug:print-info 2 #f "Test NOT logged as COMPLETED, (state=" (db:test-get-state testinfo) "), updating result, rollup-status is " rollup-status) (tests:test-set-status! test-id new-state new-status (args:get-arg "-m") #f) ;; need to update the top test record if PASS or FAIL and this is a subtest (if (not (equal? item-path "")) (cdb:roll-up-pass-fail-counts runremote run-id test-name item-path new-status)))) ;; for automated creation of the rollup html file this is a good place... (if (not (equal? item-path "")) (tests:summarize-items #f run-id test-id test-name #f)) ;; don't force - just update if no ))) (pop-directory) rollup-status))

︙
71 72 73 74 75 76 77 ~~78 79~~ 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 ~~96 97~~ 98 99 100 101 ~~102~~ 103 104 105 106 107 108 109	71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109	- - + + - + - - + + - +	(define (lock-queue:set-state dbdat test-id newstate #!key (remtries 10)) (tasks:wait-on-journal (lock-queue:db-dat-get-path dbdat) 1200) (handle-exceptions exn (if (> remtries 0) (begin ~~(debug:print 0 "WARNING: exception on lock-queue:set-state. Trying again in 30 seconds.") (debug:print 0 " message: " ((condition-property-accessor 'exn 'message) exn))~~ (debug:print 0 #f "WARNING: exception on lock-queue:set-state. Trying again in 30 seconds.") (debug:print 0 #f " message: " ((condition-property-accessor 'exn 'message) exn)) (thread-sleep! 30) (lock-queue:set-state dbdat test-id newstate remtries: (- remtries 1))) (begin ~~(debug:print 0 "ERROR: Failed to set lock state for test with id " test-id ", error: " ((condition-property-accessor 'exn 'message) exn) ", giving up.")~~ (debug:print 0 #f "ERROR: Failed to set lock state for test with id " test-id ", error: " ((condition-property-accessor 'exn 'message) exn) ", giving up.") #f)) (sqlite3:execute (lock-queue:db-dat-get-db dbdat) "UPDATE queue SET state=? WHERE test_id=?;" newstate test-id))) (define (lock-queue:any-younger? dbdat mystart test-id #!key (remtries 10)) ;; no need to wait on journal on read only queries ;; (tasks:wait-on-journal (lock-queue:db-dat-get-path dbdat) 1200) (handle-exceptions exn (if (> remtries 0) (begin ~~(debug:print 0 "WARNING: exception on lock-queue:any-younger. Removing lockdb and trying again in 5 seconds.") (debug:print 0 " message: " ((condition-property-accessor 'exn 'message) exn))~~ (debug:print 0 #f "WARNING: exception on lock-queue:any-younger. Removing lockdb and trying again in 5 seconds.") (debug:print 0 #f " message: " ((condition-property-accessor 'exn 'message) exn)) (thread-sleep! 5) (lock-queue:delete-lock-db dbdat) (lock-queue:any-younger? dbdat mystart test-id remtries: (- remtries 1))) (begin ~~(debug:print 0 "ERROR: Failed to find younger locks for test with id " test-id ", error: " ((condition-property-accessor 'exn 'message) exn) ", giving up.")~~ (debug:print 0 #f "ERROR: Failed to find younger locks for test with id " test-id ", error: " ((condition-property-accessor 'exn 'message) exn) ", giving up.") #f)) (let ((res #f)) (sqlite3:for-each-row (lambda (tid) ;; Actually this should not be needed as mystart cannot be simultaneously less than and test-id same as (if (not (equal? tid test-id)) (set! res tid)))
︙
117 118 119 120 121 122 123 ~~124 125~~ 126 127 128 129 130 131 132	117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132	- - + +	(db (lock-queue:db-dat-get-db dbdat)) (lckqry (sqlite3:prepare db "SELECT test_id,run_lock FROM runlocks WHERE run_lock='locked';")) (mklckqry (sqlite3:prepare db "INSERT INTO runlocks (test_id,run_lock) VALUES (?,'locked');"))) (let ((result (handle-exceptions exn (begin ~~(debug:print 0 "WARNING: failed to get queue lock. Removing lock db and returning fail") ;; Will try again in a few seconds") (debug:print 0 " message: " ((condition-property-accessor 'exn 'message) exn))~~ (debug:print 0 #f "WARNING: failed to get queue lock. Removing lock db and returning fail") ;; Will try again in a few seconds") (debug:print 0 #f " message: " ((condition-property-accessor 'exn 'message) exn)) (thread-sleep! 10) ;; (if (> count 0) ;; #f ;; (lock-queue:get-lock dbdat test-id count: (- count 1)) - give up on retries ;; (begin ;; never recovered, remote the lock file and return #f, no lock obtained (lock-queue:delete-lock-db dbdat) #f) (sqlite3:with-transaction
︙
149 150 151 152 153 154 155 ~~156 157~~ 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 ~~176~~ 177 178 179 180 ~~181 182~~ 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 ~~202 203~~ 204 205 206 207 208 209 210 ~~211~~ 212 213 214 215 216 217 218	149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 213 214 215 216 217 218	- - + + - + - - + + - - + + - +	(define (lock-queue:release-lock fname test-id #!key (count 10)) (let* ((dbdat (lock-queue:open-db fname))) (tasks:wait-on-journal (lock-queue:db-dat-get-path dbdat) 1200 "lock-queue:release-lock; waiting on journal") (handle-exceptions exn (begin ~~(debug:print 0 "WARNING: Failed to release queue lock. Will try again in few seconds") (debug:print 0 " message: " ((condition-property-accessor 'exn 'message) exn))~~ (debug:print 0 #f "WARNING: Failed to release queue lock. Will try again in few seconds") (debug:print 0 #f " message: " ((condition-property-accessor 'exn 'message) exn)) (thread-sleep! (/ count 10)) (if (> count 0) (begin (sqlite3:finalize! (lock-queue:db-dat-get-db dbdat)) (lock-queue:release-lock fname test-id count: (- count 1))) (let ((journal (conc fname "-journal"))) ;; If we've tried ten times and failed there is a serious problem ;; try to remove the lock db and allow it to be recreated (handle-exceptions exn #f (if (file-exists? journal)(delete-file journal)) (if (file-exists? fname) (delete-file fname)) #f)))) (sqlite3:execute (lock-queue:db-dat-get-db dbdat) "DELETE FROM runlocks WHERE test_id=?;" test-id) (sqlite3:finalize! (lock-queue:db-dat-get-db dbdat))))) (define (lock-queue:steal-lock dbdat test-id #!key (count 10)) ~~(debug:print-info 0 "Attempting to steal lock at " (lock-queue:db-dat-get-path dbdat))~~ (debug:print-info 0 #f "Attempting to steal lock at " (lock-queue:db-dat-get-path dbdat)) (tasks:wait-on-journal (lock-queue:db-dat-get-path dbdat) 1200 "lock-queue:steal-lock; waiting on journal") (handle-exceptions exn (begin ~~(tadebug:print 0 "WARNING: Failed to steal queue lock. Will try again in few seconds") (debug:print 0 " message: " ((condition-property-accessor 'exn 'message) exn))~~ (debug:print 0 #f "WARNING: Failed to steal queue lock. Will try again in few seconds") (debug:print 0 #f " message: " ((condition-property-accessor 'exn 'message) exn)) (thread-sleep! 10) (if (> count 0) (lock-queue:steal-lock dbdat test-id count: (- count 1)) #f)) (sqlite3:execute (lock-queue:db-dat-get-db dbdat) "DELETE FROM runlocks WHERE run_lock='locked';")) (lock-queue:get-lock dbdat test-it)) ;; returns #f if ok to skip the task ;; returns #t if ok to proceed with task ;; otherwise waits ;; (define (lock-queue:wait-turn fname test-id #!key (count 10)(waiting-msg #f)) (let* ((dbdat (lock-queue:open-db fname)) (mystart (current-seconds)) (db (lock-queue:db-dat-get-db dbdat))) ;; (tasks:wait-on-journal (lock-queue:db-dat-get-path dbdat) 1200 waiting-msg: "lock-queue:wait-turn; waiting on journal file") (handle-exceptions exn (begin ~~(debug:print 0 "WARNING: Failed to find out if it is ok to skip the wait queue. Will try again in few seconds") (debug:print 0 " message: " ((condition-property-accessor 'exn 'message) exn))~~ (debug:print 0 #f "WARNING: Failed to find out if it is ok to skip the wait queue. Will try again in few seconds") (debug:print 0 #f " message: " ((condition-property-accessor 'exn 'message) exn)) (print-call-chain (current-error-port)) (thread-sleep! 10) (if (> count 0) (begin (sqlite3:finalize! db) (lock-queue:wait-turn fname test-id count: (- count 1))) (begin ~~(debug:print 0 "Giving up calls to lock-queue:wait-turn for test-id " test-id " at path " fname ", printing call chain")~~ (debug:print 0 #f "Giving up calls to lock-queue:wait-turn for test-id " test-id " at path " fname ", printing call chain") (print-call-chain (current-error-port)) #f))) ;; wait 10 seconds and then check to see if someone is already updating the html (thread-sleep! 10) (if (not (lock-queue:any-younger? dbdat mystart test-id)) ;; no processing in flight, must try to start processing (begin (tasks:wait-on-journal (lock-queue:db-dat-get-path dbdat) 1200 waiting-msg: "lock-queue:wait-turn; waiting on journal file")
︙

︙
48 49 50 51 52 53 54 55 56 57 58 ~~59 60~~ 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 ~~110~~ 111 112 113 114 115 116 117 118 119 120 121 122 123 124 ~~125~~ 126 127 128 129 130 131 132	48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132	- + - - + + - + - + - + - +	(offset 0) (limit 500)) ;; (print "runsdat: " runsdat) (let* ((header (vector-ref runsdat 0)) (runslst (vector-ref runsdat 1)) (full-list (append res runslst)) (have-more (eq? (length runslst) limit))) ~~;; (debug:print 0 "header: " header " runslst: " runslst " have-more: " have-more)~~ ;; (debug:print 0 #f "header: " header " runslst: " runslst " have-more: " have-more) (if have-more (let ((new-offset (+ offset limit)) (next-batch (rmt:get-runs-by-patt keys runnamepatt targpatt offset limit #f))) ~~(debug:print-info 4 "More than " limit " runs, have " (length full-list) " runs so far.") (debug:print-info 0 "next-batch: " next-batch)~~ (debug:print-info 4 #f "More than " limit " runs, have " (length full-list) " runs so far.") (debug:print-info 0 #f "next-batch: " next-batch) (loop next-batch full-list new-offset limit)) (vector header full-list))))) ;;====================================================================== ;; T E S T S ;;====================================================================== (define (mt:get-tests-for-run run-id testpatt states status #!key (not-in #t) (sort-by 'event_time) (sort-order "ASC") (qryvals #f)(last-update #f)) (let loop ((testsdat (rmt:get-tests-for-run run-id testpatt states status 0 500 not-in sort-by sort-order qryvals last-update 'normal)) (res '()) (offset 0) (limit 500)) (let* ((full-list (append res testsdat)) (have-more (eq? (length testsdat) limit))) (if have-more (let ((new-offset (+ offset limit))) ~~(debug:print-info 4 "More than " limit " tests, have " (length full-list) " tests so far.")~~ (debug:print-info 4 #f "More than " limit " tests, have " (length full-list) " tests so far.") (loop (rmt:get-tests-for-run run-id testpatt states status new-offset limit not-in sort-by sort-order qryvals last-update 'normal) full-list new-offset limit)) full-list)))) (define (mt:lazy-get-prereqs-not-met run-id waitons ref-item-path #!key (mode '(normal))(itemmaps #f) ) (let* ((key (list run-id waitons ref-item-path mode)) (res (hash-table-ref/default pre-reqs-met-cache key #f)) (useres (let ((last-time (if (vector? res) (vector-ref res 0) #f))) (if last-time (< (current-seconds)(+ last-time 5)) #f)))) (if useres (let ((result (vector-ref res 1))) ~~(debug:print 4 "Using lazy value res: " result)~~ (debug:print 4 #f "Using lazy value res: " result) result) (let ((newres (rmt:get-prereqs-not-met run-id waitons ref-item-path mode: mode itemmaps: itemmaps))) (hash-table-set! pre-reqs-met-cache key (vector (current-seconds) newres)) newres)))) (define (mt:get-run-stats dbstruct run-id) ;; Get run stats from local access, move this ... but where? (db:get-run-stats dbstruct run-id)) (define (mt:discard-blocked-tests run-id failed-test tests test-records) (if (null? tests) tests (begin ~~(debug:print-info 1 "Discarding tests from " tests " that are waiting on " failed-test)~~ (debug:print-info 1 #f "Discarding tests from " tests " that are waiting on " failed-test) (let loop ((testn (car tests)) (remt (cdr tests)) (res '())) (let* ((test-dat (hash-table-ref/default test-records testn (vector #f #f '()))) (waitons (vector-ref test-dat 2))) ;; (print "mt:discard-blocked-tests run-id: " run-id " failed-test: " failed-test " testn: " testn " with waitons: " waitons) (if (null? remt) (let ((new-res (reverse res))) ;; (print " new-res: " new-res) new-res) (loop (car remt) (cdr remt) (if (member failed-test waitons) (begin ~~(debug:print 0 "Discarding test " testn "(" test-dat ") due to " failed-test)~~ (debug:print 0 #f "Discarding test " testn "(" test-dat ") due to " failed-test) res) (cons testn res))))))))) ;;====================================================================== ;; T R I G G E R S ;;======================================================================
︙
154 155 156 157 158 159 160 ~~161~~ 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 ~~178~~ 179 180 181 182 183 184 185	154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185	- + - +	(let ((cmd (configf:lookup tconfig "triggers" trigger)) (logf (conc test-rundir "/last-trigger.log"))) (if cmd ;; Putting the commandline into ( )'s means no control over the shell. ;; stdout and stderr will be caught in the NBFAKE or mt_launch.log files ;; or equivalent. No need to do this. Just run it? (let ((fullcmd (conc cmd " " test-id " " test-rundir " " trigger "&"))) ~~(debug:print-info 0 "TRIGGERED on " trigger ", running command " fullcmd)~~ (debug:print-info 0 #f "TRIGGERED on " trigger ", running command " fullcmd) (process-run fullcmd))))) (list (conc state "/" status) (conc state "/") (conc "/" status))) (pop-directory)) )))))) ;;====================================================================== ;; S T A T E A N D S T A T U S F O R T E S T S ;;====================================================================== ;; speed up for common cases with a little logic (define (mt:test-set-state-status-by-id run-id test-id newstate newstatus newcomment) (if (not (and run-id test-id)) (begin ~~(debug:print 0 "ERROR: bad data handed to mt:test-set-state-status-by-id, run-id=" run-id ", test-id=" test-id ", newstate=" newstate)~~ (debug:print 0 #f "ERROR: bad data handed to mt:test-set-state-status-by-id, run-id=" run-id ", test-id=" test-id ", newstate=" newstate) (print-call-chain (current-error-port)) #f) (begin (cond ((and newstate newstatus newcomment) (rmt:general-call 'state-status-msg run-id newstate newstatus newcomment test-id)) ((and newstate newstatus)
︙
213 214 215 216 217 218 219 ~~220~~ 221 222 223	213 214 215 216 217 218 219 220 221 222 223	- +	(hash-table-set! testconfigs test-name newtcfg) (if old-link-tree (setenv "MT_LINKTREE" old-link-tree) (unsetenv "MT_LINKTREE")) newtcfg)) (if (null? tal) (begin ~~(debug:print 0 "ERROR: No readable testconfig found for " test-name)~~ (debug:print 0 #f "ERROR: No readable testconfig found for " test-name) #f) (loop (car tal)(cdr tal))))))))))

︙
210 211 212 213 214 215 216 ~~217~~ 218 219 220 221 222 223 224	210 211 212 213 214 215 216 217 218 219 220 221 222 223 224	- +	((-1) "monitor.db") ((0) "main.db") (else (conc run-id ".db"))) #f))) (handle-exceptions exn (begin ~~(debug:print 0 "ERROR: Couldn't create path to " dbdir)~~ (debug:print 0 #f "ERROR: Couldn't create path to " dbdir) (exit 1)) (if (not (directory? dbdir))(create-directory dbdir #t))) (if fname (conc dbdir "/" fname) dbdir))) ;; -1 => monitor.db
︙
238 239 240 241 242 243 244 ~~245~~ 246 247 248 249 250 251 252	238 239 240 241 242 243 244 245 246 247 248 249 250 251 252	- +	#f))))) (if db db ;; merely return the already opened db (let* ((dbfile (areadb:dbfile-path areadat run-id)) ;; not already opened, so open it (db (if (file-exists? dbfile) (open-database dbfile) (begin ~~(debug:print 0 "ERROR: I was asked to open " dbfile ", but file does not exist or is not readable.")~~ (debug:print 0 #f "ERROR: I was asked to open " dbfile ", but file does not exist or is not readable.") #f)))) (case run-id ((-1)(areadat-monitordb-set! areadat db)) ((0) (areadat-maindb-set! areadat db)) (else (rundat-db-set! rundat db))) db))))
︙
261 262 263 264 265 266 267 ~~268~~ 269 270 271 272 273 274 275	261 262 263 264 265 266 267 268 269 270 271 272 273 274 275	- +	(let ((id (list-ref row 0)) (dat (apply make-rundat (append row (list #f #f))))) ;; add placeholders for tests and db (print row) (hash-table-set! runs id dat)))) (sql maindb (conc "SELECT id," (string-intersperse keys "||'/'||") ",runname,state,status,event_time FROM runs WHERE state != 'deleted';"))) ~~(debug:print 0 "ERROR: no main.db found at " (areadb:dbfile-path areadat 0)))~~ (debug:print 0 #f "ERROR: no main.db found at " (areadb:dbfile-path areadat 0))) areadat)) ;; given an areadat and target/runname patt fill up runs data ;; ;; ?????/ ;; given a list of run-ids refresh/retrieve runs data into areadat
︙
321 322 323 324 325 326 327 ~~328~~ 329 330 331 ~~332~~ 333 334 335 336 337 338 339	321 322 323 324 325 326 327 328 329 330 331 332 333 334 335 336 337 338 339	- + - +	"Areas" (string-intersperse (tree:node->path current-tree current-node) "/"))) (current-matrix (if (null? tab-ids) #f (tab-matrix current-tab))) (seen-nodes (make-hash-table)) (path-changed (if current-tab (equal? current-path (tab-view-path current-tab)) #t))) ~~;; (debug:print-info 0 "Current path: " current-path)~~ ;; (debug:print-info 0 #f "Current path: " current-path) ;; now for each area in the window gather the data (if path-changed (begin ~~(debug:print-info 0 "clearing matrix - path changed")~~ (debug:print-info 0 #f "clearing matrix - path changed") (dboard:clear-matrix current-tab))) (for-each (lambda (area-name) ;; (print "Processing for area-name " area-name) (let* ((area-dat (hash-table-ref areas area-name)) (area-path (areadat-path area-dat)) (runs (areadat-runs area-dat)))
︙
483 484 485 486 487 488 489 ~~490~~ 491 492 493 494 495 496 497 498 499 500 501 502 503 504 505 506 507 ~~508~~ 509 510 511 512 513 514 515	483 484 485 486 487 488 489 490 491 492 493 494 495 496 497 498 499 500 501 502 503 504 505 506 507 508 509 510 511 512 513 514 515	- + - +	(rows (tab-rows tab-dat)) (used-cols (hash-table-values headers)) (used-rows (hash-table-values rows)) (touched (make-hash-table)) ;; (vector row col) ==> true, touched cell (view-type (dboard:get-view-type keys current-path)) (changed #f) (state-statuses (list "PASS" "FAIL" "WARN" "CHECK" "SKIP" "RUNNING" "LAUNCHED"))) ~~;; (debug:print 0 "current-matrix=" current-matrix)~~ ;; (debug:print 0 #f "current-matrix=" current-matrix) (case view-type ((areas) ;; find row for this area, if not found, create new entry (let* ((curr-rownum (hash-table-ref/default rows area-name #f)) (next-rownum (+ (apply max (cons 0 used-rows)) 1)) (rownum (or curr-rownum next-rownum)) (coord (conc rownum ":0"))) (if (not curr-rownum)(hash-table-set! rows area-name rownum)) (if (not (equal? (iup:attribute current-matrix coord) area-name)) (begin (let loop ((hed (car state-statuses)) (tal (cdr state-statuses)) (count 1)) (if (not (equal? (iup:attribute current-matrix (conc "0:" count)) hed)) (iup:attribute-set! current-matrix (conc "0:" count) hed)) (iup:attribute-set! current-matrix (conc rownum ":" count) "0") (if (not (null? tal)) (loop (car tal)(cdr tal)(+ count 1)))) ~~(debug:print-info 0 "view-type=" view-type ", rownum=" rownum ", curr-rownum=" curr-rownum ", next-rownum=" next-rownum ", coord=" coord ", area-name=" area-name)~~ (debug:print-info 0 #f "view-type=" view-type ", rownum=" rownum ", curr-rownum=" curr-rownum ", next-rownum=" next-rownum ", coord=" coord ", area-name=" area-name) (iup:attribute-set! current-matrix coord area-name) (set! changed #t)))))) (if changed (iup:attribute-set! current-matrix "REDRAW" "ALL")))) ;; (dboard:clear-matrix current-matrix used-cols used-rows touched) ;; clear all
︙
571 572 573 574 575 576 577 ~~578~~ 579 580 581 582 583 584 585	571 572 573 574 575 576 577 578 579 580 581 582 583 584 585	- +	area-panels)) (tabs (data-tabs data))) (if (not (null? area-names)) (let loop ((index 0) (hed (car area-names)) (tal (cdr area-names))) ;; (hash-table-set! tabs index hed) ~~(debug:print 0 "Adding area " hed " with index " index " to dashboard")~~ (debug:print 0 #f "Adding area " hed " with index " index " to dashboard") (iup:attribute-set! tabtop (conc "TABTITLE" index) hed) (if (not (null? tal)) (loop (+ index 1)(car tal)(cdr tal))))) tabtop)))) ;;======================================================================
︙
728 729 730 731 732 733 734 ~~735~~ 736 737 738 739 740 741 742 743 744 ~~745~~ 746 747 748 749 750 751 752	728 729 730 731 732 733 734 735 736 737 738 739 740 741 742 743 744 745 746 747 748 749 750 751 752	- + - +	(file-name (pathname-strip-directory fname)) (curr-mtcfgdat (find-config "megatest.config" toppath: (or (get-environment-variable "MT_RUN_AREA_HOME")(current-directory)))) (curr-mtcfg (if (and curr-mtcfgdat (not (null? curr-mtcfgdat)))(cadr curr-mtcfgdat) #f)) (curr-mtpath (if curr-mtcfg (car curr-mtcfgdat) #f))) (if curr-mtpath (begin ~~(debug:print-info 0 "Creating config file " fname)~~ (debug:print-info 0 #f "Creating config file " fname) (if (not (file-exists? dirname)) (create-directory dirname #t)) (with-output-to-file fname (lambda () (let ((aname (pathname-strip-directory curr-mtpath))) (print "[" aname "]") (print "path " curr-mtpath)))) #t) (begin ~~(debug:print-info 0 "Need to create a config but no megatest.config found: " curr-mtcfgdat)~~ (debug:print-info 0 #f "Need to create a config but no megatest.config found: " curr-mtcfgdat) #f)))) ;; ) (define (dboard:read-mtconf apath) (let* ((mtconffile (conc apath "/megatest.config"))) (call-with-environment-variables (list (cons "MT_RUN_AREA_HOME" apath))
︙

︙
60 61 62 63 64 65 66 67 68 69 70 71 72 73 74	60 61 62 63 64 65 66 67 68 69 70 71 72 73 74	- +	(define heartbeat-mutex (make-mutex)) ;;====================================================================== ;; S E R V E R ;;====================================================================== (define (nmsg-transport:run dbstruct hostn run-id server-id #!key (retrynum 1000)) ~~(debug:print 2 "Attempting to start the server ...")~~ (debug:print 2 #f "Attempting to start the server ...") (let* ((start-port (portlogger:open-run-close portlogger:find-port)) (server-thread (make-thread (lambda () (nmsg-transport:try-start-server dbstruct run-id start-port server-id)) "server thread")) (tdbdat (tasks:open-db))) (thread-start! server-thread) (thread-sleep! 0.1)
︙
82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 ~~102~~ 103 ~~104~~ 105 106 107 108 109 110 111	82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111	- + - + - + - +	(tasks:server-set-state! (db:delay-if-busy tdbdat) server-id "running") (thread-start! (make-thread (lambda ()(nmsg-transport:keep-running server-id run-id)) "keep running")) (thread-join! server-thread)) (if (> retrynum 0) (begin ~~(debug:print 0 "WARNING: Failed to connect to server (self) on host " hostn ":" start-port ", trying again.")~~ (debug:print 0 #f "WARNING: Failed to connect to server (self) on host " hostn ":" start-port ", trying again.") (tasks:server-delete-record (db:delay-if-busy tdbdat) server-id "failed to start, never received server alive signature") (portlogger:open-run-close portlogger:set-failed start-port) (nmsg-transport:run dbstruct hostn run-id server-id)) (begin ~~(debug:print 0 "ERROR: could not find an open port to start server on. Giving up")~~ (debug:print 0 #f "ERROR: could not find an open port to start server on. Giving up") (exit 1)))))) (define (nmsg-transport:try-start-server dbstruct run-id portnum server-id) (let ((repsoc (nn-socket 'rep))) (nn-bind repsoc (conc "tcp://:" portnum)) (let loop ((msg-in (nn-recv repsoc))) (let ((dat (db:string->obj msg-in transport: 'nmsg))) ~~(debug:print 0 "server, received: " dat)~~ (debug:print 0 #f "server, received: " dat) (let ((result (api:execute-requests dbstruct dat))) ~~(debug:print 0 "server, sending: " result)~~ (debug:print 0 #f "server, sending: " result) (nn-send repsoc (db:obj->string result transport: 'nmsg))) (loop (nn-recv repsoc)))))) ;; all routes though here end in exit ... ;; (define (nmsg-transport:launch run-id) (let* ((tdbdat (tasks:open-db))
︙
120 121 122 123 124 125 126 ~~127~~ 128 129 130 131 132 133 134 135 136 137 138 ~~139~~ 140 141 142 ~~143~~ 144 145 146 147 148 149 150	120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150	- + - + - +	;; (daemon:ize) ;; (if alt-log-file ;; we should re-connect to this port, I think daemon:ize disrupts it ;; (begin ;; (current-error-port alt-log-file) ;; (current-output-port alt-log-file))))) (if (server:check-if-running run-id) (begin ~~(debug:print-info 0 "Server for run-id " run-id " already running")~~ (debug:print-info 0 #f "Server for run-id " run-id " already running") (exit 0))) (let loop ((server-id (tasks:server-lock-slot (db:delay-if-busy tdbdat) run-id)) (remtries 4)) (if (not server-id) (if (> remtries 0) (begin (thread-sleep! 2) (if (not (server:check-if-running run-id)) (loop (tasks:server-lock-slot (db:delay-if-busy tdbdat) run-id) (- remtries 1)) (begin ~~(debug:print-info 0 "Another server took the slot, exiting")~~ (debug:print-info 0 #f "Another server took the slot, exiting") (exit 0)))) (begin ;; since we didn't get the server lock we are going to clean up and bail out ~~(debug:print-info 2 "INFO: server pid=" (current-process-id) ", hostname=" (get-host-name) " not starting due to other candidates ahead in start queue")~~ (debug:print-info 2 #f "INFO: server pid=" (current-process-id) ", hostname=" (get-host-name) " not starting due to other candidates ahead in start queue") (tasks:server-delete-records-for-this-pid (db:delay-if-busy tdbdat) " http-transport:launch") )) ;; locked in a server id, try to start up (nmsg-transport:run dbstruct hostn run-id server-id)) (set! didsomething #t) (exit))))
︙
182 183 184 185 186 187 188 ~~189~~ 190 191 192 193 194 195 196	182 183 184 185 186 187 188 189 190 191 192 193 194 195 196	- +	(dat (vector "ping" our-key)) (result (condition-case (nmsg-transport:client-api-send-receive-raw req dat timeout: timeout) ((timeout)(set! success #f) #f))) (key (if success (vector-ref result 1) #f))) ~~(debug:print 0 "success=" success ", key=" key ", expected-key=" expected-key ", equal? " (equal? key expected-key))~~ (debug:print 0 #f "success=" success ", key=" key ", expected-key=" expected-key ", equal? " (equal? key expected-key)) (if (and success (or (not expected-key) ;; just getting a reply is good enough then (equal? key expected-key))) (if return-socket req (begin (if (not socket)(nn-close req)) ;; don't want a side effect of closing socket if handed it
︙
216 217 218 219 220 221 222 ~~223~~ 224 225 226 227 228 229 230	216 217 218 219 220 221 222 223 224 225 226 227 228 229 230	- +	(set! success #t) (set! result (db:string->obj res transport: 'nmsg)))) "send-recv")) (timeout (make-thread (lambda () (let loop ((count 0)) (thread-sleep! 1) ~~(debug:print-info 1 "send-receive-raw, still waiting after " count " seconds...")~~ (debug:print-info 1 #f "send-receive-raw, still waiting after " count " seconds...") (if (and keepwaiting (< count timeout)) ;; yes, this is very aproximate (loop (+ count 1)))) (if keepwaiting (begin (print "timeout waiting for ping") (thread-terminate! send-recv)))) "timeout")))
︙
238 239 240 241 242 243 244 ~~245 246~~ 247 ~~248~~ 249 250 251 252 253 254 255 256 257 258 259 260 261 262 263 264 265 266 267 ~~268~~ 269 270 271 272 273 274 275	238 239 240 241 242 243 244 245 246 247 248 249 250 251 252 253 254 255 256 257 258 259 260 261 262 263 264 265 266 267 268 269 270 271 272 273 274 275	- - + + - + - +	(if success (thread-terminate! timeout))) ;; raise timeout error if timed out (if success (if (and (vector? result) (vector-ref result 0)) ;; did it fail at the server? result ;; nope, all good (begin ~~(debug:print 0 "ERROR: error occured at server, info=" (vector-ref result 2)) (debug:print 0 " client call chain:")~~ (debug:print 0 #f "ERROR: error occured at server, info=" (vector-ref result 2)) (debug:print 0 #f " client call chain:") (print-call-chain (current-error-port)) ~~(debug:print 0 " server call chain:")~~ (debug:print 0 #f " server call chain:") (pp (vector-ref result 1) (current-error-port)) (signal (vector-ref result 0)))) (signal (make-composite-condition (make-property-condition 'timeout 'message "nmsg-transport:client-api-send-receive-raw timed out talking to server")))))) ;; run nmsg-transport:keep-running in a parallel thread to monitor that the db is being ;; used and to shutdown after sometime if it is not. ;; (define (nmsg-transport:keep-running server-id run-id) ;; if none running or if > 20 seconds since ;; server last used then start shutdown ;; This thread waits for the server to come alive (let* ((server-info (let loop () (let ((sdat #f)) (mutex-lock! heartbeat-mutex) (set! sdat server-info) (mutex-unlock! heartbeat-mutex) (if sdat (begin ~~(debug:print-info 0 "keep-running got sdat=" sdat)~~ (debug:print-info 0 #f "keep-running got sdat=" sdat) sdat) (begin (thread-sleep! 0.5) (loop)))))) (iface (car server-info)) (port (cadr server-info)) (last-access 0)
︙
295 296 297 298 299 300 301 ~~302~~ 303 304 ~~305~~ 306 307 308 ~~309~~ 310 311 312 313 314 315 316	295 296 297 298 299 300 301 302 303 304 305 306 307 308 309 310 311 312 313 314 315 316	- + - + - +	(set! last-access last-db-access) (mutex-unlock! heartbeat-mutex) (db:sync-touched inmemdb run-id force-sync: #t) (if (and server-run (> (+ last-access server-timeout) (current-seconds))) (begin ~~(debug:print-info 0 "Server continuing, seconds since last db access: " (- (current-seconds) last-access))~~ (debug:print-info 0 #f "Server continuing, seconds since last db access: " (- (current-seconds) last-access)) (loop 0)) (begin ~~(debug:print-info 0 "Starting to shutdown the server.")~~ (debug:print-info 0 #f "Starting to shutdown the server.") (set! time-to-exit #t) (db:sync-touched inmemdb run-id force-sync: #t) (tasks:server-delete-record (db:delay-if-busy tdbdat) server-id " http-transport:keep-running") ~~(debug:print-info 0 "Server shutdown complete. Exiting")~~ (debug:print-info 0 #f "Server shutdown complete. Exiting") (exit) )))))) ;;====================================================================== ;; C L I E N T S ;;======================================================================
︙
337 338 339 340 341 342 343 ~~344~~ 345 346 347 348 349 ~~350~~ 351 ~~352~~ 353 354 355 356 357 358	337 338 339 340 341 342 343 344 345 346 347 348 349 350 351 352 353 354 355 356 357 358	- + - + - +	;;====================================================================== ;; DO NOT USE ;; (define (nmsg-transport:client-signal-handler signum) (handle-exceptions exn ~~(debug:print " ... exiting ...")~~ (debug:print 0 #f " ... exiting ...") (let ((th1 (make-thread (lambda () (if (not received-response) (receive-message* runremote))) ;; flush out last call if applicable "eat response")) (th2 (make-thread (lambda () ~~(debug:print 0 "ERROR: Received ^C, attempting clean exit. Please be patient and wait a few seconds before hitting ^C again.")~~ (debug:print 0 #f "ERROR: Received ^C, attempting clean exit. Please be patient and wait a few seconds before hitting ^C again.") (thread-sleep! 3) ;; give the flush three seconds to do it's stuff ~~(debug:print 0 " Done.")~~ (debug:print 0 #f " Done.") (exit 4)) "exit on ^C timer"))) (thread-start! th2) (thread-start! th1) (thread-join! th2))))

︙
52 53 54 55 56 57 58 ~~59 60 61~~ 62 63 64 65 66 67 68	52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68	- - - + + +	(define (portlogger:open-run-close proc . params) (let* ((fname (conc "/tmp/." (current-user-name) "-portlogger.db")) (avail (tasks:wait-on-journal fname 10))) ;; wait up to about 10 seconds for the journal to go away (handle-exceptions exn (begin ;; (release-dot-lock fname) ~~(debug:print 0 "ERROR: portlogger:open-run-close failed. " proc " " params) (debug:print 0 " message: " ((condition-property-accessor 'exn 'message) exn)) (debug:print 0 "exn=" (condition->list exn))~~ (debug:print 0 #f "ERROR: portlogger:open-run-close failed. " proc " " params) (debug:print 0 #f " message: " ((condition-property-accessor 'exn 'message) exn)) (debug:print 0 #f "exn=" (condition->list exn)) (if (file-exists? fname)(delete-file fname)) ;; brutally get rid of it (print-call-chain (current-error-port))) (let* (;; (lock (obtain-dot-lock fname 2 9 10)) (db (portlogger:open-db fname)) (res (apply proc db params))) (sqlite3:finalize! db) ;; (release-dot-lock fname)
︙
99 100 101 102 103 104 105 ~~106 107 108~~ 109 ~~110~~ 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 ~~131 132 133~~ 134 ~~135~~ 136 137 138 139 140 141 142	99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142	- - - + + + - + - - - + + + - +	(sqlite3:finalize! qry3) res)) (define (portlogger:get-prev-used-port db) (handle-exceptions exn (begin (debug:print 0 "EXCEPTION: portlogger database probably overloaded or unreadable. If you see this message again remove /tmp/.$USER-portlogger.db") (debug:print 0 " message: " ((condition-property-accessor 'exn 'message) exn)) (debug:print 0 "exn=" (condition->list exn)) (debug:print 0 #f "EXCEPTION: portlogger database probably overloaded or unreadable. If you see this message again remove /tmp/.$USER-portlogger.db") (debug:print 0 #f " message: " ((condition-property-accessor 'exn 'message) exn)) (debug:print 0 #f "exn=" (condition->list exn)) (print-call-chain (current-error-port)) ~~(debug:print 0 "Continuing anyway.")~~ (debug:print 0 #f "Continuing anyway.") #f) (sqlite3:fold-row (lambda (var curr) (or curr var curr)) #f db "SELECT (port) FROM ports WHERE state='released' LIMIT 1;"))) (define (portlogger:find-port db) (let* ((lowport (let ((val (configf:lookup configdat "server" "lowport"))) (if (and val (string->number val)) (string->number val) 32768))) (portnum (or (portlogger:get-prev-used-port db) (+ lowport ;; top of registered ports is 49152 but lets use ports in the registered range (random (- 64000 lowport)))))) (handle-exceptions exn (begin (debug:print 0 "EXCEPTION: portlogger database probably overloaded or unreadable. If you see this message again remove /tmp/.$USER-portlogger.db") (debug:print 0 " message: " ((condition-property-accessor 'exn 'message) exn)) (debug:print 0 "exn=" (condition->list exn)) (debug:print 0 #f "EXCEPTION: portlogger database probably overloaded or unreadable. If you see this message again remove /tmp/.$USER-portlogger.db") (debug:print 0 #f " message: " ((condition-property-accessor 'exn 'message) exn)) (debug:print 0 #f "exn=" (condition->list exn)) (print-call-chain (current-error-port)) ~~(debug:print 0 "Continuing anyway."))~~ (debug:print 0 #f "Continuing anyway.")) (portlogger:take-port db portnum)) portnum)) ;; set port to "released", "failed" etc. ;; (define (portlogger:set-port db portnum value) (sqlite3:execute db "UPDATE ports SET state=?,update_time=strftime('%s','now') WHERE port=?;" value portnum))
︙
154 155 156 157 158 159 160 ~~161 162~~ 163 ~~164~~ 165 166 167 168 169 170 171	154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171	- - + + - +	(let* ((dbfname (conc "/tmp/." (current-user-name) "-portlogger.db")) (db (portlogger:open-db dbfname)) (numargs (length args)) (result (handle-exceptions exn (begin ~~(debug:print 0 "EXCEPTION: portlogger database at " dbfname " probably overloaded or unreadable. Try removing it.") (debug:print 0 " message: " ((condition-property-accessor 'exn 'message) exn))~~ (debug:print 0 #f "EXCEPTION: portlogger database at " dbfname " probably overloaded or unreadable. Try removing it.") (debug:print 0 #f " message: " ((condition-property-accessor 'exn 'message) exn)) (print "exn=" (condition->list exn)) ~~(debug:print 0 " status: " ((condition-property-accessor 'sqlite3 'status) exn))~~ (debug:print 0 #f " status: " ((condition-property-accessor 'sqlite3 'status) exn)) (print-call-chain (current-error-port)) #f) (case (string->symbol (car args)) ;; commands with two or more params ((take)(portlogger:take-port db (string->number (cadr args)))) ((find)(portlogger:find-port db)) ((set) (let ((port (cadr args)) (state (caddr args)))
︙

︙
50 51 52 53 54 55 56 57 58 59 60 61 62 63 64	50 51 52 53 54 55 56 57 58 59 60 61 62 63 64	- +	(define (process:cmd-run-proc-each-line cmd proc . params) ;; (print "Called with cmd=" cmd ", proc=" proc ", params=" params) (handle-exceptions exn (begin (print "ERROR: Failed to run command: " cmd " " (string-intersperse params " ")) ~~(debug:print 0 " message: " ((condition-property-accessor 'exn 'message) exn))~~ (debug:print 0 #f " message: " ((condition-property-accessor 'exn 'message) exn)) (print "exn=" (condition->list exn)) #f) (let-values (((fh fho pid) (if (null? params) (process cmd) (process cmd params)))) (let loop ((curr (read-line fh)) (result '()))
︙
102 103 104 105 106 107 108 ~~109~~ 110 111 112 113 114 115 116	102 103 104 105 106 107 108 109 110 111 112 113 114 115 116	- +	(append result (list curr))) result)))) ;; here is an example line where the shell is sh or bash ;; "find / -print 2&>1 > findall.log" (define (run-n-wait cmdline #!key (params #f)(print-cmd #f)) (if print-cmd ~~(debug:print 0~~ (debug:print 0 #f (if (string? print-cmd) print-cmd "") cmdline (if params (string-intersperse params " ") "")))
︙

︙
51 52 53 54 55 56 57 58 59 60 61 62 63 64 65	51 52 53 54 55 56 57 58 59 60 61 62 63 64 65	- +	(start (vector-ref record 0)) (queries-per-second (/ (* count 1.0) (max (- (current-seconds) start) 1)))) (vector-set! record 1 count) (if (and (> count 10) (> queries-per-second 10)) (begin ~~(debug:print-info 1 "db write rate too high, starting a server, count=" count " start=" start " run-id=" run-id " queries-per-second=" queries-per-second)~~ (debug:print-info 1 #f "db write rate too high, starting a server, count=" count " start=" start " run-id=" run-id " queries-per-second=" queries-per-second) #t) #f)))) ;; if a server is either running or in the process of starting call client:setup ;; else return #f to let the calling proc know that there is no server available ;; (define (rmt:get-connection-info run-id)
︙
79 80 81 82 83 84 85 86 87 88 89 90 91 92 93	79 80 81 82 83 84 85 86 87 88 89 90 91 92 93	- +	(let ((expire-time (- (current-seconds) (server:get-timeout) 10))) ;; don't forget the 10 second margin (for-each (lambda (run-id) (let ((connection (hash-table-ref/default runremote run-id #f))) (if (and (vector? connection) (< (http-transport:server-dat-get-last-access connection) expire-time)) (begin ~~(debug:print-info 0 "Discarding connection to server for run-id " run-id ", too long between accesses")~~ (debug:print-info 0 #f "Discarding connection to server for run-id " run-id ", too long between accesses") ;; SHOULD CLOSE THE CONNECTION HERE (case transport-type ((nmsg)(nn-close (http-transport:server-dat-get-socket (hash-table-ref runremote run-id))))) (hash-table-delete! runremote run-id))))) (hash-table-keys runremote))) ;; (mutex-unlock! db-multi-sync-mutex)
︙
112 113 114 115 116 117 118 ~~119~~ 120 121 122 123 124 125 126	112 113 114 115 116 117 118 119 120 121 122 123 124 125 126	- +	(if success (begin ;; (mutex-unlock! send-receive-mutex) (case transport-type ((http) res) ;; (db:string->obj res)) ((nmsg) res))) ;; (vector-ref res 1))) (begin ;; let ((new-connection-info (client:setup run-id))) ~~(debug:print 0 "WARNING: Communication failed, trying call to rmt:send-receive again.")~~ (debug:print 0 #f "WARNING: Communication failed, trying call to rmt:send-receive again.") ;; (case transport-type ;; ((nmsg)(nn-close (http-transport:server-dat-get-socket connection-info)))) (hash-table-delete! runremote run-id) ;; don't keep using the same connection ;; NOTE: killing server causes this process to block forever. No idea why. Dec 2. ;; (if (eq? (modulo attemptnum 5) 0) ;; (tasks:kill-server-run-id run-id tag: "api-send-receive-failed")) ;; (mutex-unlock! send-receive-mutex) ;; close the mutex here to allow other threads access to communications
︙
151 152 153 154 155 156 157 ~~158~~ 159 160 161 162 163 ~~164~~ 165 166 167 168 169 170 171 172 173 174 ~~175 176~~ 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 ~~192 193~~ 194 195 ~~196~~ 197 198 199 200 201 202 203	151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202 203	- + - + - - + + - - + + - +	(let ((start-time (current-milliseconds)) (max-query (string->number (or (configf:lookup configdat "server" "server-query-threshold") "300"))) (newres (rmt:open-qry-close-locally cmd run-id params))) (let ((delta (- (current-milliseconds) start-time))) (if (> delta max-query) (begin ~~(debug:print-info 0 "Starting server as query time " delta " is over the limit of " max-query)~~ (debug:print-info 0 #f "Starting server as query time " delta " is over the limit of " max-query) (server:kind-run run-id))) ;; return the result! newres) ))) (begin ~~;; (debug:print 0 "ERROR: Communication failed!")~~ ;; (debug:print 0 #f "ERROR: Communication failed!") ;; (mutex-unlock! send-receive-mutex) ;; (exit) (rmt:open-qry-close-locally cmd run-id params) ))))) (define (rmt:update-db-stats run-id rawcmd params duration) (mutex-lock! db-stats-mutex) (handle-exceptions exn (begin ~~(debug:print 0 "WARNING: stats collection failed in update-db-stats") (debug:print 0 " message: " ((condition-property-accessor 'exn 'message) exn))~~ (debug:print 0 #f "WARNING: stats collection failed in update-db-stats") (debug:print 0 #f " message: " ((condition-property-accessor 'exn 'message) exn)) (print "exn=" (condition->list exn)) #f) ;; if this fails we don't care, it is just stats (let* ((cmd (conc "run-id=" run-id " " (if (eq? rawcmd 'general-call) (car params) rawcmd))) (stat-vec (hash-table-ref/default db-stats cmd #f))) (if (not (vector? stat-vec)) (let ((newvec (vector 0 0))) (hash-table-set! db-stats cmd newvec) (set! stat-vec newvec))) (vector-set! stat-vec 0 (+ (vector-ref stat-vec 0) 1)) (vector-set! stat-vec 1 (+ (vector-ref stat-vec 1) duration)))) (mutex-unlock! db-stats-mutex)) (define (rmt:print-db-stats) (let ((fmtstr "~40a~7-d~9-d~20,2-f")) ;; "~20,2-f" ~~(debug:print 18 "DB Stats\n========") (debug:print 18 (format #f "~40a~8a~10a~10a" "Cmd" "Count" "TotTime" "Avg"))~~ (debug:print 18 #f "DB Stats\n========") (debug:print 18 #f (format #f "~40a~8a~10a~10a" "Cmd" "Count" "TotTime" "Avg")) (for-each (lambda (cmd) (let ((cmd-dat (hash-table-ref db-stats cmd))) ~~(debug:print 18 (format #f fmtstr cmd (vector-ref cmd-dat 0) (vector-ref cmd-dat 1) (/ (vector-ref cmd-dat 1)(vector-ref cmd-dat 0))))))~~ (debug:print 18 #f (format #f fmtstr cmd (vector-ref cmd-dat 0) (vector-ref cmd-dat 1) (/ (vector-ref cmd-dat 1)(vector-ref cmd-dat 0)))))) (sort (hash-table-keys db-stats) (lambda (a b) (> (vector-ref (hash-table-ref db-stats a) 0) (vector-ref (hash-table-ref db-stats b) 0))))))) (define (rmt:get-max-query-average run-id) (mutex-lock! db-stats-mutex)
︙
237 238 239 240 241 242 243 ~~244~~ 245 246 247 ~~248~~ 249 250 251 252 253 254 255	237 238 239 240 241 242 243 244 245 246 247 248 249 250 251 252 253 254 255	- + - +	(resdat (api:execute-requests dbstruct-local (vector (symbol->string cmd) params))) (success (vector-ref resdat 0)) (res (vector-ref resdat 1)) (duration (- (current-milliseconds) start))) (if (not success) (if (> remretries 0) (begin ~~(debug:print 0 "ERROR: local query failed. Trying again.")~~ (debug:print 0 #f "ERROR: local query failed. Trying again.") (thread-sleep! (/ (random 5000) 1000)) ;; some random delay (rmt:open-qry-close-locally cmd run-id params remretries: (- remretries 1))) (begin ~~(debug:print 0 "ERROR: too many retries in rmt:open-qry-close-locally, giving up")~~ (debug:print 0 #f "ERROR: too many retries in rmt:open-qry-close-locally, giving up") #f)) (begin ;; (rmt:update-db-stats run-id cmd params duration) ;; mark this run as dirty if this was a write (if (not (member cmd api:read-only-queries)) (let ((start-time (current-seconds))) (mutex-lock! db-multi-sync-mutex)
︙
268 269 270 271 272 273 274 ~~275~~ 276 277 278 279 280 281 282	268 269 270 271 272 273 274 275 276 277 278 279 280 281 282	- +	(http-transport:client-api-send-receive run-id connection-info cmd params)))) ;; ((commfail) (vector #f "communications fail"))))) (if (and res (vector-ref res 0)) (vector-ref res 1) ;;; YES!! THIS IS CORRECT!! CHANGE IT HERE, THEN CHANGE rmt:send-receive ALSO!!! #f))) ;; (db:string->obj (vector-ref dat 1)) ;; (begin ~~;; (debug:print 0 "ERROR: rmt:send-receive-no-auto-client-setup failed, attempting to continue. Got " dat)~~ ;; (debug:print 0 #f "ERROR: rmt:send-receive-no-auto-client-setup failed, attempting to continue. Got " dat) ;; dat)))) ;; Wrap json library for strings (why the ports crap in the first place?) (define (rmt:dat->json-str dat) (with-output-to-string (lambda () (json-write dat))))
︙
363 364 365 366 367 368 369 ~~370~~ 371 372 373 374 375 376 377 378 379 380 ~~381~~ 382 383 384 385 386 387 388 389 390 391 392 393 394 ~~395~~ 396 397 398 399 400 401 402	363 364 365 366 367 368 369 370 371 372 373 374 375 376 377 378 379 380 381 382 383 384 385 386 387 388 389 390 391 392 393 394 395 396 397 398 399 400 401 402	- + - + - +	(define (rmt:get-test-id run-id testname item-path) (rmt:send-receive 'get-test-id run-id (list run-id testname item-path))) (define (rmt:get-test-info-by-id run-id test-id) (if (and (number? run-id)(number? test-id)) (rmt:send-receive 'get-test-info-by-id run-id (list run-id test-id)) (begin ~~(debug:print 0 "WARNING: Bad data handed to rmt:get-test-info-by-id run-id=" run-id ", test-id=" test-id)~~ (debug:print 0 #f "WARNING: Bad data handed to rmt:get-test-info-by-id run-id=" run-id ", test-id=" test-id) (print-call-chain (current-error-port)) #f))) (define (rmt:test-get-rundir-from-test-id run-id test-id) (rmt:send-receive 'test-get-rundir-from-test-id run-id (list run-id test-id))) (define (rmt:open-test-db-by-test-id run-id test-id #!key (work-area #f)) (let* ((test-path (if (string? work-area) work-area (rmt:test-get-rundir-from-test-id run-id test-id)))) ~~(debug:print 3 "TEST PATH: " test-path)~~ (debug:print 3 #f "TEST PATH: " test-path) (open-test-db test-path))) ;; WARNING: This currently bypasses the transaction wrapped writes system (define (rmt:test-set-state-status-by-id run-id test-id newstate newstatus newcomment) (rmt:send-receive 'test-set-state-status-by-id run-id (list run-id test-id newstate newstatus newcomment))) (define (rmt:set-tests-state-status run-id testnames currstate currstatus newstate newstatus) (rmt:send-receive 'set-tests-state-status run-id (list run-id testnames currstate currstatus newstate newstatus))) (define (rmt:get-tests-for-run run-id testpatt states statuses offset limit not-in sort-by sort-order qryvals last-update mode) (if (number? run-id) (rmt:send-receive 'get-tests-for-run run-id (list run-id testpatt states statuses offset limit not-in sort-by sort-order qryvals last-update mode)) (begin ~~(debug:print "ERROR: rmt:get-tests-for-run called with bad run-id=" run-id)~~ (debug:print 0 #f "ERROR: rmt:get-tests-for-run called with bad run-id=" run-id) (print-call-chain (current-error-port)) '()))) ;; get stuff via synchash (define (rmt:synchash-get run-id proc synckey keynum params) (rmt:send-receive 'synchash-get run-id (list run-id proc synckey keynum params)))
︙
419 420 421 422 423 424 425 ~~426~~ 427 428 429 430 431 432 433	419 420 421 422 423 424 425 426 427 428 429 430 431 432 433	- +	(lambda () (let ((res (rmt:send-receive 'get-tests-for-run-mindata hed (list hed testpatt states status not-in)))) (if (list? res) (begin (mutex-lock! multi-run-mutex) (set! result (append result res)) (mutex-unlock! multi-run-mutex)) ~~(debug:print 0 "ERROR: get-tests-for-run-mindata failed for run-id " hed ", testpatt " testpatt ", states " states ", status " status ", not-in " not-in))))~~ (debug:print 0 #f "ERROR: get-tests-for-run-mindata failed for run-id " hed ", testpatt " testpatt ", states " states ", status " status ", not-in " not-in)))) (conc "multi-run-thread for run-id " hed))) (newthreads (cons newthread threads))) (thread-start! newthread) (thread-sleep! 0.05) ;; give that thread some time to start (if (null? tal) newthreads (loop (car tal)(cdr tal) newthreads))))))
︙
613 614 615 616 617 618 619 ~~620~~ 621 622 623 624 ~~625~~ 626 627 628 629 630 631 632	613 614 615 616 617 618 619 620 621 622 623 624 625 626 627 628 629 630 631 632	- + - +	(selstr (string-intersperse keys ",")) (qrystr (string-intersperse (map (lambda (x)(conc x "=?")) keys) " AND "))) (if (not keyvals) #f (let ((prev-run-ids (rmt:get-prev-run-ids run-id))) ;; for each run starting with the most recent look to see if there is a matching test ;; if found then return that matching test record ~~(debug:print 4 "selstr: " selstr ", qrystr: " qrystr ", keyvals: " keyvals ", previous run ids found: " prev-run-ids)~~ (debug:print 4 #f "selstr: " selstr ", qrystr: " qrystr ", keyvals: " keyvals ", previous run ids found: " prev-run-ids) (if (null? prev-run-ids) #f (let loop ((hed (car prev-run-ids)) (tal (cdr prev-run-ids))) (let ((results (rmt:get-tests-for-run hed (conc test-name "/" item-path) '() '() #f #f #f #f #f #f #f 'normal))) ~~(debug:print 4 "Got tests for run-id " run-id ", test-name " test-name ", item-path " item-path ": " results)~~ (debug:print 4 #f "Got tests for run-id " run-id ", test-name " test-name ", item-path " item-path ": " results) (if (and (null? results) (not (null? tal))) (loop (car tal)(cdr tal)) (if (null? results) #f (car results)))))))))) ;;======================================================================
︙
645 646 647 648 649 650 651 ~~652~~ 653 654 655 656 657 658 659	645 646 647 648 649 650 651 652 653 654 655 656 657 658 659	- +	;;(define (rmt:get-steps-for-test run-id test-id) ;; (rmt:send-receive 'get-steps-data run-id (list test-id))) (define (rmt:teststep-set-status! run-id test-id teststep-name state-in status-in comment logfile) (let* ((state (items:check-valid-items "state" state-in)) (status (items:check-valid-items "status" status-in))) (if (or (not state)(not status)) ~~(debug:print 3 "WARNING: Invalid " (if status "status" "state")~~ (debug:print 3 #f "WARNING: Invalid " (if status "status" "state") " value \"" (if status state-in status-in) "\", update your validvalues section in megatest.config")) (rmt:send-receive 'teststep-set-status! run-id (list run-id test-id teststep-name state-in status-in comment logfile)))) (define (rmt:get-steps-for-test run-id test-id) (rmt:send-receive 'get-steps-for-test run-id (list run-id test-id))) ;;======================================================================
︙

︙
25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74	25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74	- + - + - + - +	(include "db_records.scm") ;; procstr is the name of the procedure to be called as a string (define (rpc-transport:autoremote procstr params) (handle-exceptions exn (begin ~~(debug:print 1 "Remote failed for " proc " " params)~~ (debug:print 1 #f "Remote failed for " proc " " params) (apply (eval (string->symbol procstr)) params)) ;; (if runremote ;; (apply (eval (string->symbol (conc "remote:" procstr))) params) (apply (eval (string->symbol procstr)) params))) ;; all routes though here end in exit ... ;; ;; start_server? ;; (define (rpc-transport:launch run-id) (set! run-id run-id) (if (args:get-arg "-daemonize") (daemon:ize)) (if (server:check-if-running run-id) (begin ~~(debug:print 0 "INFO: Server for run-id " run-id " already running")~~ (debug:print 0 #f "INFO: Server for run-id " run-id " already running") (exit 0))) (let loop ((server-id (open-run-close tasks:server-lock-slot tasks:open-db run-id)) (remtries 4)) (if (not server-id) (if (> remtries 0) (begin (thread-sleep! 2) (loop (open-run-close tasks:server-lock-slot tasks:open-db run-id) (- remtries 1))) (begin ;; since we didn't get the server lock we are going to clean up and bail out ~~(debug:print-info 2 "INFO: server pid=" (current-process-id) ", hostname=" (get-host-name) " not starting due to other candidates ahead in start queue")~~ (debug:print-info 2 #f "INFO: server pid=" (current-process-id) ", hostname=" (get-host-name) " not starting due to other candidates ahead in start queue") (open-run-close tasks:server-delete-records-for-this-pid tasks:open-db " rpc-transport:launch"))) (begin (rpc-transport:run (if (args:get-arg "-server")(args:get-arg "-server") "-") run-id server-id) (exit))))) (define (rpc-transport:run hostn run-id server-id) ~~(debug:print 2 "Attempting to start the rpc server ...")~~ (debug:print 2 #f "Attempting to start the rpc server ...") ;; (trace rpc:publish-procedure!) (rpc:publish-procedure! 'server:login server:login) (rpc:publish-procedure! 'testing (lambda () "Just testing")) (let* ((db #f) (hostname (get-host-name))
︙
97 98 99 100 101 102 103 ~~104~~ 105 106 107 108 109 110 111	97 98 99 100 101 102 103 104 105 106 107 108 109 110 111	- +	(tdb (tasks:open-db))) (thread-start! th1) (set! db inmemdb) (open-run-close tasks:server-set-interface-port tasks:open-db server-id ipaddrstr portnum) ~~(debug:print 0 "Server started on " host:port)~~ (debug:print 0 #f "Server started on " host:port) ;; (trace rpc:publish-procedure!) ;; (rpc:publish-procedure! 'server:login server:login) ;; (rpc:publish-procedure! 'testing (lambda () "Just testing")) ;;====================================================================== ;; ;; end of publish-procedure section
︙
121 122 123 124 125 126 127 ~~128~~ 129 130 ~~131~~ 132 133 ~~134 135~~ 136 137 138 139 140 141 142	121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142	- + - + - - + +	;; server last used then start shutdown (let loop ((count 0)) (thread-sleep! 5) ;; no need to do this very often (let ((numrunning -1)) ;; (db:get-count-tests-running db))) (if (or (> numrunning 0) (> (+ last-db-access 60)(current-seconds))) (begin ~~(debug:print-info 0 "Server continuing, tests running: " numrunning ", seconds since last db access: " (- (current-seconds) last-db-access))~~ (debug:print-info 0 #f "Server continuing, tests running: " numrunning ", seconds since last db access: " (- (current-seconds) last-db-access)) (loop (+ 1 count))) (begin ~~(debug:print-info 0 "Starting to shutdown the server side")~~ (debug:print-info 0 #f "Starting to shutdown the server side") (open-run-close tasks:server-delete-record tasks:open-db server-id " rpc-transport:try-start-server stop") (thread-sleep! 10) ~~(debug:print-info 0 "Max cached queries was " max-cache-size) (debug:print-info 0 "Server shutdown complete. Exiting")~~ (debug:print-info 0 #f "Max cached queries was " max-cache-size) (debug:print-info 0 #f "Server shutdown complete. Exiting") )))))) (define (rpc-transport:find-free-port-and-open port) (handle-exceptions exn (begin (print "Failed to bind to port " (rpc:default-server-port) ", trying next port")
︙
160 161 162 163 164 165 166 ~~167~~ 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 ~~183~~ 184 185 186 187 188 189 190	160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190	- + - +	(begin (print "LOGIN_FAILED") (exit 1)))))) (define (rpc-transport:client-setup run-id #!key (remtries 10)) (if runremote (begin ~~(debug:print 0 "ERROR: Attempt to connect to server but already connected")~~ (debug:print 0 #f "ERROR: Attempt to connect to server but already connected") #f) (let* ((host-info (hash-table-ref/default runremote run-id #f))) ;; (open-run-close db:get-var #f "SERVER")) (if host-info (let ((iface (car host-info)) (port (cadr host-info)) (ping-res ((rpc:procedure 'server:login host port) toppath))) (if ping-res (let ((server-dat (list iface port #f #f #f))) (hash-table-set! runremote run-id server-dat) server-dat) (begin (server:try-running run-id) (thread-sleep! 2) (rpc-transport:client-setup run-id (- remtries 1))))) (let* ((server-db-info (open-run-close tasks:get-server tasks:open-db run-id))) ~~(debug:print-info 0 "client:setup server-dat=" server-dat ", remaining-tries=" remaining-tries)~~ (debug:print-info 0 #f "client:setup server-dat=" server-dat ", remaining-tries=" remaining-tries) (if server-db-info (let* ((iface (tasks:hostinfo-get-interface server-db-info)) (port (tasks:hostinfo-get-port server-db-info)) (server-dat (list iface port #f #f #f)) (ping-res ((rpc:procedure 'server:login host port) toppath))) (if start-res (begin
︙
199 200 201 202 203 204 205 ~~206~~ 207 208 209 ~~210 211~~ 212 213 214 215 216 217 218 219 ~~220~~ 221 222 ~~223~~ 224 ~~225~~ 226	199 200 201 202 203 204 205 206 207 208 209 210 211 212 213 214 215 216 217 218 219 220 221 222 223 224 225 226	- + - - + + - + - + - +	(thread-sleep! 2) (rpc-transport:client-setup run-id (- remtries 1))))))))) ;; ;; (port (if (and hostinfo (> (length hostdat) 1))(cadr hostdat) #f))) ;; (if (and port ;; (string->number port)) ;; (let ((portn (string->number port))) ~~;; (debug:print-info 2 "Setting up to connect to host " host ":" port)~~ ;; (debug:print-info 2 #f "Setting up to connect to host " host ":" port) ;; (handle-exceptions ;; exn ;; (begin ~~;; (debug:print 0 "ERROR: Failed to open a connection to the server at host: " host " port: " port) ;; (debug:print 0 " EXCEPTION: " ((condition-property-accessor 'exn 'message) exn))~~ ;; (debug:print 0 #f "ERROR: Failed to open a connection to the server at host: " host " port: " port) ;; (debug:print 0 #f " EXCEPTION: " ((condition-property-accessor 'exn 'message) exn)) ;; ;; (open-run-close ;; ;; (lambda (db . param) ;; ;; (sqlite3:execute db "DELETE FROM metadat WHERE var='SERVER'")) ;; ;; #f) ;; (set! runremote #f)) ;; (if (and (not (args:get-arg "-server")) ;; no point in the server using the server using the server ;; ((rpc:procedure 'server:login host portn) toppath)) ;; (begin ~~;; (debug:print-info 2 "Logged in and connected to " host ":" port)~~ ;; (debug:print-info 2 #f "Logged in and connected to " host ":" port) ;; (set! runremote (vector host portn))) ;; (begin ~~;; (debug:print-info 2 "Failed to login or connect to " host ":" port)~~ ;; (debug:print-info 2 #f "Failed to login or connect to " host ":" port) ;; (set! runremote #f))))) ~~;; (debug:print-info 2 "no server available")))))~~ ;; (debug:print-info 2 #f "no server available")))))

︙
38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64	38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64	- + - +	;; SQLITE3 HELPERS ;;====================================================================== (define (db:general-sqlite-error-dump exn stmt run-id params) (let ((err-status ((condition-property-accessor 'sqlite3 'status #f) exn))) ;; check for (exn sqlite3) ((condition-property-accessor 'exn 'message) exn) (print "err-status: " err-status) ~~(debug:print 0 "ERROR: query " stmt " failed, params: " params ", error: " ((condition-property-accessor 'exn 'message) exn))~~ (debug:print 0 #f "ERROR: query " stmt " failed, params: " params ", error: " ((condition-property-accessor 'exn 'message) exn)) (print-call-chain (current-error-port)))) ;; convert to -inline (define (db:first-result-default db stmt default . params) (handle-exceptions exn (let ((err-status ((condition-property-accessor 'sqlite3 'status #f) exn))) ;; check for (exn sqlite3) ((condition-property-accessor 'exn 'message) exn) (if (eq? err-status 'done) default (begin ~~(debug:print 0 "ERROR: query " stmt " failed, params: " params ", error: " ((condition-property-accessor 'exn 'message) exn))~~ (debug:print 0 #f "ERROR: query " stmt " failed, params: " params ", error: " ((condition-property-accessor 'exn 'message) exn)) (print-call-chain (current-error-port)) default))) (apply sqlite3:first-result db stmt params))) ;; Get/open a database ;; if run-id => get run specific db ;; if #f => get main db
︙
109 110 111 112 113 114 115 ~~116~~ 117 118 119 120 121 122 123	109 110 111 112 113 114 115 116 117 118 119 120 121 122 123	- +	(db:get-db dbstruct run-id) dbstruct)) ;; cheat, allow for passing in a dbdat (db (db:dbdat-get-db dbdat))) (db:delay-if-busy dbdat) (handle-exceptions exn (begin ~~(debug:print 0 "ERROR: sqlite3 issue in db:with-db, dbstruct=" dbstruct ", run-id=" run-id ", proc=" proc ", params=" params " error: " ((condition-property-accessor 'exn 'message) exn))~~ (debug:print 0 #f "ERROR: sqlite3 issue in db:with-db, dbstruct=" dbstruct ", run-id=" run-id ", proc=" proc ", params=" params " error: " ((condition-property-accessor 'exn 'message) exn)) (print-call-chain (current-error-port))) (let ((res (apply proc db params))) (if (vector? dbstruct)(db:done-with dbstruct run-id r/w)) res)))) ;;====================================================================== ;; K E E P F I L E D B I N dbstruct
︙
150 151 152 153 154 155 156 ~~157~~ 158 159 160 161 162 163 164	150 151 152 153 154 155 156 157 158 159 160 161 162 163 164	- +	(let* ((dbdir (db:get-dbdir)) (fname (if run-id (if (eq? run-id 0) "main.db" (conc run-id ".db")) #f))) (handle-exceptions exn (begin ~~(debug:print 0 "ERROR: Couldn't create path to " dbdir)~~ (debug:print 0 #f "ERROR: Couldn't create path to " dbdir) (exit 1)) (if (not (directory? dbdir))(create-directory dbdir #t))) (if fname (conc dbdir "/" fname) dbdir))) (define (db:get-dbdir)
︙
190 191 192 193 194 195 196 ~~197~~ 198 199 200 201 202 203 204	190 191 192 193 194 195 196 197 198 199 200 201 202 203 204	- +	(db (sqlite3:open-database fname))) (sqlite3:set-busy-handler! db (make-busy-timeout 136000)) (db:set-sync db) ;; (sqlite3:execute db "PRAGMA synchronous = 0;") (if (not file-exists)(initproc db)) ;; (release-dot-lock fname) db) (begin ~~(debug:print 2 "WARNING: opening db in non-writable dir " fname)~~ (debug:print 2 #f "WARNING: opening db in non-writable dir " fname) (sqlite3:open-database fname))))) ;; ) ;; This routine creates the db. It is only called if the db is not already opened ;; (define (db:open-rundb dbstruct run-id #!key (attemptnum 0)(do-not-open #f)) ;; (conc toppath "/megatest.db") (car configinfo))) (let* ((local (dbr:dbstruct-get-local dbstruct)) (rdb (if local
︙
216 217 218 219 220 221 222 ~~223~~ 224 225 226 227 228 229 230	216 217 218 219 220 221 222 223 224 225 226 227 228 229 230	- +	(db (db:lock-create-open dbpath ;; this is the database physically on disk (lambda (db) (handle-exceptions exn (begin ;; (release-dot-lock dbpath) (if (> attemptnum 2) ~~(debug:print 0 "ERROR: tried twice, cannot create/initialize db for run-id " run-id ", at path " dbpath)~~ (debug:print 0 #f "ERROR: tried twice, cannot create/initialize db for run-id " run-id ", at path " dbpath) (db:open-rundb dbstruct run-id attemptnum (+ attemptnum 1)))) (db:initialize-run-id-db db) (sqlite3:execute db "INSERT OR IGNORE INTO tests (id,run_id,testname,event_time,item_path,state,status) VALUES (?,?,'bogustest',strftime('%s','now'),'nowherepath','DELETED','n/a');" (* run-id 30000) ;; allow for up to 30k tests per run run-id)
︙
317 318 319 320 321 322 323 ~~324~~ 325 326 327 328 329 330 331 332 333 334 335 336 337 338 339 340 341 342 343 ~~344~~ 345 346 347 348 349 350 351	317 318 319 320 321 322 323 324 325 326 327 328 329 330 331 332 333 334 335 336 337 338 339 340 341 342 343 344 345 346 347 348 349 350 351	- + - +	(rundb (dbr:dbstruct-get-rundb dbstruct)) (inmem (dbr:dbstruct-get-inmem dbstruct)) (maindb (dbr:dbstruct-get-main dbstruct)) (refdb (dbr:dbstruct-get-refdb dbstruct)) (olddb (dbr:dbstruct-get-olddb dbstruct)) ;; (runid (dbr:dbstruct-get-run-id dbstruct)) ) ~~(debug:print-info 4 "Syncing for run-id: " run-id)~~ (debug:print-info 4 #f "Syncing for run-id: " run-id) ;; (mutex-lock! http-mutex) (if (eq? run-id 0) ;; runid equal to 0 is main.db (if maindb (if (or (not (number? mtime)) (not (number? stime)) (> mtime stime) force-sync) (begin (db:delay-if-busy maindb) (db:delay-if-busy olddb) (let ((num-synced (db:sync-tables (db:sync-main-list maindb) maindb olddb))) (dbr:dbstruct-set-stime! dbstruct (current-milliseconds)) num-synced) 0)) (begin ;; this can occur when using local access (i.e. not in a server) ;; need a flag to turn it off. ;; ~~(debug:print 3 "WARNING: call to sync main.db to megatest.db but main not initialized")~~ (debug:print 3 #f "WARNING: call to sync main.db to megatest.db but main not initialized") 0)) ;; any other runid is a run (if (or (not (number? mtime)) (not (number? stime)) (> mtime stime) force-sync) (begin
︙
479 480 481 482 483 484 485 ~~486~~ 487 488 489 490 ~~491~~ 492 493 494 495 496 497 498 499 500 501 502 503 ~~504~~ 505 506 ~~507~~ 508 509 510 511 512 513 514 515 516 517 518 519 520 521 ~~522 523~~ 524 525 526 527 528 529 530	479 480 481 482 483 484 485 486 487 488 489 490 491 492 493 494 495 496 497 498 499 500 501 502 503 504 505 506 507 508 509 510 511 512 513 514 515 516 517 518 519 520 521 522 523 524 525 526 527 528 529 530	- + - + - + - + - - + +	(define (db:move-and-recreate-db dbdat) (let* ((dbpath (db:dbdat-get-path dbdat)) (dbdir (pathname-directory dbpath)) (fname (pathname-strip-directory dbpath)) (fnamejnl (conc fname "-journal")) (tmpname (conc fname "." (current-process-id))) (tmpjnl (conc fnamejnl "." (current-process-id)))) ~~(debug:print 0 "ERROR: " fname " appears corrupted. Making backup \"old/" fname "\"")~~ (debug:print 0 #f "ERROR: " fname " appears corrupted. Making backup \"old/" fname "\"") (system (conc "cd " dbdir ";mkdir -p old;cat " fname " > old/" tmpname)) (system (conc "rm -f " dbpath)) (if (file-exists? fnamejnl) (begin ~~(debug:print 0 "ERROR: " fnamejnl " found, moving it to old dir as " tmpjnl)~~ (debug:print 0 #f "ERROR: " fnamejnl " found, moving it to old dir as " tmpjnl) (system (conc "cd " dbdir ";mkdir -p old;cat " fnamejnl " > old/" tmpjnl)) (system (conc "rm -f " dbdir "/" fnamejnl)))) ;; attempt to recreate database (system (conc "cd " dbdir ";sqlite3 old/" tmpname " .dump \| sqlite3 " fname)))) ;; return #f to indicate the dbdat should be closed/reopened ;; else return dbdat ;; (define (db:repair-db dbdat #!key (numtries 1)) (let* ((dbpath (db:dbdat-get-path dbdat)) (dbdir (pathname-directory dbpath)) (fname (pathname-strip-directory dbpath))) ~~(debug:print-info 0 "Checking db " dbpath " for errors.")~~ (debug:print-info 0 #f "Checking db " dbpath " for errors.") (cond ((not (file-write-access? 
dbdir)) ~~(debug:print 0 "WARNING: can't write to " dbdir ", can't fix " fname)~~ (debug:print 0 #f "WARNING: can't write to " dbdir ", can't fix " fname) #f) ;; handle special cases, megatest.db and monitor.db ;; ;; NOPE: apply this same approach to all db files ;; (else ;; ((equal? fname "megatest.db") ;; this file can be regenerated if needed (handle-exceptions exn (begin ;; (db:move-and-recreate-db dbdat) (if (> numtries 0) (db:repair-db dbdat numtries: (- numtries 1)) #f) ~~(debug:print 0 "FATAL: file " dbpath " was found corrupted, an attempt to fix has been made but you must start over.") (debug:print 0~~ (debug:print 0 #f "FATAL: file " dbpath " was found corrupted, an attempt to fix has been made but you must start over.") (debug:print 0 #f " check the following:\n" " 1. full directories, look in ~/ /tmp and " dbdir "\n" " 2. write access to " dbdir "\n\n" " if the automatic recovery failed you may be able to recover data by doing \"" (if (member fname '("megatest.db" "monitor.db")) "megatest -cleanup-db" "megatest -import-megatest.db;megatest -cleanup-db")
︙
553 554 555 556 557 558 559 ~~560~~ 561 ~~562~~ 563 ~~564 565~~ 566 567 ~~568~~ 569 570 ~~571~~ 572 573 574 575 576 577 578 579 580 581 582 583 584 585 ~~586 587~~ 588 ~~589~~ 590 ~~591~~ 592 593 594 595 596 597 598	553 554 555 556 557 558 559 560 561 562 563 564 565 566 567 568 569 570 571 572 573 574 575 576 577 578 579 580 581 582 583 584 585 586 587 588 589 590 591 592 593 594 595 596 597 598	- + - + - - + + - + - + - - + + - + - +	;; (define (db:sync-tables tbls fromdb todb . slave-dbs) (mutex-lock! db-sync-mutex) (handle-exceptions exn (begin (mutex-unlock! db-sync-mutex) ~~(debug:print 0 "EXCEPTION: database probably overloaded or unreadable in db:sync-tables.")~~ (debug:print 0 #f "EXCEPTION: database probably overloaded or unreadable in db:sync-tables.") (print-call-chain (current-error-port)) ~~(debug:print 0 " message: " ((condition-property-accessor 'exn 'message) exn))~~ (debug:print 0 #f " message: " ((condition-property-accessor 'exn 'message) exn)) (print "exn=" (condition->list exn)) ~~(debug:print 0 " status: " ((condition-property-accessor 'sqlite3 'status) exn)) (debug:print 0 " src db: " (db:dbdat-get-path fromdb))~~ (debug:print 0 #f " status: " ((condition-property-accessor 'sqlite3 'status) exn)) (debug:print 0 #f " src db: " (db:dbdat-get-path fromdb)) (for-each (lambda (dbdat) (let ((dbpath (db:dbdat-get-path dbdat))) ~~(debug:print 0 " dbpath: " dbpath)~~ (debug:print 0 #f " dbpath: " dbpath) (if (not (db:repair-db dbdat)) (begin ~~(debug:print 0 "ERROR: Failed to rebuild " dbpath ", exiting now.")~~ (debug:print 0 #f "ERROR: Failed to rebuild " dbpath ", exiting now.") (exit))))) (cons todb slave-dbs)) 0) ;; (if server-run ;; we are inside a server, throw a sync-failed error ;; (signal (make-composite-condition ;; (make-property-condition 'sync-failed 'message "db:sync-tables failed in a server context."))) ;; 0)) ;; return zero for num synced ;; (set! time-to-exit #t) ;; let watch dog know that it is time to die. 
;; (tasks:server-set-state! (db:delay-if-busy tdbdat) server-id "shutting-down") ;; (portlogger:open-run-close portlogger:set-port port "released") ;; (exit 1))) (cond ~~((not fromdb) (debug:print 3 "WARNING: db:sync-tables called with fromdb missing") -1) ((not todb) (debug:print 3 "WARNING: db:sync-tables called with todb missing") -2)~~ ((not fromdb) (debug:print 3 #f "WARNING: db:sync-tables called with fromdb missing") -1) ((not todb) (debug:print 3 #f "WARNING: db:sync-tables called with todb missing") -2) ((not (sqlite3:database? (db:dbdat-get-db fromdb))) ~~(debug:print 0 "ERROR: db:sync-tables called with fromdb not a database " fromdb) -3)~~ (debug:print 0 #f "ERROR: db:sync-tables called with fromdb not a database " fromdb) -3) ((not (sqlite3:database? (db:dbdat-get-db todb))) ~~(debug:print 0 "ERROR: db:sync-tables called with todb not a database " todb) -4)~~ (debug:print 0 #f "ERROR: db:sync-tables called with todb not a database " todb) -4) (else (let ((stmts (make-hash-table)) ;; table-field => stmt (all-stmts '()) ;; ( ( stmt1 value1 ) ( stml2 value2 )) (numrecs (make-hash-table)) (start-time (current-milliseconds)) (tot-count 0)) (for-each ;; table
︙
633 634 635 636 637 638 639 ~~640~~ 641 642 643 644 645 646 647	633 634 635 636 637 638 639 640 641 642 643 644 645 646 647	- +	full-sel) ;; tack on remaining records in fromdat (if (not (null? fromdat)) (set! fromdats (cons fromdat fromdats))) (if (common:low-noise-print 120 "sync-records") ~~(debug:print-info 4 "found " totrecords " records to sync"))~~ (debug:print-info 4 #f "found " totrecords " records to sync")) ;; read the target table (sqlite3:for-each-row (lambda (a . b) (hash-table-set! todat a (apply vector a b))) (db:dbdat-get-db todb) full-sel)
︙
677 678 679 680 681 682 683 ~~684~~ 685 686 687 688 689 690 ~~691~~ 692 693 694 695 696 697 698	677 678 679 680 681 682 683 684 685 686 687 688 689 690 691 692 693 694 695 696 697 698	- + - +	)) fromdats) (sqlite3:finalize! stmth))) (append (list todb) slave-dbs)))) tbls) (let* ((runtime (- (current-milliseconds) start-time)) (should-print (common:low-noise-print 120 "db sync" (> runtime 500)))) ;; low and high sync times treated as separate. ~~(if should-print (debug:print 3 "INFO: db sync, total run time " runtime " ms"))~~ (if should-print (debug:print 3 #f "INFO: db sync, total run time " runtime " ms")) (for-each (lambda (dat) (let ((tblname (car dat)) (count (cdr dat))) (set! tot-count (+ tot-count count)) (if (> count 0) ~~(if should-print (debug:print 0 (format #f " ~10a ~5a" tblname count))))))~~ (if should-print (debug:print 0 #f (format #f " ~10a ~5a" tblname count)))))) (sort (hash-table->alist numrecs)(lambda (a b)(> (cdr a)(cdr b)))))) tot-count))) (mutex-unlock! db-sync-mutex))) ;; options: ;; ;; 'killservers - kills all servers
︙
745 746 747 748 749 750 751 ~~752~~ 753 754 755 756 757 758 759 760 761 762 763 764 765 766 767 768 ~~769~~ 770 771 772 773 774 775 776 777 778 779 780 781 782 783 784 785 786 787 ~~788~~ 789 790 791 792 793 794 795	745 746 747 748 749 750 751 752 753 754 755 756 757 758 759 760 761 762 763 764 765 766 767 768 769 770 771 772 773 774 775 776 777 778 779 780 781 782 783 784 785 786 787 788 789 790 791 792 793 794 795	- + - + - +	(begin (db:sync-tables (db:sync-main-list mtdb) mtdb (db:get-db dbstruct #f)) (for-each (lambda (run-id) (db:delay-if-busy mtdb) (let ((testrecs (db:get-all-tests-info-by-run-id mtdb run-id)) (dbstruct (if toppath (make-dbr:dbstruct path: toppath local: #t) #f))) ~~(debug:print 0 "INFO: Propagating " (length testrecs) " records for run-id=" run-id " to run specific db")~~ (debug:print 0 #f "INFO: Propagating " (length testrecs) " records for run-id=" run-id " to run specific db") (db:replace-test-records dbstruct run-id testrecs) (sqlite3:finalize! (db:dbdat-get-db (dbr:dbstruct-get-rundb dbstruct))))) run-ids))) ;; now ensure all newdb data are synced to megatest.db ;; do not use the run-ids list passed in to the function ;; (if (member 'new2old options) (let* ((maindb (make-dbr:dbstruct path: toppath local: #t)) (src-run-ids (if run-ids run-ids (db:get-all-run-ids (db:dbdat-get-db (db:get-db maindb 0))))) (all-run-ids (sort (delete-duplicates (cons 0 src-run-ids)) <)) (count 1) (total (length all-run-ids)) (dead-runs '())) (for-each (lambda (run-id) ~~(debug:print 0 "Processing run " (if (eq? run-id 0) " main.db " run-id) ", " count " of " total)~~ (debug:print 0 #f "Processing run " (if (eq? run-id 0) " main.db " run-id) ", " count " of " total) (set! count (+ count 1)) (let* ((fromdb (if toppath (make-dbr:dbstruct path: toppath local: #t) #f)) (frundb (db:dbdat-get-db (db:get-db fromdb run-id)))) ;; (db:delay-if-busy frundb) ;; (db:delay-if-busy mtdb) ;; (db:clean-up frundb) (if (eq? 
run-id 0) (let ((maindb (db:dbdat-get-db (db:get-db fromdb #f)))) (db:sync-tables (db:sync-main-list dbstruct) (db:get-db fromdb #f) mtdb) (set! dead-runs (db:clean-up-maindb (db:get-db fromdb #f))) ;; ;; Feb 18, 2016: add field last_update to runs table ;; ;; remove all these some time after september 2016 (added in v1.6031 ;; (handle-exceptions exn (if (string-match ".duplicate." ((condition-property-accessor 'exn 'message) exn)) ~~(debug:print 0 "Column last_update already added to runs table")~~ (debug:print 0 #f "Column last_update already added to runs table") (db:general-sqlite-error-dump exn "alter table runs ..." run-id "none")) (sqlite3:execute maindb "ALTER TABLE runs ADD COLUMN last_update INTEGER DEFAULT 0")) ;; these schema changes don't need exception handling (sqlite3:execute maindb
︙
823 824 825 826 827 828 829 ~~830~~ 831 832 833 834 835 836 837	823 824 825 826 827 828 829 830 831 832 833 834 835 836 837	- +	;; remove this some time after September 2016 (added in version v1.6031 ;; (for-each (lambda (table-name) (handle-exceptions exn (if (string-match ".duplicate." ((condition-property-accessor 'exn 'message) exn)) ~~(debug:print 0 "Column last_update already added to " table-name " table")~~ (debug:print 0 #f "Column last_update already added to " table-name " table") (db:general-sqlite-error-dump exn "alter table " table-name " ..." #f "none")) (sqlite3:execute frundb (conc "ALTER TABLE " table-name " ADD COLUMN last_update INTEGER DEFAULT 0"))) (sqlite3:execute frundb (conc "DROP TRIGGER IF EXISTS update_" table-name "_trigger;"))
︙
848 849 850 851 852 853 854 ~~855~~ 856 857 858 859 860 861 862 863 864 ~~865~~ 866 867 868 869 870 ~~871~~ 872 ~~873~~ 874 875 876 ~~877~~ 878 879 880 881 882 883 884 885 886 887 888 889 ~~890 891~~ 892 ~~893~~ 894 895 ~~896~~ 897 898 899 900 901 902 903	848 849 850 851 852 853 854 855 856 857 858 859 860 861 862 863 864 865 866 867 868 869 870 871 872 873 874 875 876 877 878 879 880 881 882 883 884 885 886 887 888 889 890 891 892 893 894 895 896 897 898 899 900 901 902 903	- + - + - + - + - + - - + + - + - +	all-run-ids) ;; removed deleted runs (let ((dbdir (tasks:get-task-db-path))) (for-each (lambda (run-id) (let ((fullname (conc dbdir "/" run-id ".db"))) (if (file-exists? fullname) (begin ~~(debug:print 0 "Removing database file for deleted run " fullname)~~ (debug:print 0 #f "Removing database file for deleted run " fullname) (delete-file fullname))))) dead-runs)))) ;; (db:close-all dbstruct) ;; (sqlite3:finalize! mdb) )) ;; keeping it around for debugging purposes only (define (open-run-close-no-exception-handling proc idb . params) ~~(debug:print-info 11 "open-run-close-no-exception-handling START given a db=" (if idb "yes " "no ") ", params=" params)~~ (debug:print-info 11 #f "open-run-close-no-exception-handling START given a db=" (if idb "yes " "no ") ", params=" params) (if (or db-write-access (not (member proc db:all-write-procs))) (let* ((db (cond ((pair? idb) (db:dbdat-get-db idb)) ((sqlite3:database? idb) idb) ~~((not idb) (debug:print 0 "ERROR: cannot open-run-close with #f anymore"))~~ ((not idb) (debug:print 0 #f "ERROR: cannot open-run-close with #f anymore")) ((procedure? idb) (idb)) ~~(else (debug:print 0 "ERROR: cannot open-run-close with #f anymore"))))~~ (else (debug:print 0 #f "ERROR: cannot open-run-close with #f anymore")))) (res #f)) (set! res (apply proc db params)) (if (not idb)(sqlite3:finalize! 
dbstruct)) ~~(debug:print-info 11 "open-run-close-no-exception-handling END" )~~ (debug:print-info 11 #f "open-run-close-no-exception-handling END" ) res) #f)) (define (open-run-close-exception-handling proc idb . params) (handle-exceptions exn (let ((sleep-time (random 30)) (err-status ((condition-property-accessor 'sqlite3 'status #f) exn))) (case err-status ((busy) (thread-sleep! sleep-time)) (else ~~(debug:print 0 "EXCEPTION: database probably overloaded or unreadable.") (debug:print 0 " message: " ((condition-property-accessor 'exn 'message) exn))~~ (debug:print 0 #f "EXCEPTION: database probably overloaded or unreadable.") (debug:print 0 #f " message: " ((condition-property-accessor 'exn 'message) exn)) (print "exn=" (condition->list exn)) ~~(debug:print 0 " status: " ((condition-property-accessor 'sqlite3 'status) exn))~~ (debug:print 0 #f " status: " ((condition-property-accessor 'sqlite3 'status) exn)) (print-call-chain (current-error-port)) (thread-sleep! sleep-time) ~~(debug:print-info 0 "trying db call one more time....this may never recover, if necessary kill process " (current-process-id) " on host " (get-host-name) " to clean up")))~~ (debug:print-info 0 #f "trying db call one more time....this may never recover, if necessary kill process " (current-process-id) " on host " (get-host-name) " to clean up"))) (apply open-run-close-exception-handling proc idb params)) (apply open-run-close-no-exception-handling proc idb params))) ;; (define open-run-close (define open-run-close open-run-close-exception-handling) ;; open-run-close-no-exception-handling ;; open-run-close-exception-handling)
︙
1014 1015 1016 1017 1018 1019 1020 ~~1021~~ 1022 1023 1024 1025 1026 1027 1028	1014 1015 1016 1017 1018 1019 1020 1021 1022 1023 1024 1025 1026 1027 1028	- +	(sqlite3:execute db "CREATE TABLE IF NOT EXISTS extradat (id INTEGER PRIMARY KEY, run_id INTEGER, key TEXT, val TEXT);") (sqlite3:execute db "CREATE TABLE IF NOT EXISTS metadat (id INTEGER PRIMARY KEY, var TEXT, val TEXT, CONSTRAINT metadat_constraint UNIQUE (var));") (sqlite3:execute db "CREATE TABLE IF NOT EXISTS access_log (id INTEGER PRIMARY KEY, user TEXT, accessed TIMESTAMP, args TEXT);") ;; Must do this after running patch db !! No more. ;; cannot use db:set-var since it will deadlock, hardwire the code here (sqlite3:execute db "INSERT OR REPLACE INTO metadat (var,val) VALUES (?,?);" "MEGATEST_VERSION" (common:version-signature)) ~~(debug:print-info 11 "db:initialize END")))))~~ (debug:print-info 11 #f "db:initialize END"))))) ;;====================================================================== ;; R U N S P E C I F I C D B ;;====================================================================== (define (db:initialize-run-id-db db) (sqlite3:with-transaction
︙
1306 1307 1308 1309 1310 1311 1312 ~~1313~~ 1314 1315 1316 1317 1318 1319 1320 1321 1322 1323 1324 1325 1326 1327 1328 1329 1330 1331 1332 ~~1333~~ 1334 1335 1336 1337 1338 1339 1340	1306 1307 1308 1309 1310 1311 1312 1313 1314 1315 1316 1317 1318 1319 1320 1321 1322 1323 1324 1325 1326 1327 1328 1329 1330 1331 1332 1333 1334 1335 1336 1337 1338 1339 1340	- + - +	(sqlite3:for-each-row (lambda (test-id run-dir uname testname item-path) (if (and (equal? uname "n/a") (equal? item-path "")) ;; this is a toplevel test ;; what to do with toplevel? call rollup? (begin (set! toplevels (cons (list test-id run-dir uname testname item-path run-id) toplevels)) ~~(debug:print-info 0 "Found old toplevel test in RUNNING state, test-id=" test-id))~~ (debug:print-info 0 #f "Found old toplevel test in RUNNING state, test-id=" test-id)) (set! incompleted (cons (list test-id run-dir uname testname item-path run-id) incompleted)))) db "SELECT id,rundir,uname,testname,item_path FROM tests WHERE run_id=? AND (strftime('%s','now') - event_time) > (run_duration + ?) AND state IN ('RUNNING','REMOTEHOSTSTART');" run-id deadtime) ;; in LAUNCHED for more than one day. Could be long due to job queues TODO/BUG: Need override for this in config ;; (db:delay-if-busy dbdat) (sqlite3:for-each-row (lambda (test-id run-dir uname testname item-path) (if (and (equal? uname "n/a") (equal? item-path "")) ;; this is a toplevel test ;; what to do with toplevel? call rollup? (set! toplevels (cons (list test-id run-dir uname testname item-path run-id) toplevels)) (set! oldlaunched (cons (list test-id run-dir uname testname item-path run-id) oldlaunched)))) db "SELECT id,rundir,uname,testname,item_path FROM tests WHERE run_id=? 
AND (strftime('%s','now') - event_time) > 86400 AND state IN ('LAUNCHED');" run-id) ~~(debug:print-info 18 "Found " (length oldlaunched) " old LAUNCHED items, " (length toplevels) " old LAUNCHED toplevel tests and " (length incompleted) " tests marked RUNNING but apparently dead.")~~ (debug:print-info 18 #f "Found " (length oldlaunched) " old LAUNCHED items, " (length toplevels) " old LAUNCHED toplevel tests and " (length incompleted) " tests marked RUNNING but apparently dead.") (if (and (null? incompleted) (null? oldlaunched) (null? toplevels)) #f #t))) ;; select end_time-now from
︙
1365 1366 1367 1368 1369 1370 1371 ~~1372~~ 1373 1374 1375 1376 1377 1378 1379 1380 1381 1382 1383 1384 1385 1386 1387 1388 1389 1390 1391 ~~1392~~ 1393 1394 1395 1396 1397 1398 1399 1400 1401 1402 1403 1404 1405 1406 1407 ~~1408~~ 1409 1410 1411 1412 1413 1414 1415	1365 1366 1367 1368 1369 1370 1371 1372 1373 1374 1375 1376 1377 1378 1379 1380 1381 1382 1383 1384 1385 1386 1387 1388 1389 1390 1391 1392 1393 1394 1395 1396 1397 1398 1399 1400 1401 1402 1403 1404 1405 1406 1407 1408 1409 1410 1411 1412 1413 1414 1415	- + - + - +	(sqlite3:for-each-row (lambda (test-id run-dir uname testname item-path) (if (and (equal? uname "n/a") (equal? item-path "")) ;; this is a toplevel test ;; what to do with toplevel? call rollup? (begin (set! toplevels (cons (list test-id run-dir uname testname item-path run-id) toplevels)) ~~(debug:print-info 0 "Found old toplevel test in RUNNING state, test-id=" test-id))~~ (debug:print-info 0 #f "Found old toplevel test in RUNNING state, test-id=" test-id)) (set! incompleted (cons (list test-id run-dir uname testname item-path run-id) incompleted)))) db "SELECT id,rundir,uname,testname,item_path FROM tests WHERE run_id=? AND (strftime('%s','now') - event_time) > (run_duration + ?) AND state IN ('RUNNING','REMOTEHOSTSTART');" run-id deadtime) ;; in LAUNCHED for more than one day. Could be long due to job queues TODO/BUG: Need override for this in config ;; (db:delay-if-busy dbdat) (sqlite3:for-each-row (lambda (test-id run-dir uname testname item-path) (if (and (equal? uname "n/a") (equal? item-path "")) ;; this is a toplevel test ;; what to do with toplevel? call rollup? (set! toplevels (cons (list test-id run-dir uname testname item-path run-id) toplevels)) (set! oldlaunched (cons (list test-id run-dir uname testname item-path run-id) oldlaunched)))) db "SELECT id,rundir,uname,testname,item_path FROM tests WHERE run_id=? 
AND (strftime('%s','now') - event_time) > 86400 AND state IN ('LAUNCHED');" run-id) ~~(debug:print-info 18 "Found " (length oldlaunched) " old LAUNCHED items, " (length toplevels) " old LAUNCHED toplevel tests and " (length incompleted) " tests marked RUNNING but apparently dead.")~~ (debug:print-info 18 #f "Found " (length oldlaunched) " old LAUNCHED items, " (length toplevels) " old LAUNCHED toplevel tests and " (length incompleted) " tests marked RUNNING but apparently dead.") ;; These are defunct tests, do not do all the overhead of set-state-status. Force them to INCOMPLETE. ;; (db:delay-if-busy dbdat) (let* (;; (min-incompleted (filter (lambda (x) ;; (let* ((testpath (cadr x)) ;; (tdatpath (conc testpath "/testdat.db")) ;; (dbexists (file-exists? tdatpath))) ;; (or (not dbexists) ;; if no file then something wrong - mark as incomplete ;; (> (- (current-seconds)(file-modification-time tdatpath)) 600)))) ;; no change in 10 minutes to testdat.db - she's dead Jim ;; incompleted)) (min-incompleted-ids (map car incompleted)) ;; do 'em all (all-ids (append min-incompleted-ids (map car oldlaunched)))) (if (> (length all-ids) 0) (begin ~~(debug:print 0 "WARNING: Marking test(s); " (string-intersperse (map conc all-ids) ", ") " as INCOMPLETE")~~ (debug:print 0 #f "WARNING: Marking test(s); " (string-intersperse (map conc all-ids) ", ") " as INCOMPLETE") (sqlite3:execute db (conc "UPDATE tests SET state='INCOMPLETE' WHERE id IN (" (string-intersperse (map conc all-ids) ",") ");"))))) ;; Now do rollups for the toplevel tests
︙
1434 1435 1436 1437 1438 1439 1440 ~~1441~~ 1442 1443 1444 1445 1446 1447 1448	1434 1435 1436 1437 1438 1439 1440 1441 1442 1443 1444 1445 1446 1447 1448	- +	;; a. If test dir exists, set the the test to state='UNKNOWN', Set the run to 'unknown' ;; b. If test dir gone, delete the test record ;; 2. Look at run records ;; a. If have tests that are not deleted, set state='unknown' ;; b. .... ;; (define (db:clean-up dbdat) ~~;; (debug:print 0 "WARNING: db clean up not fully ported to v1.60, cleanup action will be on megatest.db")~~ ;; (debug:print 0 #f "WARNING: db clean up not fully ported to v1.60, cleanup action will be on megatest.db") (let* ((db (db:dbdat-get-db dbdat)) (count-stmt (sqlite3:prepare db "SELECT (SELECT count(id) FROM tests)+(SELECT count(id) FROM runs);")) (statements (map (lambda (stmt) (sqlite3:prepare db stmt)) (list ;; delete all tests that belong to runs that are 'deleted'
︙
1457 1458 1459 1460 1461 1462 1463 ~~1464~~ 1465 1466 1467 ~~1468~~ 1469 1470 1471 1472 1473 1474 1475 1476 1477 1478 1479 1480 1481 1482 1483 1484 1485 1486 1487 ~~1488~~ 1489 1490 1491 1492 1493 1494 1495 1496 1497 1498 1499 1500 1501 1502 1503 1504 ~~1505~~ 1506 1507 1508 ~~1509~~ 1510 1511 1512 1513 1514 1515 1516 1517 1518 1519 1520 1521 1522 1523 1524 1525 1526 1527 1528 ~~1529~~ 1530 1531 1532 1533 1534 1535 1536	1457 1458 1459 1460 1461 1462 1463 1464 1465 1466 1467 1468 1469 1470 1471 1472 1473 1474 1475 1476 1477 1478 1479 1480 1481 1482 1483 1484 1485 1486 1487 1488 1489 1490 1491 1492 1493 1494 1495 1496 1497 1498 1499 1500 1501 1502 1503 1504 1505 1506 1507 1508 1509 1510 1511 1512 1513 1514 1515 1516 1517 1518 1519 1520 1521 1522 1523 1524 1525 1526 1527 1528 1529 1530 1531 1532 1533 1534 1535 1536	- + - + - + - + - + - +	"DELETE FROM runs WHERE id NOT IN (SELECT DISTINCT r.id FROM runs AS r INNER JOIN tests AS t ON t.run_id=r.id);" )))) (db:delay-if-busy dbdat) (sqlite3:with-transaction db (lambda () (sqlite3:for-each-row (lambda (tot) ~~(debug:print-info 0 "Records count before clean: " tot))~~ (debug:print-info 0 #f "Records count before clean: " tot)) count-stmt) (map sqlite3:execute statements) (sqlite3:for-each-row (lambda (tot) ~~(debug:print-info 0 "Records count after clean: " tot))~~ (debug:print-info 0 #f "Records count after clean: " tot)) count-stmt))) (map sqlite3:finalize! statements) (sqlite3:finalize! count-stmt) ;; (db:find-and-mark-incomplete db) (db:delay-if-busy dbdat) (sqlite3:execute db "VACUUM;"))) ;; Clean out old junk and vacuum the database ;; ;; Ultimately do something like this: ;; ;; 1. Look at test records either deleted or part of deleted run: ;; a. If test dir exists, set the the test to state='UNKNOWN', Set the run to 'unknown' ;; b. If test dir gone, delete the test record ;; 2. Look at run records ;; a. If have tests that are not deleted, set state='unknown' ;; b. .... 
;; (define (db:clean-up-rundb dbdat) ~~;; (debug:print 0 "WARNING: db clean up not fully ported to v1.60, cleanup action will be on megatest.db")~~ ;; (debug:print 0 #f "WARNING: db clean up not fully ported to v1.60, cleanup action will be on megatest.db") (let* ((db (db:dbdat-get-db dbdat)) (count-stmt (sqlite3:prepare db "SELECT (SELECT count(id) FROM tests);")) (statements (map (lambda (stmt) (sqlite3:prepare db stmt)) (list ;; delete all tests that belong to runs that are 'deleted' ;; (conc "DELETE FROM tests WHERE run_id NOT IN (" (string-intersperse (map conc valid-runs) ",") ");") ;; delete all tests that are 'DELETED' "DELETE FROM tests WHERE state='DELETED';" )))) (db:delay-if-busy dbdat) (sqlite3:with-transaction db (lambda () (sqlite3:for-each-row (lambda (tot) ~~(debug:print-info 0 "Records count before clean: " tot))~~ (debug:print-info 0 #f "Records count before clean: " tot)) count-stmt) (map sqlite3:execute statements) (sqlite3:for-each-row (lambda (tot) ~~(debug:print-info 0 "Records count after clean: " tot))~~ (debug:print-info 0 #f "Records count after clean: " tot)) count-stmt))) (map sqlite3:finalize! statements) (sqlite3:finalize! count-stmt) ;; (db:find-and-mark-incomplete db) (db:delay-if-busy dbdat) (sqlite3:execute db "VACUUM;"))) ;; Clean out old junk and vacuum the database ;; ;; Ultimately do something like this: ;; ;; 1. Look at test records either deleted or part of deleted run: ;; a. If test dir exists, set the the test to state='UNKNOWN', Set the run to 'unknown' ;; b. If test dir gone, delete the test record ;; 2. Look at run records ;; a. If have tests that are not deleted, set state='unknown' ;; b. .... 
;; (define (db:clean-up-maindb dbdat) ~~;; (debug:print 0 "WARNING: db clean up not fully ported to v1.60, cleanup action will be on megatest.db")~~ ;; (debug:print 0 #f "WARNING: db clean up not fully ported to v1.60, cleanup action will be on megatest.db") (let* ((db (db:dbdat-get-db dbdat)) (count-stmt (sqlite3:prepare db "SELECT (SELECT count(id) FROM runs);")) (statements (map (lambda (stmt) (sqlite3:prepare db stmt)) (list ;; delete all tests that belong to runs that are 'deleted'
︙
1545 1546 1547 1548 1549 1550 1551 ~~1552~~ 1553 1554 1555 ~~1556~~ 1557 1558 1559 1560 1561 1562 1563	1545 1546 1547 1548 1549 1550 1551 1552 1553 1554 1555 1556 1557 1558 1559 1560 1561 1562 1563	- + - +	db "SELECT id FROM runs WHERE state='deleted';") (db:delay-if-busy dbdat) (sqlite3:with-transaction db (lambda () (sqlite3:for-each-row (lambda (tot) ~~(debug:print-info 0 "Records count before clean: " tot))~~ (debug:print-info 0 #f "Records count before clean: " tot)) count-stmt) (map sqlite3:execute statements) (sqlite3:for-each-row (lambda (tot) ~~(debug:print-info 0 "Records count after clean: " tot))~~ (debug:print-info 0 #f "Records count after clean: " tot)) count-stmt))) (map sqlite3:finalize! statements) (sqlite3:finalize! count-stmt) ;; (db:find-and-mark-incomplete db) (db:delay-if-busy dbdat) (sqlite3:execute db "VACUUM;") dead-runs))
︙
1589 1590 1591 1592 1593 1594 1595 ~~1596~~ 1597 1598 1599 1600 1601 1602 1603	1589 1590 1591 1592 1593 1594 1595 1596 1597 1598 1599 1600 1601 1602 1603	- +	;; ;; scale by 10, average with current value. ;; (set! global-delta (/ (+ global-delta (* (- (current-milliseconds) start-ms) ;; (if throttle throttle 0.01))) ;; 2)) ;; (if (> (abs (- last-global-delta-printed global-delta)) 0.08) ;; don't print all the time, only if it changes a bit ;; (begin ~~;; (debug:print-info 4 "launch throttle factor=" global-delta)~~ ;; (debug:print-info 4 #f "launch throttle factor=" global-delta) ;; (set! last-global-delta-printed global-delta))) (define (db:set-var dbstruct var val) (let* ((dbdat (db:get-db dbstruct #f)) (db (db:dbdat-get-db dbdat))) (sqlite3:execute db "INSERT OR REPLACE INTO metadat (var,val) VALUES (?,?);" var val)))
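Note on the db.scm hunks above: each one inserts `#f` as a new second argument to `debug:print` and `debug:print-info`, which matches check-in 7b4d2dba0e ("Added param for overriding port to debug:print and debug:print-info"). A minimal sketch of how such a parameter could behave follows. It assumes `#f` means "use the default port". The names `sketch:verbosity` and `sketch:debug-print` are hypothetical and are not Megatest's actual definitions.

```scheme
;; Hypothetical illustration only: not the real debug:print from Megatest.
;; sketch:verbosity stands in for whatever verbosity control the tool uses.
(define sketch:verbosity 1)

;; n      - message level; printed only when n <= sketch:verbosity
;; port   - an output port, or #f to fall back to the default (stderr here)
;; params - items written one after another, followed by a newline
(define (sketch:debug-print n port . params)
  (if (<= n sketch:verbosity)
      (let ((out (or port (current-error-port))))
        (for-each (lambda (p) (display p out)) params)
        (newline out))))

;; Passing #f keeps the default destination, as in the call sites above.
(sketch:debug-print 0 #f "ERROR: " "example message")

;; Passing a port redirects the message; a string port captures it.
(define capture (open-output-string))
(sketch:debug-print 0 capture "run-id=" 42)
(sketch:debug-print 3 capture "suppressed at verbosity 1")
(get-output-string capture) ; holds only the level-0 message
```

Under this assumption, existing call sites keep their behaviour by passing `#f`, while a caller that wants its output in a log can pass an open port instead.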

︙
15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38	15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38	- + - +	(define (setup-env-defaults fname run-id already-seen keyvals #!key (environ-patt #f)(change-env #t)) (let* ((keys (map car keyvals)) (thekey (if keyvals (string-intersperse (map (lambda (x)(if x x "-na-")) (map cadr keyvals)) "/") (or (common:args-get-target) (get-environment-variable "MT_TARGET") (begin ~~(debug:print 0 "ERROR: setup-env-defaults called with no run-id or -target or -reqtarg")~~ (debug:print 0 #f "ERROR: setup-env-defaults called with no run-id or -target or -reqtarg") "nothing matches this I hope")))) ;; Why was system disallowed in the reading of the runconfigs file? ;; NOTE: Should be setting env vars based on (target\|default) (confdat (read-config fname #f #t environ-patt: environ-patt sections: (list "default" thekey))) (whatfound (make-hash-table)) (finaldat (make-hash-table)) (sections (list "default" thekey))) (if (not target)(set! target thekey)) ;; may save a db access or two but repeats db:get-target code ~~(debug:print 4 "Using key=\"" thekey "\"")~~ (debug:print 4 #f "Using key=\"" thekey "\"") (if change-env (for-each ;; NB// This can be simplified with new content of keyvals having all that is needed. (lambda (keyval) (safe-setenv (car keyval)(cadr keyval))) keyvals))
︙
49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80	49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80	- + - + - + - +	change-env) (safe-setenv envvar val)) (hash-table-set! finaldat envvar val))) (map car section-dat))))) sections) (if already-seen (begin ~~(debug:print 2 "Key settings found in runconfig.config:")~~ (debug:print 2 #f "Key settings found in runconfig.config:") (for-each (lambda (fullkey) ~~(debug:print 2 (format #f "~20a ~a\n" fullkey (hash-table-ref/default whatfound fullkey 0))))~~ (debug:print 2 #f (format #f "~20a ~a\n" fullkey (hash-table-ref/default whatfound fullkey 0)))) sections) ~~(debug:print 2 "---")~~ (debug:print 2 #f "---") (set! already-seen-runconfig-info #t))) ;; finaldat ;; was returning this "finaldat" which would be good but conflicts with other uses confdat )) (define (set-run-config-vars run-id keyvals targ-from-db) (push-directory toppath) ;; the push/pop doesn't appear to do anything ... (let ((runconfigf (conc toppath "/runconfigs.config")) (targ (or (common:args-get-target) targ-from-db (get-environment-variable "MT_TARGET")))) (pop-directory) (if (file-exists? runconfigf) (setup-env-defaults runconfigf run-id #t keyvals environ-patt: (conc "(default" (if targ (conc "\|" targ ")") ")"))) ~~(debug:print 0 "WARNING: You do not have a run config file: " runconfigf))))~~ (debug:print 0 #f "WARNING: You do not have a run config file: " runconfigf))))

︙
50 51 52 53 54 55 56 ~~57 58~~ 59 60 61 62 63 64 65	50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65	- - + +	;; start_server ;; (define (server:launch run-id) (case transport-type ((http)(http-transport:launch run-id)) ((nmsg)(nmsg-transport:launch run-id)) ((rpc) (rpc-transport:launch run-id)) ~~(else (debug:print 0 "ERROR: unknown server type " transport-type)))) ;; (else (debug:print 0 "ERROR: No known transport set, transport=" transport ", using rpc")~~ (else (debug:print 0 #f "ERROR: unknown server type " transport-type)))) ;; (else (debug:print 0 #f "ERROR: No known transport set, transport=" transport ", using rpc") ;; (rpc-transport:launch run-id))))) ;;====================================================================== ;; S E R V E R U T I L I T I E S ;;====================================================================== ;; Get the transport
︙
81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 ~~100~~ 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 ~~118~~ 119 120 121 122 123 124 125 126 127 128 129 ~~130~~ 131 ~~132~~ 133 134 135 136 137 138 139 140 141 142 143 ~~144~~ 145 146 147 148 149 150 151	81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151	- + - + - + - + - + - +	(write (list (current-directory) (argv))))))) ;; When using zmq this would send the message back (two step process) ;; with spiffy or rpc this simply returns the return data to be returned ;; (define (server:reply return-addr query-sig success/fail result) ~~(debug:print-info 11 "server:reply return-addr=" return-addr ", result=" result)~~ (debug:print-info 11 #f "server:reply return-addr=" return-addr ", result=" result) ;; (send-message pubsock target send-more: #t) ;; (send-message pubsock (case (server:get-transport) ((rpc) (db:obj->string (vector success/fail query-sig result))) ((http) (db:obj->string (vector success/fail query-sig result))) ((zmq) (let ((pub-socket (vector-ref runremote 1))) (send-message pub-socket return-addr send-more: #t) (send-message pub-socket (db:obj->string (vector success/fail query-sig result))))) ((fs) result) (else ~~(debug:print 0 "ERROR: unrecognised transport type: " transport-type)~~ (debug:print 0 #f "ERROR: unrecognised transport type: " transport-type) result))) ;; Given a run id start a server process ### NOTE ### > file 2>&1 ;; if the run-id is zero and the target-host is set ;; try running on that host ;; (define (server:run run-id) (let* ((curr-host (get-host-name)) (curr-ip (server:get-best-guess-address curr-host)) (target-host (configf:lookup configdat "server" "homehost" )) (testsuite (common:get-testsuite-name)) (logfile (conc toppath "/logs/" run-id 
".log")) (cmdln (conc (common:get-megatest-exe) " -server " (or target-host "-") " -run-id " run-id (if (equal? (configf:lookup configdat "server" "daemonize") "yes") (conc " -daemonize -log " logfile) "") " -m testsuite:" testsuite))) ;; (conc " >> " logfile " 2>&1 &"))))) ~~(debug:print 0 "INFO: Starting server (" cmdln ") as none running ...")~~ (debug:print 0 #f "INFO: Starting server (" cmdln ") as none running ...") (push-directory toppath) (if (not (directory-exists? "logs"))(create-directory "logs")) ;; Rotate logs, logic: ;; if > 500k and older than 1 week, remove previous compressed log and compress this log (directory-fold (lambda (file rem) (if (and (string-match "^..log" file) (> (file-size (conc "logs/" file)) 200000)) (let ((gzfile (conc "logs/" file ".gz"))) (if (file-exists? gzfile) (begin ~~(debug:print-info 0 "removing " gzfile)~~ (debug:print-info 0 #f "removing " gzfile) (delete-file gzfile))) ~~(debug:print-info 0 "compressing " file)~~ (debug:print-info 0 #f "compressing " file) (system (conc "gzip logs/" file))))) '() "logs") ;; host.domain.tld match host? (if (and target-host ;; look at target host, is it host.domain.tld or ip address and does it ;; match current ip or hostname (not (string-match (conc "("curr-host "\|" curr-host"\\..)") target-host)) (not (equal? curr-ip target-host))) (begin ~~(debug:print-info 0 "Starting server on " target-host ", logfile is " logfile)~~ (debug:print-info 0 #f "Starting server on " target-host ", logfile is " logfile) (setenv "TARGETHOST" target-host))) (setenv "TARGETHOST_LOGF" logfile) (common:wait-for-normalized-load 4 " delaying server start due to load") ;; do not try starting servers on an already overloaded machine, just wait forever (system (conc "nbfake " cmdln)) (unsetenv "TARGETHOST_LOGF") (if (get-environment-variable "TARGETHOST")(unsetenv "TARGETHOST")) ;; (system cmdln)
︙
191 192 193 194 195 196 197 ~~198~~ 199 200 201 202 203 204 205 206 207 208 209 210 211 212 213 214 215 ~~216~~ 217 218 219 220 221 222 223	191 192 193 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 213 214 215 216 217 218 219 220 221 222 223	- + - +	((nmsg)(nmsg-transport:ping (tasks:hostinfo-get-interface server) (tasks:hostinfo-get-port server) timeout: 2))))) ;; if the server didn't respond we must remove the record (if res #t (begin ~~(debug:print-info 0 "server at " server " not responding, removing record")~~ (debug:print-info 0 #f "server at " server " not responding, removing record") (tasks:server-force-clean-running-records-for-run-id (db:delay-if-busy tdbdat) run-id " server:check-if-running") res))) #f)))) ;; called in megatest.scm, host-port is string hostname:port ;; (define (server:ping run-id host:port) (let ((tdbdat (tasks:open-db))) (let* ((host-port (let ((slst (string-split host:port ":"))) (if (eq? (length slst) 2) (list (car slst)(string->number (cadr slst))) #f))) (toppath (launch:setup)) (server-db-dat (if (not host-port)(tasks:get-server (db:delay-if-busy tdbdat) run-id) #f))) (if (not run-id) (begin ~~(debug:print 0 "ERROR: must specify run-id when doing ping, -run-id n")~~ (debug:print 0 #f "ERROR: must specify run-id when doing ping, -run-id n") (print "ERROR: No run-id") (exit 1)) (if (and (not host-port) (not server-db-dat)) (begin (print "ERROR: bad host:port") (exit 1))
︙
250 251 252 253 254 255 256 ~~257~~ 258 259 ~~260~~ 261 262 263 264 265 266 267 268 269 270 271 272	250 251 252 253 254 255 256 257 258 259 260 261 262 263 264 265 266 267 268 269 270 271 272	- + - +	(loop (read-line) inl)))))) (define (server:login toppath) (lambda (toppath) (set! last-db-access (current-seconds)) (if (equal? toppath toppath) (begin ~~;; (debug:print-info 2 "login successful")~~ ;; (debug:print-info 2 #f "login successful") #t) (begin ~~;; (debug:print-info 2 "login failed")~~ ;; (debug:print-info 2 #f "login failed") #f)))) (define (server:get-timeout) (let ((tmo (configf:lookup configdat "server" "timeout"))) (if (and (string? tmo) (string->number tmo)) (* 60 60 (string->number tmo)) ;; (* 3 24 60 60) ;; default to three days (* 60 1) ;; default to one minute ;; (* 60 60 25) ;; default to 25 hours )))

︙
23 24 25 26 27 28 29 30 31 32 33 34 35 ~~36 37 38~~ 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71	23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71	- + - - - + + + - + - +	;; Tasks db ;;====================================================================== ;; wait up to aprox n seconds for a journal to go away ;; (define (tasks:wait-on-journal path n #!key (remove #f)(waiting-msg #f)) (if (not (string? path)) ~~(debug:print 0 "ERROR: Called tasks:wait-on-journal with path=" path " (not a string)")~~ (debug:print 0 #f "ERROR: Called tasks:wait-on-journal with path=" path " (not a string)") (let ((fullpath (conc path "-journal"))) (handle-exceptions exn (begin (print-call-chain (current-error-port)) (debug:print 0 " message: " ((condition-property-accessor 'exn 'message) exn)) (debug:print 0 " exn=" (condition->list exn)) (debug:print 0 "tasks:wait-on-journal failed. Continuing on, you can ignore this call-chain") (debug:print 0 #f " message: " ((condition-property-accessor 'exn 'message) exn)) (debug:print 0 #f " exn=" (condition->list exn)) (debug:print 0 #f "tasks:wait-on-journal failed. Continuing on, you can ignore this call-chain") #t) ;; if stuff goes wrong just allow it to move on (let loop ((journal-exists (file-exists? fullpath)) (count n)) ;; wait ten times ... (if journal-exists (begin (if (and waiting-msg (eq? (modulo n 30) 0)) ~~(debug:print 0 waiting-msg))~~ (debug:print 0 #f waiting-msg)) (if (> count 0) (begin (thread-sleep! 1) (loop (file-exists? 
fullpath) (- count 1))) (begin (if remove (system (conc "rm -rf " fullpath))) #f))) #t)))))) (define (tasks:get-task-db-path) (let ((dbdir (or (configf:lookup configdat "setup" "monitordir") (configf:lookup configdat "setup" "dbdir") (conc (configf:lookup configdat "setup" "linktree") "/.db")))) (handle-exceptions exn (begin ~~(debug:print 0 "ERROR: Couldn't create path to " dbdir)~~ (debug:print 0 #f "ERROR: Couldn't create path to " dbdir) (exit 1)) (if (not (directory? dbdir))(create-directory dbdir #t))) dbdir)) ;; If file exists AND ;; file readable ;; ==> open it
︙
79 80 81 82 83 84 85 ~~86 87~~ 88 89 90 91 ~~92 93~~ 94 95 96 97 98 99 100	79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100	- - + + - - + +	(if task-db task-db (handle-exceptions exn (if (> numretries 0) (begin (print-call-chain (current-error-port)) ~~(debug:print 0 " message: " ((condition-property-accessor 'exn 'message) exn)) (debug:print 0 " exn=" (condition->list exn))~~ (debug:print 0 #f " message: " ((condition-property-accessor 'exn 'message) exn)) (debug:print 0 #f " exn=" (condition->list exn)) (thread-sleep! 1) (tasks:open-db numretries (- numretries 1))) (begin (print-call-chain (current-error-port)) ~~(debug:print 0 " message: " ((condition-property-accessor 'exn 'message) exn)) (debug:print 0 " exn=" (condition->list exn))))~~ (debug:print 0 #f " message: " ((condition-property-accessor 'exn 'message) exn)) (debug:print 0 #f " exn=" (condition->list exn)))) (let* ((dbpath (tasks:get-task-db-path)) (dbfile (conc dbpath "/monitor.db")) (avail (tasks:wait-on-journal dbpath 10)) ;; wait up to about 10 seconds for the journal to go away (exists (file-exists? dbpath)) (write-access (file-write-access? dbpath)) (mdb (cond ;; what the hek is toppath doing here? ((and (string? toppath)(file-write-access? toppath))
︙
284 285 286 287 288 289 290 ~~291~~ 292 293 294 295 296 297 298 299 300 ~~301~~ 302 303 304 305 306 307 308	284 285 286 287 288 289 290 291 292 293 294 295 296 297 298 299 300 301 302 303 304 305 306 307 308	- + - +	(loop (get-rand-port)(- remtries 1)) (get-rand-port)) port)))))) (define (tasks:server-am-i-the-server? mdb run-id) (let* ((all (tasks:server-get-servers-vying-for-run-id mdb run-id)) (first (if (null? all) ~~#f;; (begin (debug:print 0 "ERROR: no servers listed, should be at least one by now.")~~ #f;; (begin (debug:print 0 #f "ERROR: no servers listed, should be at least one by now.") ;; (sqlite3:finalize! mdb) ;; (exit 1)) (car (db:get-rows all))))) (if first (let* ((header (db:get-header all)) (id (db:get-value-by-header first header "id")) (hostname (db:get-value-by-header first header "hostname")) (pid (db:get-value-by-header first header "pid")) (priority (db:get-value-by-header first header "priority"))) ~~;; (debug:print 0 "INFO: am-i-the-server got record " first)~~ ;; (debug:print 0 #f "INFO: am-i-the-server got record " first) ;; for now a basic check. add tiebreaking by priority later (if (and (equal? hostname (get-host-name)) (equal? pid (current-process-id))) id #f)) #f)))
︙
324 325 326 327 328 329 330 ~~331 332 333~~ 334 335 336 ~~337~~ 338 339 ~~340~~ 341 342 343 344 345 346 347	324 325 326 327 328 329 330 331 332 333 334 335 336 337 338 339 340 341 342 343 344 345 346 347	- - - + + + - + - +	(define (tasks:get-server mdb run-id #!key (retries 10)) (let ((res #f) (best #f)) (handle-exceptions exn (begin (print-call-chain (current-error-port)) ~~(debug:print 0 "WARNING: tasks:get-server db access error.") (debug:print 0 " message: " ((condition-property-accessor 'exn 'message) exn)) (debug:print 0 " for run " run-id)~~ (debug:print 0 #f "WARNING: tasks:get-server db access error.") (debug:print 0 #f " message: " ((condition-property-accessor 'exn 'message) exn)) (debug:print 0 #f " for run " run-id) (print-call-chain (current-error-port)) (if (> retries 0) (begin ~~(debug:print 0 " trying call to tasks:get-server again in 10 seconds")~~ (debug:print 0 #f " trying call to tasks:get-server again in 10 seconds") (thread-sleep! 10) (tasks:get-server mdb run-id retries: (- retries 0))) ~~(debug:print 0 "10 tries of tasks:get-server all crashed and burned. Giving up and returning \"no server found\"")))~~ (debug:print 0 #f "10 tries of tasks:get-server all crashed and burned. Giving up and returning \"no server found\""))) (sqlite3:for-each-row (lambda (id interface port pubport transport pid hostname) (set! res (vector id interface port pubport transport pid hostname))) mdb ;; removed: ;; strftime('%s','now')-heartbeat < 10 AND mt_version = ? "SELECT id,interface,port,pubport,transport,pid,hostname FROM servers
︙
371 372 373 374 375 376 377 ~~378~~ 379 380 381 ~~382~~ 383 384 385 386 387 388 389 390 391 392 393 394 395 396 ~~397~~ 398 399 400 401 402 403 404	371 372 373 374 375 376 377 378 379 380 381 382 383 384 385 386 387 388 389 390 391 392 393 394 395 396 397 398 399 400 401 402 403 404	- + - + - +	(configf:lookup configdat "server" "required")) ;; (maxqry (cdr (rmt:get-max-query-average run-id))) ;; (threshold (string->number (or (configf:lookup configdat "server" "server-query-threshold") "10")))) ;; (cond ;; (forced ;; (if (common:low-noise-print 60 run-id "server required is set") ~~;; (debug:print-info 0 "Server required is set, starting server for run-id " run-id "."))~~ ;; (debug:print-info 0 #f "Server required is set, starting server for run-id " run-id ".")) ;; #t) ;; ((> maxqry threshold) ;; (if (common:low-noise-print 60 run-id "Max query time execeeded") ~~;; (debug:print-info 0 "Max avg query time of " maxqry "ms exceeds limit of " threshold "ms, server needed for run-id " run-id "."))~~ ;; (debug:print-info 0 #f "Max avg query time of " maxqry "ms exceeds limit of " threshold "ms, server needed for run-id " run-id ".")) ;; #t) ;; (else ;; #f)))) ;; try to start a server and wait for it to be available ;; (define (tasks:start-and-wait-for-server tdbdat run-id delay-max-tries) ;; ensure a server is running for this run (let loop ((server-dat (tasks:get-server (db:delay-if-busy tdbdat) run-id)) (delay-time 0)) (if (and (not server-dat) (< delay-time delay-max-tries)) (begin (if (common:low-noise-print 60 "tasks:start-and-wait-for-server" run-id) ~~(debug:print 0 "Try starting server for run-id " run-id))~~ (debug:print 0 #f "Try starting server for run-id " run-id)) (thread-sleep! (/ (random 2000) 1000)) (server:kind-run run-id) (thread-sleep! (min delay-time 1)) (loop (tasks:get-server (db:delay-if-busy tdbdat) run-id)(+ delay-time 1)))))) (define (tasks:get-all-servers mdb) (let ((res '()))
︙
422 423 424 425 426 427 428 ~~429~~ 430 431 432 433 434 435 436 437 438 439 440 441 442 443 444 445 ~~446~~ 447 448 ~~449~~ 450 451 452 453 454 455 456	422 423 424 425 426 427 428 429 430 431 432 433 434 435 436 437 438 439 440 441 442 443 444 445 446 447 448 449 450 451 452 453 454 455 456	- + - + - +	FROM servers WHERE run_id=? AND state NOT LIKE 'defunct%' ORDER BY start_time DESC;" run-id) (reverse res))) ;; no elegance here ... ;; (define (tasks:kill-server hostname pid) ~~(debug:print-info 0 "Attempting to kill server process " pid " on host " hostname)~~ (debug:print-info 0 #f "Attempting to kill server process " pid " on host " hostname) (setenv "TARGETHOST" hostname) (setenv "TARGETHOST_LOGF" "server-kills.log") (system (conc "nbfake kill " pid)) (unsetenv "TARGETHOST_LOGF") (unsetenv "TARGETHOST")) ;; look up a server by run-id and send it a kill, also delete the record for that server ;; (define (tasks:kill-server-run-id run-id #!key (tag "default")) (let* ((tdbdat (tasks:open-db)) (sdat (tasks:get-server (db:delay-if-busy tdbdat) run-id))) (if sdat (let ((hostname (vector-ref sdat 6)) (pid (vector-ref sdat 5)) (server-id (vector-ref sdat 0))) (tasks:server-set-state! (db:delay-if-busy tdbdat) server-id "killed") ~~(debug:print-info 0 "Killing server " server-id " for run-id " run-id " on host " hostname " with pid " pid)~~ (debug:print-info 0 #f "Killing server " server-id " for run-id " run-id " on host " hostname " with pid " pid) (tasks:kill-server hostname pid) (tasks:server-delete-record (db:delay-if-busy tdbdat) server-id tag) ) ~~(debug:print-info 0 "No server found for run-id " run-id ", nothing to kill"))~~ (debug:print-info 0 #f "No server found for run-id " run-id ", nothing to kill")) ;; (sqlite3:finalize! tdb) )) ;;====================================================================== ;; M O N I T O R S ;;======================================================================
︙
@@ -517,15 +517,15 @@
      "SELECT count(id) FROM monitors WHERE last_update < (strftime('%s','now') - 300) AND username=?;"
      (car (user-information (current-user-id))))
     res))
 
 ;;
 (define (tasks:start-monitor db mdb)
   (if (> (tasks:get-num-alive-monitors mdb) 2) ;; have two running, no need for more
-      (debug:print-info 1 "Not starting monitor, already have more than two running")
+      (debug:print-info 1 #f "Not starting monitor, already have more than two running")
       (let* ((megatestdb (conc toppath "/megatest.db"))
              (monitordbf (conc (db:dbfile-path #f) "/monitor.db"))
              (last-db-update 0)) ;; (file-modification-time megatestdb)))
         (task:register-monitor mdb)
         (let loop ((count 0)
                    (next-touch 0)) ;; next-touch is the time where we need to update last_update
           ;; if the db has been modified we'd best look at the task queue
︙
745 746 747 748 749 750 751 ~~752 753~~ 754 755 756 757 758 759 760 ~~761~~ 762 763 764 765 766 767 ~~768 769~~ 770 771 772 773 774 775 776 777 778 779 780 781 782 ~~783~~ 784 785 786 787 788 789 790	745 746 747 748 749 750 751 752 753 754 755 756 757 758 759 760 761 762 763 764 765 766 767 768 769 770 771 772 773 774 775 776 777 778 779 780 781 782 783 784 785 786 787 788 789 790	- - + + - + - - + + - +	;; ;; do a remote call to get the task queue info but do the killing as self here. ;; (define (tasks:kill-runner target run-name testpatt) (let ((records (rmt:tasks-find-task-queue-records target run-name testpatt "running" "run-tests")) (hostpid-rx (regexp "\\s+(\\w+)\\s+(\\d+)$"))) ;; host pid is at end of param string (if (null? records) (debug:print 0 "No run launching processes found for " target " / " run-name " with testpatt " (or testpatt "* no testpatt specified! ")) (debug:print 0 "Found " (length records) " run(s) to kill.")) (debug:print 0 #f "No run launching processes found for " target " / " run-name " with testpatt " (or testpatt " no testpatt specified! ")) (debug:print 0 #f "Found " (length records) " run(s) to kill.")) (for-each (lambda (record) (let ((param-key (list-ref record 8)) (match-dat (string-search hostpid-rx param-key))) (if match-dat (let ((hostname (cadr match-dat)) (pid (string->number (caddr match-dat)))) ~~(debug:print 0 "Sending SIGINT to process " pid " on host " hostname)~~ (debug:print 0 #f "Sending SIGINT to process " pid " on host " hostname) (if (equal? (get-host-name) hostname) (if (process:alive? pid) (begin (handle-exceptions exn (begin ~~(debug:print 0 "Kill of process " pid " on host " hostname " failed.") (debug:print 0 " message: " ((condition-property-accessor 'exn 'message) exn))~~ (debug:print 0 #f "Kill of process " pid " on host " hostname " failed.") (debug:print 0 #f " message: " ((condition-property-accessor 'exn 'message) exn)) #t) (process-signal pid signal/int) (thread-sleep! 5) (if (process:alive? 
pid) (process-signal pid signal/kill))))) ;; (call-with-environment-variables (let ((old-targethost (getenv "TARGETHOST"))) (setenv "TARGETHOST" hostname) (setenv "TARGETHOST_LOGF" "server-kills.log") (system (conc "nbfake kill " pid)) (if old-targethost (setenv "TARGETHOST" old-targethost)) (unsetenv "TARGETHOST") (unsetenv "TARGETHOST_LOGF")))) ~~(debug:print 0 "ERROR: no record or improper record for " target "/" run-name " in tasks_queue in main.db"))))~~ (debug:print 0 #f "ERROR: no record or improper record for " target "/" run-name " in tasks_queue in main.db")))) records))) ;; (define (tasks:start-run dbstruct mdb task) ;; (let ((flags (make-hash-table))) ;; (hash-table-set! flags "-rerun" "NOT_STARTED") ;; (if (not (string=? (tasks:task-get-params task) "")) ;; (hash-table-set! flags "-setvars" (tasks:task-get-params task)))
︙

︙
43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 ~~100~~ 101 102 103 104 105 106 107 108 109 ~~110~~ 111 112 113 114 115 116 117 ~~118~~ 119 120 121 122 123 124 125 126 127 128 129 ~~130~~ 131 132 133 134 135 136 137	43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137	- + - + - + - + - + - + - + - + - +	;; Create the sqlite db for the individual test(s) ;; ;; Moved these tables into <runid>.db ;; THIS CODE TO BE REMOVED ;; (define (open-test-db work-area) ~~(debug:print-info 11 "open-test-db " work-area)~~ (debug:print-info 11 #f "open-test-db " work-area) (if (and work-area (directory? work-area) (file-read-access? work-area)) (let* ((dbpath (conc work-area "/testdat.db")) (dbexists (file-exists? dbpath)) (work-area-writeable (file-write-access? work-area)) (db (handle-exceptions ;; open the db if area writeable or db pre-existing. open in-mem otherwise. if exception, open in-mem exn (begin (print-call-chain (current-error-port)) ~~(debug:print 2 "ERROR: problem accessing test db " work-area ", you probably should clean and re-run this test"~~ (debug:print 2 #f "ERROR: problem accessing test db " work-area ", you probably should clean and re-run this test" ((condition-property-accessor 'exn 'message) exn)) (set! dbexists #f) ;; must force re-creation of tables, more tom-foolery (sqlite3:open-database ":memory:")) ;; open an in-memory db to allow readonly access (if (or work-area-writeable dbexists) (sqlite3:open-database dbpath) (sqlite3:open-database ":memory:")))) (tdb-writeable (and (file-write-access? work-area) (file-write-access? 
dbpath))) (handler (make-busy-timeout (if (args:get-arg "-override-timeout") (string->number (args:get-arg "-override-timeout")) 136000)))) (if (and tdb-writeable db-write-access) (sqlite3:set-busy-handler! db handler)) (if (not dbexists) (begin (db:set-sync db) ;; (sqlite3:execute db "PRAGMA synchronous = FULL;") ~~(debug:print-info 11 "Initialized test database " dbpath)~~ (debug:print-info 11 #f "Initialized test database " dbpath) (tdb:testdb-initialize db))) ;; (sqlite3:execute db "PRAGMA synchronous = 0;") ~~(debug:print-info 11 "open-test-db END (sucessful)" work-area)~~ (debug:print-info 11 #f "open-test-db END (sucessful)" work-area) ;; now let's test that everything is correct (handle-exceptions exn (begin (print-call-chain (current-error-port)) ~~(debug:print 0 "ERROR: problem accessing test db " work-area ", you probably should clean and re-run this test or remove the file "~~ (debug:print 0 #f "ERROR: problem accessing test db " work-area ", you probably should clean and re-run this test or remove the file " dbpath ".\n " ((condition-property-accessor 'exn 'message) exn)) #f) ;; Is there a cheaper single line operation that will check for existance of a table ;; and raise an exception ? (sqlite3:execute db "SELECT id FROM test_data LIMIT 1;")) db) ;; no work-area or not readable - create a placeholder to fake rest of world out (let ((baddb (sqlite3:open-database ":memory:"))) ~~(debug:print-info 11 "open-test-db END (unsucessful)" work-area)~~ (debug:print-info 11 #f "open-test-db END (unsucessful)" work-area) ;; provide an in-mem db (this is dangerous!) 
(tdb:testdb-initialize baddb) baddb))) ;; find and open the testdat.db file for an existing test (define (tdb:open-test-db-by-test-id test-id #!key (work-area #f)) (let* ((test-path (if work-area work-area (rmt:test-get-rundir-from-test-id test-id)))) ~~(debug:print 3 "TEST PATH: " test-path)~~ (debug:print 3 #f "TEST PATH: " test-path) (open-test-db test-path))) ;; find and open the testdat.db file for an existing test (define (tdb:open-test-db-by-test-id-local dbstruct run-id test-id #!key (work-area #f)) (let* ((test-path (if work-area work-area (db:test-get-rundir-from-test-id dbstruct run-id test-id)))) ~~(debug:print 3 "TEST PATH: " test-path)~~ (debug:print 3 #f "TEST PATH: " test-path) (open-test-db test-path))) ;; find and open the testdat.db file for an existing test (define (tdb:open-run-close-db-by-test-id-local dbstruct run-id test-id work-area proc . params) (let* ((test-path (if work-area work-area (db:test-get-rundir-from-test-id dbstruct run-id test-id))) (tdb (open-test-db test-path))) (apply proc tdb params))) (define (tdb:testdb-initialize db) ~~(debug:print 11 "db:testdb-initialize START")~~ (debug:print 11 #f "db:testdb-initialize START") (sqlite3:with-transaction db (lambda () (for-each (lambda (sqlcmd) (sqlite3:execute db sqlcmd)) (list "CREATE TABLE IF NOT EXISTS test_rundat (
︙
@@ -169,15 +169,15 @@
         ;; the ackstate is set to 1 once the command has been completed
         "CREATE TABLE IF NOT EXISTS test_meta (
               id INTEGER PRIMARY KEY,
               var TEXT,
               val TEXT,
               ackstate INTEGER DEFAULT 0,
               CONSTRAINT metadat_constraint UNIQUE (var));"))))
-  (debug:print 11 "db:testdb-initialize END"))
+  (debug:print 11 #f "db:testdb-initialize END"))
 
 ;; This routine moved to db:read-test-data
 ;;
 (define (tdb:read-test-data tdb test-id categorypatt)
   (let ((res '()))
     (sqlite3:for-each-row
      (lambda (id test_id category variable value expected tol units comment status type)
︙
206 207 208 209 210 211 212 ~~213~~ 214 215 216 217 218 219 220 221 222 223 224 ~~225~~ 226 227 228 229 230 231 232	206 207 208 209 210 211 212 213 214 215 216 217 218 219 220 221 222 223 224 225 226 227 228 229 230 231 232	- + - +	;; '()))) ;; NOTE: Run this local with #f for db !!! (define (tdb:load-test-data run-id test-id) (let loop ((lin (read-line))) (if (not (eof-object? lin)) (begin ~~(debug:print 4 lin)~~ (debug:print 4 #f lin) (rmt:csv->test-data run-id test-id lin) (loop (read-line))))) ;; roll up the current results. ;; FIXME: Add the status too (rmt:test-data-rollup run-id test-id #f)) ;; NOTE: Run this local with #f for db !!! (define (tdb:load-logpro-data run-id test-id) (let loop ((lin (read-line))) (if (not (eof-object? lin)) (begin ~~(debug:print 4 lin)~~ (debug:print 4 #f lin) (rmt:csv->test-data run-id test-id lin) (loop (read-line))))) ;; roll up the current results. ;; FIXME: Add the status too (rmt:test-data-rollup run-id test-id #f)) (define (tdb:get-prev-tol-for-test tdb test-id category variable)
︙
244 245 246 247 248 249 250 ~~251~~ 252 253 254 255 256 ~~257~~ 258 259 260 261 262 263 264 265 266 267 268 269 270 271 272 273 274 ~~275~~ 276 277 278 279 280 281 282 283 284 285 286 287 ~~288~~ 289 290 291 292 293 294 295	244 245 246 247 248 249 250 251 252 253 254 255 256 257 258 259 260 261 262 263 264 265 266 267 268 269 270 271 272 273 274 275 276 277 278 279 280 281 282 283 284 285 286 287 288 289 290 291 292 293 294 295	- + - + - + - +	;; ;; NOT USED, WILL BE REMOVED ;; (define (tdb:get-steps-table steps);; organise the steps for better readability (let ((res (make-hash-table))) (for-each (lambda (step) ~~(debug:print 6 "step=" step)~~ (debug:print 6 #f "step=" step) (let ((record (hash-table-ref/default res (tdb:step-get-stepname step) ;; stepname start end status Duration Logfile (vector (tdb:step-get-stepname step) "" "" "" "" "")))) ~~(debug:print 6 "record(before) = " record~~ (debug:print 6 #f "record(before) = " record "\nid: " (tdb:step-get-id step) "\nstepname: " (tdb:step-get-stepname step) "\nstate: " (tdb:step-get-state step) "\nstatus: " (tdb:step-get-status step) "\ntime: " (tdb:step-get-event_time step)) (case (string->symbol (tdb:step-get-state step)) ((start)(vector-set! record 1 (tdb:step-get-event_time step)) (vector-set! record 3 (if (equal? (vector-ref record 3) "") (tdb:step-get-status step))) (if (> (string-length (tdb:step-get-logfile step)) 0) (vector-set! record 5 (tdb:step-get-logfile step)))) ((end) (vector-set! record 2 (any->number (tdb:step-get-event_time step))) (vector-set! record 3 (tdb:step-get-status step)) (vector-set! record 4 (let ((startt (any->number (vector-ref record 1))) (endt (any->number (vector-ref record 2)))) ~~(debug:print 4 "record[1]=" (vector-ref record 1)~~ (debug:print 4 #f "record[1]=" (vector-ref record 1) ", startt=" startt ", endt=" endt ", get-status: " (tdb:step-get-status step)) (if (and (number? startt)(number? 
endt)) (seconds->hr-min-sec (- endt startt)) "-1"))) (if (> (string-length (tdb:step-get-logfile step)) 0) (vector-set! record 5 (tdb:step-get-logfile step)))) (else (vector-set! record 2 (tdb:step-get-state step)) (vector-set! record 3 (tdb:step-get-status step)) (vector-set! record 4 (tdb:step-get-event_time step)))) (hash-table-set! res (tdb:step-get-stepname step) record) ~~(debug:print 6 "record(after) = " record~~ (debug:print 6 #f "record(after) = " record "\nid: " (tdb:step-get-id step) "\nstepname: " (tdb:step-get-stepname step) "\nstate: " (tdb:step-get-state step) "\nstatus: " (tdb:step-get-status step) "\ntime: " (tdb:step-get-event_time step)))) ;; (else (vector-set! record 1 (tdb:step-get-event_time step))) (sort steps (lambda (a b)
︙
305 306 307 308 309 310 311 ~~312~~ 313 314 315 316 317 ~~318~~ 319 320 321 322 323 324 325 326 327 328 329 330 331 332 333 334 335 ~~336~~ 337 338 339 340 341 342 343 344 345 346 347 348 ~~349~~ 350 351 352 353 354 355 356	305 306 307 308 309 310 311 312 313 314 315 316 317 318 319 320 321 322 323 324 325 326 327 328 329 330 331 332 333 334 335 336 337 338 339 340 341 342 343 344 345 346 347 348 349 350 351 352 353 354 355 356	- + - + - + - +	;; get a pretty table to summarize steps ;; (define (tdb:get-steps-table-list steps) ;; organise the steps for better readability (let ((res (make-hash-table))) (for-each (lambda (step) ~~(debug:print 6 "step=" step)~~ (debug:print 6 #f "step=" step) (let ((record (hash-table-ref/default res (tdb:step-get-stepname step) ;; stepname start end status (vector (tdb:step-get-stepname step) "" "" "" "" "")))) ~~(debug:print 6 "record(before) = " record~~ (debug:print 6 #f "record(before) = " record "\nid: " (tdb:step-get-id step) "\nstepname: " (tdb:step-get-stepname step) "\nstate: " (tdb:step-get-state step) "\nstatus: " (tdb:step-get-status step) "\ntime: " (tdb:step-get-event_time step)) (case (string->symbol (tdb:step-get-state step)) ((start)(vector-set! record 1 (tdb:step-get-event_time step)) (vector-set! record 3 (if (equal? (vector-ref record 3) "") (tdb:step-get-status step))) (if (> (string-length (tdb:step-get-logfile step)) 0) (vector-set! record 5 (tdb:step-get-logfile step)))) ((end) (vector-set! record 2 (any->number (tdb:step-get-event_time step))) (vector-set! record 3 (tdb:step-get-status step)) (vector-set! record 4 (let ((startt (any->number (vector-ref record 1))) (endt (any->number (vector-ref record 2)))) ~~(debug:print 4 "record[1]=" (vector-ref record 1)~~ (debug:print 4 #f "record[1]=" (vector-ref record 1) ", startt=" startt ", endt=" endt ", get-status: " (tdb:step-get-status step)) (if (and (number? startt)(number? 
endt)) (seconds->hr-min-sec (- endt startt)) "-1"))) (if (> (string-length (tdb:step-get-logfile step)) 0) (vector-set! record 5 (tdb:step-get-logfile step)))) (else (vector-set! record 2 (tdb:step-get-state step)) (vector-set! record 3 (tdb:step-get-status step)) (vector-set! record 4 (tdb:step-get-event_time step)))) (hash-table-set! res (tdb:step-get-stepname step) record) ~~(debug:print 6 "record(after) = " record~~ (debug:print 6 #f "record(after) = " record "\nid: " (tdb:step-get-id step) "\nstepname: " (tdb:step-get-stepname step) "\nstate: " (tdb:step-get-state step) "\nstatus: " (tdb:step-get-status step) "\ntime: " (tdb:step-get-event_time step)))) ;; (else (vector-set! record 1 (tdb:step-get-event_time step))) (sort steps (lambda (a b)
︙
      (define (tdb:remote-update-testdat-meta-info run-id test-id work-area cpuload diskfree minutes)
        (let ((tdb (rmt:open-test-db-by-test-id run-id test-id work-area: work-area)))
          (if (sqlite3:database? tdb)
              (begin
                (sqlite3:execute tdb "INSERT INTO test_rundat (update_time,cpuload,diskfree,run_duration) VALUES (strftime('%s','now'),?,?,?);"
                                 cpuload diskfree minutes)
                (sqlite3:finalize! tdb))
    -         (debug:print 2 "Can't update testdat.db for test " test-id " read-only or non-existant"))))
    +         (debug:print 2 #f "Can't update testdat.db for test " test-id " read-only or non-existant"))))

      ;;======================================================================
      ;; S E R V E R
      ;;======================================================================

      ;; Run like this:
      ;;
      ;; ./rununittest.sh server 1;(cd simplerun;megatest -stop-server 0)

      (delete-file* "logs/1.log")
      (define run-id 1)

    - (test "setup for run" #t (begin (launch:setup-for-run)
    + (test "setup for run" #t (begin (launch:setup)
                                      (string? (getenv "MT_RUN_AREA_HOME"))))

      ;; NON Server tests go here

      (test #f #f (db:dbdat-get-path db))
      (test #f #f (db:get-run-name-from-id db run-id))
      ;; (test #f '("SYSTEM" "RELEASE") (rmt:get-keys))
︙
      ;; ;;
      ;; ;; Not sure how the following should work, replacing it with system of megatest -server
      ;; ;; (test "launch server" #t (let ((pid (process-fork (lambda ()
      ;; ;; ;; (daemon:ize)
      ;; ;; (server:launch 'http)))))
      ;; ;; (set! server-pid pid)
      ;; ;; (number? pid)))
    - ;; (system "../../bin/megatest -server - -debug 22 > server.log 2> server.log &")
    + ;; (system "../../bin/megatest -server - -debugbcom 22 > server.log 2> server.log &")
      ;;
      ;; (let loop ((n 10))
      ;; (thread-sleep! 1) ;; need to wait for server to start.
      ;; (let ((res (open-run-close tasks:get-best-server tasks:open-db)))
      ;; (print "tasks:get-best-server returned " res)
      ;; (if (and (not res)
      ;; (> n 0))
︙