-
Notifications
You must be signed in to change notification settings - Fork 15
ENH: improve upload.sh to retry, extract from README.md into a script to automatically test blazegraph #16
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: master
Are you sure you want to change the base?
Conversation
…es have been changed / region definitions update (e.g. Left-Thalamus-Proper -> Left-Thalamus)
…s upon exit 56 if blazegraph is still starting we might fail to upload for a bit -- here we will retry a number of times
Now invoke like ./run_tests_under_blazegraph ./test_1.sh
…eader to get csv not xml
pushed more fixes etc. Now the query invocation looks like
so reading from file directly and getting CSV back (first I forgot to pass that). It then takes ~4 minutes with about 800% CPU load by blazegraph (java) process to answer. It produces CSV but different instead of
getting
so for some reason I have lost "study", I got If I cut/paste line from README as is and run under So, the questions:
|
but I am confused -- it seems that e.g. none of the .ttl files has that sample site I supposed to see in the first result record: ❯ git grep 'ABIDE UCLA_1 Site' -- **/*.ttl
❯ so likely that output recorded in git is some echo from the past and is not expected to be obtained given now contained here data. is that so @dbkeator? |
Similarly to
@djarecka - ping on a potential runner for the oxygraph. |
ok, will come back to it this week. Last time it didn't finished the query before I had to close the laptop and go home |
- data-config.ttl is required to make a new database in graphDB
@yarikoptic - you're running |
nevermind, after removing extra ttl files (that I merged for oxigraph), your example indeed run in 4-5min |
for me
@surchs -- didn't you get some other graphdb running the test? do you mind sharing the runner for it too? |
Yo @yarikoptic: I heard you like graph store benchmarks, so I put one in my fork of your fork, so you can merge while you merge: yarikoptic#1 🎉 |
Adding tests for graphDB
@yarikoptic - it might be that it doesn't have the data. I think that why I was merging the ttl files before uploading |
upload.sh
Outdated
ttl) ct=text/turtle;; | ||
*) ct=text;; # fail it | ||
esac | ||
curl --silent -X POST -H "Content-Type: $ct" --data-binary "@$1" ${GRAPHDB_API_URL:-http://localhost:8889/bigdata/sparql} |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@yarikoptic - I had to change to http://localhost:7878/store?default
for oxigraph to see any data
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
going to http://localhost:7878/store?default IIRC would immediately trigger download of something. web UI to send queries was on top URL IIRC.
But upload.sh is fow uploading data, not query
…On Fri, Mar 15, 2024, 4:15 PM Yaroslav Halchenko ***@***.***> wrote:
***@***.**** commented on this pull request.
------------------------------
In upload.sh
<#16 (comment)>
:
> @@ -1,10 +1,21 @@
#!/bin/bash
-set -eu
+set -u
+
+for attempt in {1..5}; do
+ echo "Uploading #$attempt: $1"
+ case "${1##*.}" in
+ jsonld) ct=application/ld+json;;
+ ttl) ct=text/turtle;;
+ *) ct=text;; # fail it
+ esac
+ curl --silent -X POST -H "Content-Type: $ct" --data-binary "@$1" ${GRAPHDB_API_URL:-http://localhost:8889/bigdata/sparql}
going to http://localhost:7878/store?default IIRC would immediately
trigger download of something. web UI to send queries was on top URL IIRC.
—
Reply to this email directly, view it on GitHub
<#16 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AAMV6GRM4QDSXRNRDU6NG4TYYNJFFAVCNFSM6AAAAABDCBEZ52VHI2DSMVQWIX3LMV43YUDVNRWFEZLROVSXG5CSMV3GSZLXHMYTSNBQGQ3DAOBUGY>
.
You are receiving this because you were mentioned.Message ID:
***@***.***>
|
Co-authored-by: Dorota Jarecka <djarecka@gmail.com>
@yarikoptic @satra I ran the test and the query is not running forever like @djarecka mentioned. Though in AWS it seems like that but I realize it's because the EC2 instance I had was not powerful enough. Running on my local mac, the query executed fine. Executing query on browser: ![]() System config: Chip: Apple M3 Pro Tested Oxygraph version: |
thanks tek. that's pretty reasonable for that query. @yarikoptic - perhaps see if you can replicate on your machine using the same container as @tekrajchhetri ? or give @tekrajchhetri access to the machine to try. |
thanks @tekrajchhetri ! that is great -- may be smth was fixed up in oxigraph! We should get back into business. Would be great to merge
edit: I will retry whenever get a breather ;-) |
@yarikoptic Probably because the latest update to Oxygraph repo was last week :) |
@yarikoptic @satra Seems that the longer time (earlier one) might be due to the initial bootstrapping of the database. I ran the query again multiple times and the execution time averages to 63.XXX seconds. ![]() ![]() ![]() ![]() Thank you. |
FWIW, filed
|
indeed it ran in sensible time now and produced some output! (different from what is committed but it is expected). Now it is time to get those updated queries and more consistently ordered outputs! |
Now you can run
and provide different test files... or replace the runner to start oxygraph instead.
note that I am now using an original .sparql file, not hand changed for curl invocation one. This way there is need to only modify the .sparql file and we should get rid of that all duplication in README.md and _curl.sparql
attn @djarecka