Meet at 2pm, location TBA
Tae Woo: finished with finals, done with coursework!! w00t! Fixed some bugs with GreenFIE-HD: some offsets were 0-based instead of 1-based and vice versa. Still more bugs to work on, but it's coming.
Christopher: spoke with Justin (parents both Tawainese, is a Taiwan RM) vis a vis Jiapu project; Justin will finish 240 during Summer Term; finished finals; fixed gray background problem; strange problem with hover handler (stopped working for some reason), but it's fixed by changing it to mouse-enter/mouse-leave events; fixed another strange issue with highlighting that he fixed
Dr. Embley: convert-to-GedcomX program now works; it will need some refactoring, but not quite yet; has bounding-box success! Currently working on cross-tool integration; has found an off-by-one error in the OSMX document coming from somewhere upstream in the tool chain. There will be a host of other issues. Blocker: resource
URL is not being passed downstream.
Dr. Liddle: wrote final, gave finals, graded finals (still reconciling, wrapping up grading); Jenkins is working properly
Dr. Woodfield: has been working on the build; is beginning to use the wrappers; other program runs fine; needs a large data set to be able to generate soft-constraint distributions (Dr. Embley will check on whether we can get this data)
Sidebars:
Demo COMET updates
Tool chain inconsistencies/issues
Woodfield's build
Directory structure adjustments (Liddle/Embley)
Liddle/Almquist tomorrow at 10am (communicating with ListReader)
Tae Woo: busy with class projects this week; learned about “expected value”, precision/recall, and thinks perhaps this could help with his project
Dr. Packer: building Alpha as we speak; has Netbeans 8; will look at test-case errors next time
Christopher: lots of class projects this week; found a way to populate form, but getting empty string when sending it back
Dr. Woodfield: spent a lot of time learning the Ontology Workbench; needs to learn more about Netbeans now
Dr. Embley: thin thread works all the way through: extraction from page to GedcomX!
Dr. Liddle: lots of time refactoring; lots of time with other DEG members
Dr. Lonsdale: made progress with OntoSoar, and now has committed; got locations working and can associate them with different types of life events; has christening and other events in new ontology working; still left: multiple-word locations (e.g. “Salt Lake City”) and 4-ary relationships; will be gone the next four weeks
Sidebars:
Liddle/Kim: Netbeans 8 and Java 8
Woodfield/Liddle: wrappers-consuming project setup
Liddle/Embley: keep Reema's stuff??
Liddle/Embley: access to file assets while running on dithers (server) instead of workstation
Dr. Packer: working with Dr. Embley on ListReader
Christopher: added “Execute ListReader” button to UI; working on train ListReader and populate form
Dr. Woodfield: got Netbeans running with JDK 8; unable to get into Jenkins
Dr. Embley: quarterly goal for Q1 was to get a thin thread through all steps in the ensemble; hasn't quite completed the very last piece: GedcomX; Q2 goal is a full thread with all the tools (OntoSoar, ListReader, COMET, GreenFIE-HD, all producing GedcomX output)
Dr. Liddle: worked with Drs. Embley and Lonsdale quite a bit; did a ton of refactoring work
Dr. Lonsdale: finally got Maven, Soar, Ontos, and Netbeans all playing nicely with each other [next week is last Wednesday he'll be here for a while – getting back in town May 6]
Tae Woo: made progress on programming GreenFIE-HD; calculating precision and recall (working on some bugs still); has some recovery issues right now
Sidebars:
ListReader test cases for Java 8 (Liddle, Packer) – AStar code no longer is relevant – likewise CRF; likely need to work on HMM code: see rev. 5865, 5866
Dr. Woodfield needs access to Jenkins (Liddle, Woodfield) – done
The Alpha refactor & wrappers (everyone)
RegEx highlighting for COMET (Embley, Almquist, Liddle)
User Story 4 for OntoSoar (Embley, Lonsdale, Liddle)
Dr. Lonsdale: made some progress on Soar setup – version number was a big issue; still not to the point where we can build at-will and be fully integrated with the rest of the Alpha project tree
Tae Woo: hasn't made much progress this week, but finished OsmxToJson so it emits correct JSON now
Dr. Packer: has reduced DEG hours, and is spending more time with Dr. Embley to get ListReader to run on his machines
Dr. Woodfield: decided that modifying the XSD won't be helpful; syntax checking at the XML level isn't particularly helpful; working on Netbeans install (noted Wiki instructions are old)
Christopher: made the red delete button smaller; multiple fields turning grey doesn't have an easy solution (might be a Bootstrap issue); for populating fields with ListReader annotations, has an idea of how to get the fields, but it needs more discussion
Dr. Embley: mounting a steep learning curve, getting into wrappers; have agreement with Randy on GedcomX interface; highlighting demo is working well
Dr. Liddle: worked with Embley & Lonsdale on various implementation issues and the ensemble layout
Sidebars:
KAE build
Netbeans build issues with Dr. Woodfield
Christopher: how to get fields from labels
Ensemble directory structure
Dr. Liddle: made good progress on PDFIndexer issues (fixed bug, redid character bounding boxes to match PDFBox 2.0.0-SNAPSHOT)
Dr. Packer: finished testing HMM with production mode scripts; worked on 3 pages of example; looking for smooth transition path
Dr. Woodfield: looked at the object wrapper stuff, made progress understanding it; looked at long-term view of how to use the constraint checking tool; needs to figure out how to use the code
Christopher: Got rid of hover issues and added “annoying popup” on navigation away from page; also resized the delete button
Dr. Embley: almost fully resolved on GEDCOMX with Randy; worked with Liddle to access pages; doing lots of programming
Dr. Lonsdale: spent too much time trying to get SOAR built properly in both Eclipse and Netbeans to no avail (will meet with Liddle on Friday)
Tae Woo: just about finished coding for the import (needs about an hour's worth of work still); evaluator should be done within about a week after that; need to begin integration with COMET (will meet with Embley/Liddle after next week's meeting)
Sidebars:
Tae Woo: has done un-nest operation, bidirectionally; need to re-nest and emit JSON
Dr. Packer: working on text string passing between ListReader & COMET; adding text is a bit harder than previously thought
Christopher: resource ID seems to be the best way forward – see sidebar; some bug fixes; working on 288, 299
Dr. Woodfield: dynamic constraint checking may be difficult, and it could be lower-hanging fruit to run after saving from annotator?
Dr. Embley: did a demo for John from PDF through FROnTIER, to COMET, merger, inferencing for gender etc., Duke to de-duplicate, final output through Peter's code to produce GEDCOMX (which isn't even close to what we want). Nearly finished mock-up of the GEDCOMX that we do want. Figured out how to get bounding boxes into it (4-corner relative or pixel-based). We could actually take our work and put it into FamilySearch.
Dr. Liddle: helped various people with programming; progress studying web components and Polymer (interesting possibilities for us); discussed architecture of Fe6 management console thoroughly with Dr. Embley
Dr. Lonsdale: figured out that it's not OSMXDocumentLoader causing grief, but something in the classpath settings somewhere
Sidebars:
JSON being passed between COMET and ListReader (both Dr. Packer and Christopher have discussion items)
How important is it to have text string in what's sent to ListReader? Answer: not very. Let's punt on passing text back and forth.
COMET needs to know source ID or resource object, or some way to get to it; using only a path will cause half of the other things to break that depend on the full resource.
Hosting FamilySearch data extensions
Current priorities
Bounding boxes
Refactoring projects in the SVN repository (e.g. OntoSoar bound up with other modules)
Welcome back Peter! And Liddle!
Dr. Liddle: got some work done getting object-existence rules into the converter.
Peter: happy about having thesis paper accepted to AAAI where it was well received.
Dr. Lonsdale: coding frenzy - ocean = OntoSoar running on Eclipse Lunar; upgraded to Soar 9.4; got everything to build in Eclipse, also on NetBeans. Had to comment out an OSMX test in loader code. Document loader in OntoSoar has some issues with changes in the library. Needs permission to commit code to SVN repository.
Tae Woo: busy with class projects and being sick; working on unnesting records; next up will be sorting by offset, then dealing with parent1/parent2 merge situations
Dr. Packer: turned a version of ListReader to Christopher to play with; next up is adding text to the JSON
Christopher: hasn't yet been able to use the new ListReader; just changed it to work with the new format; discussed using parameter instead of file with Dr. Packer; has ListReader highlighting working
Dr. Embley: did some coding (not quite a coding frenzy, snow falling) dealing with the full line from PDF to GEDCOMX; will need to deal with jobs soon (directory structure, etc.)
Dr. Woodfield: working on constraints, regular expressions, adding semantics
Sidebars:
SVN repo access for lonz
Annotator2 popup on hover needs to go away
How to deal with bounding boxes (begin by setting up a tool that post-processes OSMX files to read bounding-box info from XML files)
What Dr. Embley is promising to hand over to Randy Wilson
ListReader: demo, also Dr. Packer & Christopher on parameter discussion
Discussed Fe6 Architecture (quarterly goals and the current status of each in the progress reports below)
Dr. Liddle: in Hawaii (but coded object-existence rules)
Dr. Lonsdale: OntoSoar running under Eclipse, needs new target ontology
Tae Woo: Coding osmx2json – import done; unnest, merge, nest, export lacking
Dr. Packer: got passed major bug, coordinating with Christopher about transferring info back and forth
Christopher: highlighting largely working, will turn attention to ListReader interface
Dr. Woodfield: questions about mismatch in conceptual models
Dr. Embley: Fe6 architecture; object-existence rules (w/ Dr. Liddle), functional for single-entry fields
Sidebars:
(email) Christopher and Dr. Packer: ListReader interface via file transfer vs. parameter passing
(next week) bounding boxes – pass through vs. determine during FamilyTree generation
(next week) artifacts during highlighting (two fields with identical annotations, focus field highlighted after initial hover, multiple entry fields grey after initial hover, pencil-+-x still shows in annotator2)
Today is RootsTech developer conference; no Dr. Embley today
Dr. Liddle: IRB has approved my proposal; got some programming done with Dr. Embley; fixed OntologyEditor w.r.t. relationship connections; spent some time working with Christopher
Dr. Embley presented at FHTW
Tae Woo: has been programming on getting form JSON ready for the next phase in the pipeline; is implementing it in a way that works just for his thesis; issue: sometimes COMET freezes when advancing to next page; next up is getting various JSON structures finalized
Christopher: hover highlighting is mostly working, but there are some special cases that still need work; deprecated standard mode, so forms now default to grid mode; has questions about adding to nested fields (two similar user stories may need coalescing)
Dr. Woodfield: made progress on CS 340; nearing the end of screens, JS, backend; persistence is up next; two things from conference yesterday: (1) thinking about a mechanism to apply probabilistic constraints to the Church's One Tree to Rule Them All; (2) need a declarative mechanism for specifying rules and constraints. It's time for a new modeling standard.
Sidebars:
Christopher: demo highlighting, user stories on nested fields
Tae Woo: how to really convert to form JSON
Dr. Lonsdale: AAAI conference was really good (1000 people); our talk went well – Peter did a great job
Dr. Woodfield: still being inundated by CS 340; Dr. L & Dr. L may be able to serve as advisers to CS MS students
Dr. Packer: has gotten back into the code, and BigInteger fix seems to be working and working reasonably quickly; will continue with change to JSON instead of OSMX, keeping track of labels to send info back to annotator to highlight different parts of query string
Christopher: has highlight functionality working (calling function from console); finding what is needed in the DOM on hover is interesting, and needs some more work
Dr. Embley: RootsTech development summit is next Wednesday, so won't be to DEG meeting; our FHTW workshop paper was accepted; working on a paper with George Nagy; did some programming; have object existence rules been deployed to annotator2? (needs to be checked)
Dr. Liddle: worked with Dr. Embley on some bugs & coding issues; got IRB application submitted
Sidebars:
Work out example for Christopher to test highlighting
Work out example for calling ListReader ???
Embley presentation outline
Notes:
Congratulations, Joseph, on passing your thesis defense
Joseph: passed defense, got paperwork done
Dr. Lonsdale: wants to sidebar on Maven problem (784 TNRB, 9:30am Thursday)
Tae Woo: has been programming on OsmxToJson, especially converting record part; figured out some solutions, feels we have it; needs deploy of annotator2
Dr. Packer: has been working with Dr. Embley on the paper; should be done next week with those; ListReader development follows
Christopher: has split the highlighting story into three; has something for general highlighting and knows the basic idea for single entry and multiple entry
Dr. Liddle: has been doing miscellaneous things and refactoring osmx2.xsd
Dr. Embley: we have the Springer book Towards the Multilingual Semantic Web, submitted FHTW paper, met with Dr. Lonsdale; wants to talk about recruiting students
Dr. Woodfield: still working on CS 340 for another few weeks
Sidebars:
Rejuvenating the group: could Scott be the pseudo adviser plus a real adviser
OSMX2 .xsd discussion
New flow for downstream that takes as input all the OSMX documents that are produced along the way (solves the problem with gender we had discussed)