go to post Jon Willeke · Oct 1, 2020 I would not say that try-catch is slow. If you reach the end of a try block, it simply branches over the catch block. This is a very cheap operation. In the nominal case (i.e., no error or exception), the cost of a try block is probably less than setting a %Status variable, and it would take several try blocks to match the cost of setting $zt once. The catch block does almost all of the work. It instantiates the exception object, unwinds the stack, etc. I don't know offhand how catch compares to $zt, but the performance of the exceptional cases is usually not as important.
go to post Jon Willeke · Sep 28, 2020 RAII isn't a well established pattern in ObjectScript, so you frequently end up duplicating code in the main block and the catch block that could otherwise live in a finally block. Besides "resources", clean-up code may restore state of any kind--globals, files, the current namespace, etc.
go to post Jon Willeke · Sep 28, 2020 As I recall, some of the conventions of the object library date back to Caché 2.0 in the late nineties, whereas try-catch wasn't introduced until Caché 4.0. One arguable advantage of %Status-based interfaces is that they work well with the noisy %UnitTest framework. If you $$$AssertStatusOK on a series of method calls, the test log reads like a transcript of your activity. You can often understand the failure without even needing to read the test program. Also, try-catch is kind of annoying to use without finally, which was never implemented.
go to post Jon Willeke · Sep 11, 2020 One of my stylistic peeves is that parentheses around an if condition are not necessary, and I prefer to omit them for readability. You'll often need parens for a function call and/or to evaluate operators correctly, so best not to clutter the code with a superfluous pair.
go to post Jon Willeke · Sep 9, 2020 It has been a while since I've looked at the Python binding, but my recollection is that it is based on the C binding (not to be confused with the C callin API), which is based on the C++ binding, which does not have to be on the same system as the Caché server. However, you do need to satisfy those dependencies on the client.
go to post Jon Willeke · Aug 31, 2020 You're getting a "Datatype validation" error, which suggests that the XML parsed fine, but that the value "/shared/BENANDERSON" is not valid for the PArray property of the ExecuteResult class. If PArray is an array, I don't remember how that's typically projected to XML, and I can't find an example in the documentation. Lists are projected as a container element with the contents nested as elements within. Are you reading in something that was written using an %XML.Writer, or are you designing a class to read an XML file from somewhere else? It might help to see more context in the XML, and the relevant definitions from the ExecuteResult class.
go to post Jon Willeke · Aug 12, 2020 You need to identify what character(s) the trailing squares actually are. Typically an empty box is not whitespace, but something for which your font doesn't have a glyph. Can you post the output of zzdump for one of the strings?
go to post Jon Willeke · Jul 20, 2020 FIPS 180-4 describes SHA-512 et al., FIPS 198-1 describes HMAC, and PKCS #5 describes PBKDF2, which depends on HMAC-SHA. As for NIST, special publication 800-132 (now ten years old) states: "This Recommendation approves PBKDF2 as the PBKDF using HMAC with any approved hash function as the PRF." For more recent guidance, consider special publication 800-63B. As I understand it, none of the weaknesses in SHA affect HMAC or PBKDF2. However, if SHA-1 is no longer FIPS approved, the NIST guidance would indicate replacing it with, say, SHA-2 or SHA-3. In terms of strength, PBKDF2 essentially has two parameters, the hash function, and the iteration count. For the hash function, bigger is usually slower, therefore stronger. For the iteration count, PKCS #5 and NIST 800-132 both suggest a minimum of 1,000. NIST 800-63B states: "the iteration count SHOULD be as large as verification server performance will allow, typically at least 10,000 iterations."
go to post Jon Willeke · Jul 8, 2020 I see this behavior in the examples included with that post; e.g., "Age":"66". The %ZJSON package is large, and I don't see a trivial fix offhand. You might play around with the GenExportLiteral() method, which has a mind-numbing quantity of quotation marks. Otherwise, I suggest opening an issue on Github.
go to post Jon Willeke · May 20, 2020 This is for a simple train-test split as part of testing for IntegratedML, currently in beta.
go to post Jon Willeke · May 20, 2020 One thing I've done to split machine learning datasets is to use an auxiliary table that maps IDs to a random number. I write a stored procedure that returns a random number. I create a table with two columns, one for the ID of the source table, and one to hold a random number. I populate the column for the source IDs: insert into random_table (source_id) select id from source_table I then populate the column for the random number: update random_table set random_number = MySchema.MySP_Random(1E9) Then I can select with an ORDER BY clause on the random number: select top 10 source_id from random_table order by random_number, source_id It depends on your use case whether this will be appropriate for a source table with millions of rows. It's an expensive way to select just one row.
go to post Jon Willeke · Apr 24, 2020 I agree that the most likely path to a Rust binding is to wrap a C or C++ API. If you're content with a local client, callin and/or callout is the place to start. As you said, it shouldn't be too hard to write a callout library in Rust. Callin, on the other hand, (and callback from a callout library) is a bit more involved, requiring a lot of unsafe code. If you want a remote client, you could look at wrapping the C or C++ binding, but that's a dead end that is not supported in IRIS. You might also look into relational access or an ORM. Diesel looks promising, but I don't know whether it (or Rust in general) works well with ODBC.
go to post Jon Willeke · Mar 25, 2020 Although $list is just a string serialization format, it is conventional to declare such a string in a method parameter or return type as %List (i.e., %Library.List).
go to post Jon Willeke · Feb 27, 2020 I found this limitation surprising too. My understanding is that it has to do with the current device being the Native API's TCP connection to the server. I can never remember how to mention someone on this forum, but I'll try to make @Raj.Singh aware of your feedback.
go to post Jon Willeke · Feb 3, 2020 In Caché, the dev/cpp and dev/python directories contain the C++ and Python bindings, respectively. As far as I know, these are not available for InterSystems IRIS. They were shipped as source, so maybe they would work with IRIS? I doubt that it's been tested. The dev/iris-callin directory contains the headers for the C callin API. This includes files that are compatible with programs written using headers that are shipped with Caché in the dev/cache/callin directory.
go to post Jon Willeke · Nov 13, 2019 This is a simple way to make a secure connection using the same certificates as, say, your browser. If you trust Safari to go to https://example.com/, then you can use this to do the same in InterSystems IRIS. If the certificate on the other end was issued by a well known CA, then it will likely work.
go to post Jon Willeke · Nov 5, 2019 It appears that your link is to a Docker image of the application installed on YottaDB, a fork of GT.M. I followed the link at the bottom of the page to the FOIA release page: https://code.osehra.org/journal/journal/view/1576 After downloading the latest copy and skimming the documentation, this release includes a CACHE.DAT database and extensive installation and configuration instructions for Caché. If you want to run the application on InterSystems IRIS, you would do better to start there than the YottaDB-based image. For a container deployment, there are three things you'll want to be aware of: durable %SYS, bind mounts, and ports. I believe that durable %SYS is covered in the documentation. Basically, it will store all of the database and mapping configuration that you do outside of the container. You'll need a bind mount for durable %SYS, and for the RPMS database. You'll want to export the web server port so that you can access the management portal. One other suggestion: if you're new to InterSystems IRIS and/or containers, you may want to start with Docker locally on your system before tackling a cloud deployment. Good luck, and have fun.
go to post Jon Willeke · Oct 29, 2019 FWIW, you can also replace the call to $extract with two-argument $ascii: $a(md5hash,i)
go to post Jon Willeke · Oct 28, 2019 Whereas the digest methods in the %SYSTEM .Encryption class return binary strings, which are documented in terms of their byte length, it is indeed conventional to display them using hexadecimal. Instead of $select, I more usually see $translate and $justify: s h0x=h0x_$tr($j(chr,2)," ",0) If %xsd.hexBinary is covenient, though, I'd say use that. For a sanity check, you can compare whatever implementation you choose with the zzdump command: USER>s digest=$system.Encryption.MD5Hash("12345678") USER>w ##class(%xsd.hexBinary).LogicalToXSD(digest) 25D55AD283AA400AF464C76D713C07AD USER>zzdump digest 0000: 25 D5 5A D2 83 AA 40 0A F4 64 C7 6D 71 3C 07 AD
go to post Jon Willeke · Oct 1, 2019 Note that for your specific case, our examples have been off by one: USER>w $e("16175551234",*-10,*) 16175551234 You actually want something like this: USER>w $e("16175551234",*-9,*) 6175551234 If you're not sure whether the transformation is being applied at all, maybe do something more dramatic, like $reverse, as a sanity check?