This action will delete this post on this instance and on all federated instances, and it cannot be undone. Are you certain you want to delete this post?
This action will delete this post on this instance and on all federated instances, and it cannot be undone. Are you certain you want to delete this post?
This action will block this actor and hide all of their past and future posts. Are you certain you want to block this actor?
This action will block this object. Are you certain you want to block this object?
Introduction | https://epiktistes.com/introduction |
---|---|
GitHub | https://github.com/toddsundsted/ktistec |
Pronouns | he/him |
🌎 | Sector 001 |
Release v2.4.6 of Ktistec is out. As mentioned in an earlier post, this release focuses on database performance improvements. This means caching the results of expensive queries (like counting all posts with a particular hashtag or mention). On my instance at least, pages like the notifications page are now snappier.
There are still slow queries (queries that take more than 50msec). Most of those are requests for pages of old posts where none of the necessary database pages are in the page cache. I have increased the page cache size, and that reduces the frequency, but I don't see an immediate fix.
Fixed
Changed
Removed
X-Auth-Token
.Other
POST
socket operations.I don't have an immediate plan for the next release. There have been a bunch of feature requests that I think have merit. I'll probably get started on some of those.
You can specify the SQLite database and pass options (pragmas like cache_size
, journal_mode
, ...) on the command line when you start the Ktistec server. The following example sets the cache size to 20,000 pages (up from the default of 2,000 pages) which improves performance on larger instances.
KTISTEC_DB=~/ktistec.db\?cache_size=-20000 ./server
You can also enable the write-ahead log (but make sure you know what that means).
KTISTEC_DB=~/ktistec.db\?journal_mode=wal\&synchronous=normal ./server
Pragmas supported are limited to those listed here.
A new release of ktistec that improves database performance is imminent. In the past, database optimization usually meant "fixing a bunch of poorly constructed queries", and I'm sure there's more of that to do—I'm not an expert. But this time, I found most of the queries were as good as they were going to get on my watch (I'm not an expert). If you have a million records and you need to filter and count them, that's just going to take some time...
So this time, I focused on caching the results of queries like that (which really means I focused on cache invalidation, right). A case in point is commit d544b1af. Previously, the nodeinfo
endpoint filtered and counted posts on every request, and it took +80msec to do that. Worse, the filtering pushed everything else out of the sqlite page cache, which made the next, unrelated database query slow!
Caching this value, and only recounting when I post something, not only dropped the service time for the request to ~1msec but actually improved database performance, generally!
More to come...
Finally I could release Blueprint 1.0.0 after almost 2 years in development.
it's interesting to see what scans show up in the logs:
2025-01-24 16:24:11 UTC 404 GET /.env 1.16ms 2025-01-24 16:24:11 UTC 404 GET /.env 563.87µs 2025-01-24 16:24:14 UTC 404 GET /.aws/credentials 601.43µs 2025-01-24 16:24:14 UTC 404 GET /.aws/credentials 498.43µs 2025-01-24 16:24:16 UTC 404 GET /.env.example 609.78µs 2025-01-24 16:24:16 UTC 404 GET /.env.example 544.13µs 2025-01-24 16:24:18 UTC 404 GET /.env.production 798.14µs 2025-01-24 16:24:19 UTC 404 GET /admin/.env 628.06µs 2025-01-24 16:24:23 UTC 404 GET /api/.env 906.66µs 2025-01-24 16:24:25 UTC 404 GET /app/.env 574.45µs 2025-01-24 16:24:27 UTC 404 GET /app_dev.php/_profiler/open?file=app/config/parameters.yml 537.69µs 2025-01-24 16:24:33 UTC 404 GET /app_dev.php/_profiler/phpinfo 841.8µs 2025-01-24 16:24:35 UTC 404 GET /backend/.env 513.92µs 2025-01-24 16:24:36 UTC 404 GET /core/.env 661.94µs 2025-01-24 16:24:38 UTC 404 GET /credentials 649.68µs 2025-01-24 16:24:40 UTC 404 GET /crm/.env 480.42µs 2025-01-24 16:24:43 UTC 404 GET /demo/.env 579.16µs 2025-01-24 16:24:49 UTC 404 GET /info/ 614.09µs 2025-01-24 16:24:51 UTC 404 GET /infos/ 705.33µs 2025-01-24 16:24:54 UTC 404 GET /pinfo.php 489.59µs 2025-01-24 16:24:58 UTC 404 GET /vendor/.env 780.1µs
this reminds me that i have to make responding to those requests much much slower...
If you're running an instance of Ktistec and want to see what other ActivityPub instances are sending you, turn on JSON-LD processing debug
logging.
ktistec.json_ld
setting.Ktistec will dump received activities to the log, after the activity has been parsed into JSON but before JSON-LD expansion.
2025-01-22 14:53:17 UTC 409 POST /actors/toddsundsted/inbox 4.29ms 2025-01-22T14:53:17.597172Z DEBUG - ktistec.json_ld: {"@context" => ["https://www.w3.org/ns/activitystreams", "https://w3id.org/security/v1"], "id" => "https://random.site/users/FooBar#delete", "type" => "Delete", "actor" => "https://random.site/users/FooBar", "object" => "https://random.site/users/FooBar", "to" => ["https://www.w3.org/ns/activitystreams#Public"], "signature" => {"type" => "RsaSignature2017", "creator" => "https://random.site/users/FooBar#main-key", "created" => "2025-01-22T14:52:40Z", "signatureValue" => "01234567890abcdefghijklmnopqrstuvwxyz=="}}
Answer to a FAQ:
The server returns HTTP status code 409 ("Conflict") if it has already received an activity.
Finally published a library I've wanted for a long time: a Crystal type for dealing with both calendar and monotonic durations in the same object.
Crystal is fast because methods are monomorphized at compile time. In simple terms, that means that at compile time, a polymorphic method is replaced by one or more type-specific instantiations of that method. The following polymorphic code...
def plus(x, y) x + y end
...is effectively replaced by two methods—one that does integer addition if called with two integers, and one that does string concatenation if called with two strings.
This extends to inherited methods, which are implicitly also passed self
. You can see this in action if you dump and inspect the symbols in a compiled program:
class FooBar def self.foo puts "#{self}.foo" end def bar puts "#{self}.bar" end end FooBar.foo FooBar.new.bar class Quux < FooBar end Quux.foo Quux.new.bar
Dumping the symbols, you see multiple instantiations of the methods foo
and bar
:
... _*FooBar#bar:Nil _*FooBar::foo:Nil _*FooBar@Object::to_s<String::Builder>:Nil _*FooBar@Reference#to_s<String::Builder>:Nil _*FooBar@Reference::new:FooBar _*Quux@FooBar#bar:Nil _*Quux@FooBar::foo:Nil _*Quux@Object::to_s<String::Builder>:Nil _*Quux@Reference#to_s<String::Builder>:Nil _*Quux@Reference::new:Quux ...
The optimizer in release builds is pretty good at cleaning up the obvious duplication. But during my optimization work on Ktistec, I found that a lot of duplicate code shows up anyway.
Most pernicious are weighty methods that don't depend on class or instance state (don't make explicit or implicit reference to self
). As I blogged about earlier, this commit replaced calls to the inherited method map
on subclasses with calls to the method map
defined on the base class and reduced the executable size by ~5.8%. The code was identical and the optimizer could remove the unused duplicates.
So, as a general rule, if you intend to use inheritance, put utility code that doesn't reference the state or the methods on the class or instance in an adjacent utility class—as I eventually did with this commit.
(The full thread starts here.)
Ktistec release v2.4.5 rolls out the build time and executable size optimizations I've been blogging about here. It also fixes a few small bugs.
Fixed
Changed
I've started a branch full of query optimizations. My general rule—as highlighted in the server logs—is if a query takes longer than 50msec, it takes too long. It's time to address some problems...
The Ktistec executable is now ~24.7% smaller and build times are 28% faster.
I've been blogging about optimizations here, here, and here. This is the summary of the final outcome, with links to commits for the curious. I have one more post planned with a summary of my thoughts.
Here's my approach. Use nm
to dump the symbols in a release build executable and then look for things that seem redundant. The first change and associated post below is a great example of what I mean—my original implementation led to the specialization of the #==
method for every pairwise combination of model classes even though the result of the comparison was just false
.
This might seem like a strange approach if you come from a compiled language where you mostly write all of the code yourself or invoke generics explicitly, but Crystal takes your code and does that for you. And it's not always obvious up front (to me, at least) what the final cost will be.
I've include counts of the lines added/removed because the point of this whole post is to say if you measure first and then optimize, a small change can have a big impact.
Here are the changes:
#==
. (+7 -5)Hash
. (+2 -2)__for_internal_use_only
that get passed both named tuples and hashes by going all in with named tuples. It also eliminates instantiations of the Hash
generic type itself for these cases. Reduces executable size by ~2.2%.__for_internal_use_only
entirely.InstanceMethods
 instance methods. (+1 -5)map
 from base ActivityPub model classes. (+10 -2)map
is a class method defined on each ActivityPub base model class. Each definition maps JSON-LD to a hash that is used to instantiate the class. Class methods defined on a base class are available on subclasses, as well. Calling the method on the subclass results in a copy of the method. This change reduces the executable size by ~5.8%.map
 into helper. (+104 -88) map
method does not depend on class/instance state. This change ensures that the mapping code is not duplicated even if a subclass's map
method is accidentally again called. It looks like a lot of changes but this commit is mostly reorganization. It reduces executable size by ~0.4%.I'm off to optimize some queries now...