.\" Copyright (c) 2021 Omar Polo <op@omarpolo.com>
.\"
.\" Permission to use, copy, modify, and distribute this software for any
.\" purpose with or without fee is hereby granted, provided that the above
.\" copyright notice and this permission notice appear in all copies.
.\"
.\" THE SOFTWARE IS PROVIDED "AS IS" AND THE AUTHOR DISCLAIMS ALL WARRANTIES
.\" WITH REGARD TO THIS SOFTWARE INCLUDING ALL IMPLIED WARRANTIES OF
.\" MERCHANTABILITY AND FITNESS. IN NO EVENT SHALL THE AUTHOR BE LIABLE FOR
.\" ANY SPECIAL, DIRECT, INDIRECT, OR CONSEQUENTIAL DAMAGES OR ANY DAMAGES
.\" WHATSOEVER RESULTING FROM LOSS OF USE, DATA OR PROFITS, WHETHER IN AN
.\" ACTION OF CONTRACT, NEGLIGENCE OR OTHER TORTIOUS ACTION, ARISING OUT OF
.\" OR IN CONNECTION WITH THE USE OR PERFORMANCE OF THIS SOFTWARE.
.Dd $Mdocdate: July 29 2021$
.Dt GMID 1
.Os
.Sh NAME
.Nm gmid
.Nd simple and secure Gemini server
.Sh SYNOPSIS
.Nm
.Bk -words
.Op Fl fnv
.Op Fl c Ar config
.Op Fl D Ar macro Ns = Ns Ar value
.Op Fl P Ar pidfile
.Ek
.Nm
.Bk -words
.Op Fl 6hVv
.Op Fl d Pa certs-dir
.Op Fl H Ar hostname
.Op Fl p Ar port
.Op Fl x Pa cgi
.Op Pa dir
.Ek
.Sh DESCRIPTION
.Nm
is a simple and minimal Gemini server that can serve static files,
execute CGI scripts and talk to FastCGI applications.
It can run without a configuration file with a limited set of features
available.
.Pp
.Nm
rereads the configuration file when it receives
.Dv SIGHUP .
.Pp
The options are as follows:
.Bl -tag -width 14m
.It Fl c Pa config
Specify the configuration file.
.It Fl D Ar macro Ns = Ns Ar value
Define
.Ar macro
to be set to
.Ar value
on the command line.
Overrides the definition of
.Ar macro
in the config file if present.
.It Fl f
Stay and log in the foreground.
.It Fl n
Check that the configuration is valid, but don't start the server.
If specified two or more times, dump the configuration in addition to
verifying it.
.It Fl P Pa pidfile
Write the daemon's pid to the given location.
.Ar pidfile
will also act as a lock: if another process is holding a lock on that
file,
.Nm
will refuse to start.
.El
.Pp
If no configuration file is given,
.Nm
runs in
.Dq config-less mode
.Pq i.e. runs in the foreground to serve a directory from the shell
and looks for the following options:
.Bl -tag -width 14m
.It Fl 6
Enable IPv6.
.It Fl d Pa certs-dir
Directory where certificates for the config-less mode are stored.
By default it is
.Pa $XDG_DATA_HOME/gmid ,
i.e.
.Pa ~/.local/share/gmid .
.It Fl H Ar hostname
The hostname
.Po
.Ar localhost
by default
.Pc .
Certificates for the given
.Ar hostname
are searched inside the
.Pa certs-dir
directory given with the
.Fl d
option.
They have the form
.Pa hostname.cert.pem
and
.Pa hostname.key.pem .
If a certificate or a key doesn't exist for a given hostname, they
will be generated automatically.
.It Fl h , Fl -help
Print the usage and exit.
.It Fl p Ar port
The port to listen on, by default 1965.
.It Fl V , Fl -version
Print the version and exit.
.It Fl v
Verbose mode.
Multiple
.Fl v
options increase the verbosity.
.It Fl x Pa path
Enable execution of
.Sx CGI
scripts.
See the description of the
.Ic cgi
option in the
.Sq Servers
section below to learn how
.Pa path
is processed.
Cannot be provided more than once.
.It Pa dir
The root directory to serve.
By default the current working directory is assumed.
.El
.Sh CONFIGURATION FILE
The configuration file is divided into three sections:
.Bl -tag -width xxxx
.It Sy Macros
User-defined variables may be defined and used later, simplifying the
configuration file.
.It Sy Global Options
Global settings for
.Nm .
.It Sy Servers
Virtual host definitions.
.El
.Pp
Within the sections, empty lines are ignored and comments can be put
anywhere in the file using a hash mark
.Pq Sq # ,
and extend to the end of the current line.
A boolean is either the symbol
.Sq on
or
.Sq off .
A string is a sequence of characters wrapped in double quotes,
.Dq like this .
Adjacent strings are joined into a single string:
.Bd -literal -offset indent
# equivalent to "temporary-failure"
block return 40 "temporary" "-" "failure"
.Ed
.Pp
Furthermore, quoting is necessary only when a string needs to contain
special characters
.Pq like spaces or punctuation ,
something that looks like a number or a reserved keyword.
The last example could also have been written as:
.Bd -literal -offset indent
block return 40 temporary "-" failure
.Ed
.Pp
Strict ordering of the sections is not enforced, so it is possible
to mix macros, options and
.Ic server
blocks.
However, defining all the
.Ic server
blocks after the macros and the global options is recommended.
.Pp
Newlines are often optional, except around top-level instructions, and
semicolons
.Dq \&;
can also be optionally used to separate options.
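.Pp
As a minimal sketch, assuming a semicolon is accepted wherever a newline
would otherwise separate two options inside a block, the following
.Ic server
block groups related options on the same line
.Pq the host name and the paths are only illustrative :
.Bd -literal -offset indent
server "example.com" {
	cert "/etc/ssl/example.com.crt"; key "/etc/ssl/private/example.com.key"
	root "/var/gemini/example.com"
	lang "it"; auto index on
}
.Ed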
.Pp
Additional configuration files can be included with the
.Ic include
keyword, for example:
.Bd -literal -offset indent
include "/etc/gmid.conf.local"
.Ed
.Ss Macros
Macros can be defined that will later be expanded in context.
Macro names must start with a letter, digit or underscore and may
contain any of those characters.
Macro names may not be reserved words.
Macros are not expanded inside quotes.
.Pp
Two kinds of macros are supported: variable-like and proper macros.
When a macro is invoked with a
.Dq $
before its name it's expanded as a string, whereas when it's invoked
with a
.Dq @
it's expanded in-place.
.Pp
For example:
.Bd -literal -offset indent
dir = "/var/gemini"
certdir = "/etc/keys"
common = "lang it; auto index on"

server "foo" {
	root $dir "/foo"         # -> /var/gemini/foo
	cert $certdir "/foo.crt" # -> /etc/keys/foo.crt
	key  $certdir "/foo.pem" # -> /etc/keys/foo.pem
	@common
}
.Ed
.Ss Global Options
.Bl -tag -width 12m
.It Ic chroot Pa path
.Xr chroot 2
the process to the given
.Pa path .
The daemon has to be run with root privileges and thus the option
.Ic user
needs to be provided, so privileges can be dropped.
Note that
.Nm
will enter the chroot after loading the TLS keys, but before opening
the virtual host root directories.
It's recommended to keep the TLS keys outside the chroot.
Future versions of
.Nm
may enforce this.
.It Ic ipv6 Ar bool
Enable or disable IPv6 support, off by default.
.It Ic map Ar mime-type Cm to-ext Ar file-extension
Map
.Ar mime-type
to the given
.Ar file-extension .
Both arguments are strings.
.It Ic port Ar portno
The port to listen on.
1965 by default.
.It Ic prefork Ar number
Run the specified number of server processes.
This increases the performance and prevents delays when connecting to
a server.
When not in config-less mode,
.Nm
runs 3 server processes by default.
The maximum number allowed is 16.
.It Ic protocols Ar string
Specify the TLS protocols to enable.
Refer to
.Xr tls_config_parse_protocols 3
for the valid protocol string values.
By default, both TLSv1.3 and TLSv1.2 are enabled.
Use
.Dq tlsv1.3
to enable only TLSv1.3.
.It Ic user Ar string
Run the daemon as the given user.
.El
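.Pp
As an illustration, the global part of a configuration might look like
the following sketch.
The values are assumptions chosen for the example, not recommendations:
.Bd -literal -offset indent
chroot "/var/gemini"	# needs root privileges at startup...
user "_gmid"		# ...which are then dropped to this user
ipv6 on
port 1965		# the default, spelled out
prefork 4		# at most 16
protocols "tlsv1.3"	# TLSv1.3 only
.Ed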
.Ss Servers
Every virtual host is defined by a
.Ic server
block:
.Bl -tag -width Ds
.It Ic server Ar hostname Brq ...
Match the server name using shell globbing rules.
It can be an explicit name,
.Ar www.example.com ,
or a name including wildcards,
.Ar *.example.com .
.El
.Pp
It is followed by a block of options enclosed in curly brackets:
.Bl -tag -width Ds
.It Ic alias Ar name
Specify an additional alias
.Ar name
for this server.
.It Ic auto Ic index Ar bool
If no index file is found, automatically generate a directory listing.
Disabled by default.
.It Ic block Op Ic return Ar code Op Ar meta
Send a reply and close the connection;
by default
.Ar code
is 40
and
.Ar meta
is
.Dq temporary failure .
If
.Ar code
is in the 3x range, then
.Ar meta
is mandatory.
Inside
.Ar meta ,
the following special sequences are supported:
.Bl -tag -width Ds -compact
.It \&%\&%
is replaced with a single
.Sq \&% .
.It \&%p
is replaced with the request path.
.It \&%q
is replaced with the query string of the request.
.It \&%P
is replaced with the server port.
.It \&%N
is replaced with the server name.
.El
.It Ic cert Pa file
Path to the certificate to use for this server.
The
.Pa file
should contain a PEM encoded certificate.
This option is mandatory.
.It Ic cgi Pa path
Execute
.Sx CGI
scripts that match
.Pa path
using shell globbing rules.
.It Ic default type Ar string
Set the default media type that is used if the media type for a
specified extension is not found.
If not specified, the
.Ic default type
is set to
.Dq application/octet-stream .
.It Ic entrypoint Pa path
Handle all the requests for the current virtual host using the
.Sx CGI
script at
.Pa path ,
relative to the current document root.
.It Ic env Ar name Cm = Ar value
Set the environment variable
.Ar name
to
.Ar value
when executing CGI scripts.
Can be provided more than once.
.\" don't document the "spawn <prog>" form because it probably won't
.\" be kept.
.It Ic fastcgi Oo Ic tcp Oc Pa socket Oo Cm port Ar port Oc
Enable
.Sx FastCGI
instead of serving files.
The
.Pa socket
can either be a UNIX-domain socket or a TCP socket.
If the FastCGI application is listening on a UNIX domain socket,
.Pa socket
is a local path name within the
.Xr chroot 2
root directory of
.Nm .
Otherwise, the
.Ic tcp
keyword must be provided and
.Pa socket
is interpreted as a hostname or an IP address.
.Ar port
can be either a port number or the name of a service enclosed in
double quotes.
If not specified, it defaults to 9000.
.It Ic index Ar string
Set the directory index file.
If not specified, it defaults to
.Pa index.gmi .
.It Ic key Pa file
Specify the private key to use for this server.
The
.Pa file
should contain a PEM encoded private key.
This option is mandatory.
.It Ic lang Ar string
Specify the language tag for the text/gemini content served.
If not specified, no
.Dq lang
parameter will be added in the response.
.It Ic location Pa path Brq ...
Specify server configuration rules for a specific location.
The
.Pa path
argument will be matched against the request path with shell globbing
rules.
In case of multiple location statements in the same context, the first
matching location will be put into effect and the later ones ignored.
It is therefore advisable to match for more specific paths first and for
generic ones later on.
A
.Ic location
section may include most of the server configuration rules
except
.Ic alias , Ic cert , Ic cgi , Ic entrypoint , Ic env , Ic key ,
.Ic location , Ic param No and Ic proxy .
.It Ic log Ar bool
Enable or disable the logging for the current server or location block.
.It Ic param Ar name Cm = Ar value
Set the param
.Ar name
to
.Ar value
for FastCGI.
.It Ic ocsp Ar file
Specify an OCSP response to be stapled during TLS handshakes
with this server.
The
.Ar file
should contain a DER-format OCSP response retrieved from an
OCSP server for the
.Ic cert
in use.
If the OCSP response in
.Ar file
is empty, OCSP stapling will not be used.
The default is to not use OCSP stapling.
.It Ic proxy Ar option
Enable request proxying.
.Nm
can forward Gemini requests to other hosts on behalf of the client
if configured to do so.
Multiple options may be specified within curly braces.
Valid options are:
.Bl -tag -width Ds
.It Ic cert Ar file
Specify the client certificate to use when making requests.
.It Ic key Ar file
Specify the client certificate key to use when making requests.
.It Ic protocols Ar string
Specify the TLS protocols allowed when making remote requests.
Refer to the
.Xr tls_config_parse_protocols 3
function for the valid protocol string values.
By default, both TLSv1.2 and TLSv1.3 are enabled.
.It Ic relay-to Ar host : Ns Op Ar port
Relay the request to the given
.Ar host
at the given
.Ar port
.Pq 1965 by default .
.It Ic verifyname Ar bool
Enable or disable the TLS server name verification
.Pq enabled by default .
.El
.It Ic root Pa directory
Specify the root directory for this server
.Pq a.k.a. the current Dq document root .
It's relative to the chroot if enabled.
.It Ic require Ic client Ic ca Pa path
Allow requests only from clients that provide a certificate signed by
the CA certificate in
.Pa path .
It needs to be a PEM-encoded certificate and it's not relative to the
chroot.
.It Ic strip Ar number
Strip
.Ar number
components from the beginning of the path before doing a lookup in the
root directory.
It's also considered for the
.Ar meta
parameter in the scope of a
.Ic block return .
.El
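.Pp
The sketch below shows how some of these options might be combined.
The host names and paths are hypothetical, and it assumes that
.Ic strip
is applied before
.Sq \&%p
is expanded, as the description of
.Ic strip
suggests:
.Bd -literal -offset indent
server "example.com" {
	alias "www.example.com"
	cert "/etc/ssl/example.com.crt"
	key  "/etc/ssl/private/example.com.key"
	root "/var/gemini/example.com"

	# more specific paths first: the first matching
	# location is the one put into effect
	location "/members/public/*" {
		auto index on
	}

	# only clients with a certificate signed by this CA
	# (hypothetical path, not relative to the chroot)
	location "/members/*" {
		require client ca "/etc/ssl/members-ca.pem"
	}

	# 31 is in the 3x range, so meta is mandatory; with
	# "strip 1" a request for /old/page should be
	# redirected to /new/page
	location "/old/*" {
		strip 1
		block return 31 "/new%p"
	}
}
.Ed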
.Sh CGI
When a request for an executable file matches the
.Ic cgi
rule, that file will be executed and its output fed to the client.
.Pp
The CGI scripts are executed in the directory they reside and inherit
the environment from
.Nm
with these additional variables set:
.Bl -tag -width 24m
.It Ev GATEWAY_INTERFACE
.Dq CGI/1.1
.It Ev GEMINI_DOCUMENT_ROOT
The root directory of the virtual host.
.It Ev GEMINI_SCRIPT_FILENAME
Full path to the CGI script being executed.
.It Ev GEMINI_URL
The full IRI of the request.
.It Ev GEMINI_URL_PATH
The path of the request.
.It Ev PATH_INFO
The portion of the requested path that is derived from the IRI
path hierarchy following the part that identifies the script itself.
Can be unset.
.It Ev PATH_TRANSLATED
Present if and only if
.Ev PATH_INFO
is set.
It represents the translation of the
.Ev PATH_INFO .
.Nm
builds this by appending the
.Ev PATH_INFO
to the virtual host directory root.
.It Ev QUERY_STRING
The decoded query string.
.It Ev REMOTE_ADDR , Ev REMOTE_HOST
Textual representation of the client IP.
.It Ev REQUEST_METHOD
This is present only for RFC3875 (CGI) compliance.
It's always set to the empty string.
.It Ev SCRIPT_NAME
The part of the
.Ev GEMINI_URL_PATH
that identifies the current CGI script.
.It Ev SERVER_NAME
The name of the server.
.It Ev SERVER_PORT
The port the server is listening on.
.It Ev SERVER_PROTOCOL
.Dq GEMINI
.It Ev SERVER_SOFTWARE
The name and version of the server, i.e.
.Dq gmid/1.7.3 .
.It Ev AUTH_TYPE
The string "Certificate" if the client used a certificate, otherwise
unset.
.It Ev REMOTE_USER
The subject of the client certificate if provided, otherwise unset.
.It Ev TLS_CLIENT_ISSUER
The issuer of the client certificate if provided, otherwise
unset.
.It Ev TLS_CLIENT_HASH
The hash of the client certificate if provided, otherwise unset.
The format is
.Dq ALGO:HASH .
.It Ev TLS_VERSION
The TLS version negotiated with the peer.
.It Ev TLS_CIPHER
The cipher suite negotiated with the peer.
.It Ev TLS_CIPHER_STRENGTH
The strength in bits for the symmetric cipher that is being used with
the peer.
.It Ev TLS_CLIENT_NOT_AFTER
The time corresponding to the end of the validity period of the peer
certificate in the ISO 8601 format
.Pq e.g. Dq 2021-02-07T20:17:41Z .
.It Ev TLS_CLIENT_NOT_BEFORE
The time corresponding to the start of the validity period of the peer
certificate in the ISO 8601 format.
.El
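.Pp
For instance, a virtual host that runs the scripts under
.Pa cgi-bin
and passes them one extra variable might be sketched as follows.
.Ev APP_MODE
and its value are hypothetical and used only for illustration:
.Bd -literal -offset indent
server "example.com" {
	cert "/etc/ssl/example.com.crt"
	key  "/etc/ssl/private/example.com.key"
	root "/var/gemini/example.com"

	# executable files matching this pattern are run as CGI
	cgi "/cgi-bin/*"

	# added to the variables listed above
	env "APP_MODE" = "production"
}
.Ed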
.Sh FastCGI
.Nm
optionally supports FastCGI.
A
.Ic fastcgi
rule must be present in a server or location block.
Then, all requests matching that server or location will be handled
via the specified FastCGI backend.
.Pp
By default the following variables
.Pq parameters
are sent, and carry the same semantics as with CGI.
More parameters can be added with the
.Ic param
option.
.Pp
.Bl -bullet -compact
.It
GATEWAY_INTERFACE
.It
GEMINI_URL_PATH
.It
QUERY_STRING
.It
REMOTE_ADDR
.It
REMOTE_HOST
.It
REQUEST_METHOD
.It
SERVER_NAME
.It
SERVER_PROTOCOL
.It
SERVER_SOFTWARE
.It
AUTH_TYPE
.It
REMOTE_USER
.It
TLS_CLIENT_ISSUER
.It
TLS_CLIENT_HASH
.It
TLS_VERSION
.It
TLS_CIPHER
.It
TLS_CIPHER_STRENGTH
.It
TLS_CLIENT_NOT_BEFORE
.It
TLS_CLIENT_NOT_AFTER
.El
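.Pp
A sketch of both forms of the
.Ic fastcgi
rule follows.
The host names, the socket path and the
.Dq APP_MODE
parameter are assumptions made up for the example:
.Bd -literal -offset indent
server "app.example.com" {
	cert "/etc/ssl/example.com.crt"
	key  "/etc/ssl/private/example.com.key"

	# UNIX-domain socket, path relative to the chroot
	fastcgi "/run/app.sock"

	# extra parameter sent along with the default ones
	param "APP_MODE" = "production"
}

server "api.example.com" {
	cert "/etc/ssl/example.com.crt"
	key  "/etc/ssl/private/example.com.key"

	# TCP socket; the port defaults to 9000 when omitted
	fastcgi tcp "127.0.0.1" port 9000
}
.Ed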
.Sh MIME
To auto-detect the MIME type of the response
.Nm
looks at the file extension and consults its internal table.
By default the following mappings are loaded, but they can be
overridden or extended using the
.Ic map
configuration option.
If no MIME type is found, the value of
.Ic default type
matching the file
.Ic location
will be used, which is
.Dq application/octet-stream
by default.
.Pp
.Bl -tag -offset indent -width 14m -compact
.It diff
text/x-patch
.It gemini, gmi
text/gemini
.It gif
image/gif
.It jpeg
image/jpeg
.It jpg
image/jpeg
.It markdown, md
text/markdown
.It patch
text/x-patch
.It pdf
application/pdf
.It png
image/png
.It svg
image/svg+xml
.It txt
text/plain
.It xml
text/xml
.El
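.Pp
As a sketch, the table could be extended and the fallback type changed
per location as follows; the extensions and paths are illustrative
choices, not defaults:
.Bd -literal -offset indent
# extend the internal table
map "text/html" to-ext "html"
map "text/html" to-ext "htm"

server "example.com" {
	cert "/etc/ssl/example.com.crt"
	key  "/etc/ssl/private/example.com.key"
	root "/var/gemini/example.com"

	# fallback for unknown extensions in this server
	default type "text/plain"

	# a different fallback for one location only
	location "/downloads/*" {
		default type "application/octet-stream"
	}
}
.Ed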
.Sh LOGGING
Messages and requests are logged by
.Xr syslog 3
using the
.Dv DAEMON
facility or printed on
.Em stderr .
.Pp
Requests are logged with the
.Dv NOTICE
severity.
Each request log entry has the following fields, separated by
whitespace:
.Pp
.Bl -bullet -compact
.It
Client IP address and the source port number, separated by a colon
.It
.Sy GET
keyword
.It
Request URL
.It
Response status
.It
Response meta
.El
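.Pp
Since the
.Ic log
option can be set per server or per location block, request logging
could be turned off for part of a site only.
A minimal sketch, with illustrative paths:
.Bd -literal -offset indent
server "example.com" {
	cert "/etc/ssl/example.com.crt"
	key  "/etc/ssl/private/example.com.key"
	root "/var/gemini/example.com"

	# requests matching this location are not logged
	location "/static/*" {
		log off
	}
}
.Ed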
.Sh EXAMPLES
Serve the current directory:
.Bd -literal -offset indent
$ gmid .
.Ed
.Pp
To serve the directory
.Pa docs
and enable CGI scripts inside
.Pa docs/cgi :
.Bd -literal -offset indent
$ mkdir docs/cgi
$ cat <<EOF > docs/cgi/hello
#!/bin/sh
printf "20 text/plain\er\en"
echo "hello world"
EOF
$ chmod +x docs/cgi/hello
$ gmid -x '/cgi/*' docs
.Ed
.Pp
An X.509 certificate must be provided to run
.Nm
using a configuration file.
First, an RSA key and a self-signed certificate with the common name
.Dq example.com
are created:
.Bd -literal -offset indent
# openssl genrsa \-out /etc/ssl/private/example.com.key 4096
# openssl req \-new \-x509 \e
	\-key /etc/ssl/private/example.com.key \e
	\-out /etc/ssl/example.com.crt \e
	\-days 36500 \-nodes \e
	\-subj "/CN=example.com"
# chmod 600 /etc/ssl/example.com.crt
# chmod 600 /etc/ssl/private/example.com.key
.Ed
.Pp
In the example above, the certificate is valid for one hundred years from
the date it was created, which is normal for TOFU.
.Pp
The following is an example of a possible configuration for a site
that enables only TLSv1.3, adds a mime type for the file extension
.Qq rtf
and defines two virtual hosts:
.Bd -literal -offset indent
ipv6 on		# enable ipv6

protocols "tlsv1.3"

map "application/rtf" to-ext "rtf"

server "example.com" {
	cert "/etc/ssl/example.com.crt"
	key  "/etc/ssl/private/example.com.key"
	root "/var/gemini/example.com"
}

server "it.example.com" {
	cert "/etc/ssl/example.com.crt"
	key  "/etc/ssl/private/example.com.key"
	root "/var/gemini/it.example.com"

	# enable cgi scripts inside "cgi-bin"
	cgi  "/cgi-bin/*"

	# set the language for text/gemini files
	lang "it"
}
.Ed
.Pp
Yet another example, showing how to enable a
.Ic chroot
and use a
.Ic location
rule:
.Bd -literal -offset indent
chroot "/var/gemini"
user "_gmid"

server "example.com" {
	cert "/path/to/cert.pem" # absolute path
	key  "/path/to/key.pem"  # also absolute
	root "/example.com"      # relative to the chroot

	location "/static/*" {
		# load the following rules only for
		# requests that match "/static/*"

		auto index on
		index "index.gemini"
	}
}
.Ed
.Sh ACKNOWLEDGEMENTS
.Nm
uses the
.Dq Flexible and Economical
UTF-8 decoder written by
.An Bjoern Hoehrmann .
.Sh AUTHORS
.An -nosplit
The
.Nm
program was written by
.An Omar Polo Aq Mt op@omarpolo.com .
.Sh CAVEATS
.Bl -bullet
.It
All the root directories are opened during the daemon startup; if a
root directory is deleted and then re-created,
.Nm
won't be able to serve files inside that directory until a restart.
This restriction only applies to the root directories and not their
content.
.It
A %2F sequence is indistinguishable from a literal slash: this is not
RFC3986-compliant.
.It
A %00 sequence is treated as an invalid character and thus rejected.
.El