-This document is a summary of why we're moving to GNUnet NG and what
-this major redesign tries to address.
+This document is a summary of the changes made to GNUnet for version
+0.9.x (from 0.8.x) and what this major redesign tries to address.
First of all, the redesign does not (intentionally) change anything
fundamental about the application-level protocols or how files are
encoded and shared. However, it is not protocol-compatible due to
other changes that do not relate to the essence of the application
-protocols.
-
+protocols. This choice was made since productive development and
+readable code were considered more important than compatibility at
+this point.
The redesign tries to address the following major problem groups
describing isssues that apply more or less to all GNUnet versions
mutexes and almost 1000 lines of lock/unlock operations.
It is challenging for even good programmers to program or
maintain good multi-threaded code with this complexity.
- The excessive locking essentially prevents GNUnet from
+ The excessive locking essentially prevents GNUnet 0.8 from
actually doing much in parallel on multicores.
* Despite efforts like Freeway, it was virtually
- impossible to contribute code to GNUnet that was not
+ impossible to contribute code to GNUnet 0.8 that was not
writen in C/C++.
* Changes to the configuration almost always required restarts
of gnunetd; the existence of change-notifications does not
days, result in really nasty and hard-to-find crashes.
* structs of function pointers in service APIs were
needlessly adding complexity, especially since in
- most cases there was no polymorphism
+ most cases there was no actual polymorphism
SOLUTION:
* Use multiple, lously-coupled processes and one big select
- loop in each (supported by a powerful library to eliminate
+ loop in each (supported by a powerful util library to eliminate
code duplication for each process).
* Eliminate all threads, manage the processes with a
master-process (gnunet-arm, for automatic restart manager)
=> Process priorities can be used to schedule the CPU better
Note that we can not just use one process with a big
select loop because we have blocking operations (and the
- blocking is outside of our control, thanks MySQL,
+ blocking is outside of our control, thanks to MySQL,
sqlite, gethostbyaddr, etc.). So in order to perform
reasonably well, we need some construct for parallel
- execution.
+ execution.
RULE: If your service contains blocking functions, it
- MUST be a process by itself.
+ MUST be a process by itself. If your service
+ is sufficiently complex, you MAY choose to make
+ it a separate process.
* Eliminate structs with function pointers for service APIs;
instead, provide a library (still ending in _service.h) API
that transmits the requests nicely to the respective
thing given the potential for bugs.
* There is no more TIME API function to do anything
with 32-bit seconds
+* There is now a bandwidth API to handle
+ non-trivial bandwidth utilization calculations
PROBLEM GROUP 3 (statistics):
* If GUIs die (or are not properly shutdown), state
of current transactions is lost (FSUI only
saves to disk on shutdown)
+* FILENAME metadata is killed by ECRS/FSUI to avoid
+ exposing HOME, but what if the user set it manually?
-SOLUTION (draft, not done yet, details missing...):
+SOLUTION:
* Eliminate threads from FS-APIs
- => Open question: how to best write the APIs to
- allow integration with diverse event loops
- of GUI libraries?
-* Store FS-state always also on disk
- => Open question: how to do this without
- compromising state/scalability?
+* Incrementally store FS-state always also on disk using many
+ small files instead of one big file
+* Have API to manipulate sharing tree before
+ upload; have auto-construction modify FILENAME
+ but allow user-modifications afterwards
+
+
+PROBLEM GROUP 7 (User experience):
+* Searches often do not return a sufficient / significant number of
+ results
+* Sharing a directory with thousands of similar files (image/jpeg)
+ creates thousands of search results for the mime-type keyword
+ (problem with DB performance, network transmission, caching,
+ end-user display, etc.)
+
+SOLUTION (draft, not done yet, details missing...):
+* Canonicalize keywords (see suggestion on mailinglist end of
+ June 2009: keep consonants and sort those alphabetically);
+ while I think we must have an option to disable this feature
+ (for more private sharing), I do think it would make a reasonable
+ default
+* When sharing directories, extract keywords first and then
+ push keywords that are common in all files up to the
+ directory level; when processing an AND-ed query and a directory
+ is found to match the result, do an inspection on the metadata
+ of the files in the directory to possibly produce further results
+ (requires downloading of the directory in the background)
+
+
+
+SUMMARY:
+* Features eliminated from util:
+ - threading (goal: good riddance!)
+ - complex logging features [ectx-passing, target-kinds] (goal: good riddance!)
+ - complex configuration features [defaults, notifications] (goal: good riddance!)
+ - network traffic monitors (goal: eliminate)
+ - IPC semaphores (goal: d-bus? / eliminate?)
+ - second timers
+* New features in util:
+ - scheduler
+ - service and program boot-strap code
+ - bandwidth and time APIs
+ - buffered IO API
+ - HKDF implementation (crypto)
+ - load calculation API
+ - bandwidth calculation API
+* Major changes in util:
+ - more expressive server (replaces selector)
+ - DNS lookup replaced by async service