Politech mailing list archives

FC: Mordechai Ben-Menachem on Pentagon's TIA: "Lies and obfuscation"

From: Declan McCullagh <declan () well com>
Date: Thu, 12 Dec 2002 21:04:20 -0500

I think the Aldridge statement that Mordechai is talking about is fromUndersecretary of Defense for Acquisition, Logistics, and Technology PeteAldridge last month at a Pentagon briefing:

http://www.politechbot.com/p-04186.html

News article:
http://www.cnn.com/2002/US/11/20/terror.tracking/index.html

---

From: "Mordechai" <quality () computer org>
To: Declan McCullagh <declan () well com>
Date: Thu, 12 Dec 2002 11:53:28 +0200
Subject: TIA feasability and costs
Reply-to: quality () computer org

Declan,

My name is Mordechai Ben-Menachem. I am a lecturer at Ben-GurionUniversity, Beer-Sheva, Israel.My areas of speciality are software engineering and projectmanagement. Bob Bauman asked me towrite to you to express certain views concerning the DARPA project calledTIA.

I have read the Aldrige testimony. Most of the following was written inreaction to that.

Much of what Aldrige says walks a very narrow line between outright liesand obfuscation. It is simplynot correct. The areas for objection are too broad to cover here, but Ishall try to give a few examples.

1. You cannot talk about "... if they choose to use it." The system ONLYhas value if there is acritical mass of data in it. This means, by definition, that the databasemust be massively populatedand this must be constantly maintained. This is not a situation where onecan query and THEN thesystem will go off to a thousand different databases around the world tosearch for transactions youmay want. There is a fine line here between data collection and dataretrieval. The "if they choose"part can relate to data retrieval, but that makes it a very stickywicket. Existing legal controls (e.g.,search warrants, Miranda) are designed to control data collection, not useof that data once it has been

collected.

2.      Speech recognition /  rapid translation:

The statements are very misleading. No such software exits today. Thestate-of-the-art of voicerecognition / voice response systems is that of a watch (you can also tellyour phone to dial your wife,but only after rigorous training of the system). The accuracy oftranslation systems used today ismostly used as Computer Science jokes. The distance to workable systems isquite profound. Intelhas recently announced a 3 Giga Hertz chip. This infers (via Moore's Law)that we shall see a 6 GigaHertz chip in 18 months. Many authorities have called 6 GH a milestonethat will allow a new set ofapplications. In other words, when those capabilities exist, we may beable to intelligently discussrapid, real-time translation. However, by definition, we do not know howto conceive of thoseapplications now. Perhaps it can be on a supercomputer, as cost is not thegoverning factor -- no, thebasic computational complexity may be solvable on a supercomputer (no proofof that exists) but thereare many other aspects that requires a different type of architecture forreal time usage. He also statedthat there will be voice recognition capabilities to recognise who isspeaking. Totally science fiction,has never been tried in real life. What exists is the ability to match"voice prints" via patternrecognition techniques. Very time consuming and with a very low level ofaccuracy and reliability. I do

not recall it being recognized by any court, for example.

3.      Connections between transactions:

Echelon gathers data from some 8-billion telephone conversationstoday. How successful has thisbeen in the "war on drugs"? The answer is, almost not at all. Add tothat, all airline transactions,chemical purchases, credit card ... How many daily transactions are wetalking about -- 20 billion,more? (Visa alone has some 110 million transactions per day.) There is noway to even imagine howto query this size of database, much less, make any sense of theanswer. In other words, if theymanage to simulate the data (we do not know how to simulate that), and ifthey manage to perform aquery, what do we do with the results of such a query? The datavisualization techniques do not exist.The quantity of false positives will overload any investigative agency(tens of thousands per day). As amatter of fact, the database technology that would allow this type of querydoes not exist, either. I mustadd, on small scales, tens of thousands of transactions, this is beingperformed. The distance to be

able to process five orders of magnitude more is perhaps a decade.

4.      Collaborative reasoning:

This part is probably practical, though the development is still quite away off. I have done a little bit ofwork in this area. (I have an article submitted to a major journal that Ican send you, but it has not yetbeen published.) The major issue here is reliability. We are talkingabout using massive webs ofhierarchical data (that is, the data has both hierarchical attributes andnetwork attributes). With thislevel of complexity, testing such a system is very far beyond ourcapabilities -- we simply have no ideahow to ensure that the answers we are given are correct because we do notknow how to test it. This isnot the only difficulty. The definition of interrelationships is an openissue -- they are not static.

As I said, space and time do not permit me to do a full analysis and I havenot read the fullspecification. The bottom line is composed of two points. The report byPete Aldridge cannot simplybe taken at face value. The system / project, as presently defined remindsme greatly of Reagan's SDIproject. Brilliantly thought of, but much too early. Some of the fruitsof that effort are just now comingon line, 20 years later (e.g., the Arrow anti-ballistic missile and theNautilus anti-tactical rocket lasergun). When SDI was conceived, it was not technologically possible. Thisis not today. In 20 years,who knows, this may be reasonable. Today, the base technologies do notexist. The complexity is toogreat, the size is impossible to conceive. I don't care how passionatePoindexter is. It sounds wrong.

Additionally, I spoke with a colleague of mine whose expertise is in thearea of face recognition andother "bio" technologies. My objective was to double-check that my initialguess-timates werereasonable. He confirms and even thought me rather optimistic on some ofthe things. For instance,"rapid translation" based on speech recognition: I said I thought it a fewyears off. He says it is AT

LEAST 7-10 years off.  The capabilities we see today are very primitive.

In any case, we are talking about a 10-20 year timeframe to demonstratecapabilities -- similar to SDI.You are talking about spending billions of dollars for a project to developa system that has no hope ofbeing useful in a significant time-frame -- the size of the project is muchlarger than what has been

reported, the base technologies do not exist.

best regards, I hope this is helpful and I shall be most pleased to furtherexplain if you like,

  Mordechai Ben-Menachem
        Dept. of Industrial Engineering & Management
        Ben-Gurion University
        P. O. Box 5613; Beer-Sheva; 84156; Israel
        Tel. 972-86-433231, mob. 972-57-433231, off. 972-86-479374

quality () computer org




-------------------------------------------------------------------------
POLITECH -- Declan McCullagh's politics and technology mailing list
You may redistribute this message freely if you include this notice.
To subscribe to Politech: http://www.politechbot.com/info/subscribe.html
This message is archived at http://www.politechbot.com/
Declan McCullagh's photographs are at http://www.mccullagh.org/
-------------------------------------------------------------------------
Like Politech? Make a donation here: http://www.politechbot.com/donate/
Recent CNET News.com articles: http://news.search.com/search?q=declan
-------------------------------------------------------------------------

Current thread:

FC: Mordechai Ben-Menachem on Pentagon's TIA: "Lies and obfuscation" Declan McCullagh (Dec 12)