This site contains older material on Eiffel. For the main Eiffel page, see http://www.eiffel.com.





L'OBJET1.3: Interview de Jacob Stein



AVERTISSEMENT

Ceci est le texte complet de l'interview de Jacob Stein, retranscrit par une secrétaire. Il a été laissé a l'état original, y compris bruit de cuisine, interruptions sur la bande, et fautes de transcription ... d'où la spontanéité qui fait tout le charme de ce document brut de fonderie.

En cas de doute ou de surprise, se reporter au texte français, qui a lui été revu de près!


Bertrand Meyer: How did you get into the field of Object Oriented data bases and what has your personal itinerary been?

Jacob Stein: I began working with David Maier on object extentions to a relational model at Stony Brook. I spent 8 years at Servio where ai got involved with the Object Database Management group. I spent a year and a half on my own afterwards and have been at Sybase for the last year and a half. Currently I'm Vice Chair of the Object Database Management group, I'm still involved with O.M.G. and I'm Strategic Planning Manager with responsibilities lying in the area of object technology at Sybase.

Bertrand Meyer: Do you think this itinerary is representative of the evolution in this field going from something that is fairly specialized and had the pure object label attached to it to something which is becoming more main stream........

Jacob Stein:I think that the fact that Sybase employed me is indicative of the fact that technology is moving more main stream. Whether it gets co-opted or compromised on the way, well the results aren't in yet we'll have to see.

Bertrand Meyer: Do you think that's a good evolution or a little bitter after the whole power of the ideas eventually made it?

Jacob Stein: Ultimately I don't think that it matters how the technology makes its way to the market place. It would be nice if the individuals in the companies that pioneered the technology were the ones that succeeded with it but that's not always the way it is.

Bertrand Meyer: What your appraisal of the actual success of Object Database ideas?

Jacob Stein: I think that the Object Database vendors as a whole haven't quite fulfilled the expectations that they had of themselves or the industry had of them. I think the vendors will still be successful but they'll be successful in specialized markets: Technical design markets, perhaps as middleware in future architectures, perhaps as data marks in data wherehousing architecture so that data's transformed from the operational system into an object data base to be used by applications built in object languages. But the notion that we held, I guess whenever a new industry starts you always have grand visions, that object data base vendors themselves represent the next generation of mainstream vendors in the sense of oracle Sybase in the performance doesn't seem to be happening.

Bertrand Meyer: And why is that?

Jacob Stein: I think for one thing, this time around, things are different than when the relational federation took on code. So I think that the large relational vendors were very well intrenched and the requirements, scaleability availability and reliability were in order of magnitude higher than when the relational products came into the market and that's a very hard wall to climb.

Bertrand Meyer: So is it partly a matter of the technology being introduced too early .........

Jacob Stein: I often wonder about that myself. About whether technology was indeed introduced too early in the sense that the products didn't at the time have all the availability, the reliability, scaleability, robustis requirements required in the commercial market. But there are also a few deeper questions relating to whether an architechture that tends to work best with a specific programing language and not well with all languages could ever have hoped to have dominated the market. Most of the products tend to be focused either on C++ or Small Talk and unfortunately the commercial market requires the you be able to access the data from a dozen programming languages and that's a more difficult thing to do.

Bertrand Meyer: And also another approach for someone...going even further into this direction that you critisized mainly taking a programming language and almost hiding the systems completely .......

Jacob Stein: I think persistant programming languages object data bases focus on a particular language have their place and certainly if the organizations commited primarily to a single language then a single language approach works well because of the degree of transparency you can see that otherwise impossible. The trouble arises when an organization grows and it has multiple applications to support compromised on the way, well the results aren't in yet we'll have to see.

Bertrand Meyer: Do you think that's a good evolution or a little bitter after the whole power of the ideas eventually made it?

Jacob Stein: Ultimately I don't think that it matters how the technology makes its way to the market place. It would be nice if the individuals in the companies that pioneered the technology were the ones that succeeded with it but that's not always the way it is.

Bertrand Meyer: What your appraisal of the actual success of Object Database ideas ......

Jacob Stein: I think that the Object Database vendors as a whole haven't quite fulfilled the expectations that they had of themselves or the industry had of them. I think the vendors will still be successful but they'll be successful in specialized markets: Technical design markets, perhaps as middleware in future architectures, perhaps as data marks in data wherehousing architecture so that data's transformed from the operational system into an object data base to be used by applications built in object languages. But the notion that we held, I guess whenever a new industry starts you always have grand visions, that object data base vendors themselves represent the next generation of mainstream vendors in the sense of oracle Sybase in the performance doesn't seem to be happening.

Bertrand Meyer: And why is that?

Jacob Stein: I think for one thing, this time around, things are different than when the relational federation took on code. So I think that the large relational vendors were very well intrenched and the requirements, scaleability availability and reliability were in order of magnitude higher than when the relational products came into the market and that's a very hard wall to climb.

Bertrand Meyer: So is it partly a matter of the technology being introduced too early .........

Jacob Stein: I often wonder about that myself. About whether technology was indeed introduced too early in the sense that the products didn't at the time have all the availability, the reliability, scaleability, robustis requirements required in the commercial market. But there are also a few deeper questions relating to whether an architechture that tends to work best with a specific programing language and not well with all languages could ever have hoped to have dominated the market. Most of the products tend to be focused either on C++ or Small Talk and unfortunately the commercial market requires that you be able to access the data from a dozen programming languages and that's a more difficult thing to do.

Bertrand Meyer: And also another approach going even further into this direction that you critisized mainly taking a programming language and almost hiding the systems completely ......

Jacob Stein: I think persistant programming languages object data bases focus on a particular language have their place and certainly if the organizations committed primarily to a single language than a single language approach works well because the degree of transparency as you can see is otherwise impossible. The trouble arises when an organization grows and it has multiple applications to support with a data base. When new languages are introduced that also have to be supported than it's difficult to provide that same level of transparency in multiple languages. So for small organizations or for applications that are in a sense stand alone that dont require a high degree of interaction with other applications to the rest of the environment language specific solutions probably are unbeatable in terms of the ability to quickly import and display applications and maintain them and there's a market there but I don't know if its what if it supports the 500.

Bertrand Meyer: So what is the market ........but we are reaching the end of that and we are going to conquer the industry at large ..... what is the market today and what will it be in a few years?

Jacob Stein: Part of the market I think is still taking a place for proprietary file systems. You look at Menta Graphics 10 years ago and they were able to be the market leader in their market running on one platform. Apollo. And today there's no hope of bringing out a cat cam product that runs on just one platform and when they run on one platform they can fill the database for that platform that's essentially a proprietary file sytem extension and then they discovered all of the sudden they were in the data base business cause they had to port this proprietary data base management system of course multiple operating systems and in a sense that began the intial opportunities for the object data base vendors. And to a certain degree that still is an opportunity because their are applications with very complex data where the data doesn't easily accomodate ... relationtional platform. Those are great candidates for using an object data base. Those sorts of applications wont go away because for all the scaleability and robustness that the large relational products bring to the market they cant satisfy every need. I think applications of those natures will always exist but I think there's room for the object data base vendors to grow but it will be likely within the domains of the larger relational vendors and playing roles within those environments but the object data base itself is not likely to become the central enterprise operational system but in terms of data marts and running specific applications on such as: Servio is running wafer fabrications with the TI . There's still room for these object vendors to grow to be profitable companies. The question is, will there be a few hundred million object data bases, I think there will be fifty one hundred million dollar companies. Will there be a half a billion dollar object data base vendors, a billion dollar object data base vendors, thats going to be much harder to achieve.

(Tape recorder went of)

Jacob Stein: ... using object data bases. There's a difference between having a half dozen referenceable mission to criticle applictions and being excepted within MIS. The tension in MIS is always one of "I need to be on the cutting edge and deliver a high degree of functionality and yet I still have to be conservative and make sure I deal with vendors that I know will be around in 5 or 10 years. Standards is the big issue.

Bertrand Meyer: ...

Jacob Stein: Sybase is company that believes there's certainly room within the data management world for object data base, they clearly play a role. I think that most of the relational vendors though are focused on extending their existing product lines rather than inventing entirely new data bases. In a sense our task is to deliver improved functionality to the existing customer base and to ask them to learn an entirely new data management paragon would make that task difficult. So I think the relational vendors by and large are going to focused on people who are ready to incorporate what are commonly referred to as abstract data types into their servers. The ability to introduce new indexing mechanisms into the server. Better intergration from their servers with object tools on the client side. I still believe and always have that the best way to get for organizations to get familiar with object technology is on the client side and the application side initially and not initially on the data base management side.

Bertrand Meyer: Why?

Jacob Stein: There's a certain amount of less risk. If a couple of applications don't work out than they don't work out and that's O.K. If you commit big time to an object data base and put a tremendous amount of corporations data in there and it doesn't work out, extricating yourself's harder. I think that in terms of value for effort intially you get more value by focusing objects on the client and applications development. It's on the application side where the fast development's required. It's all the less time to market issues. The data base has to support the applications and get them to market quickly but those focus in the world is really on applications. People buy applications, people don't buy data bases. It's time to move forward. Look at the Success of SAP down on Brad St. If organizations by and large want to buy solutions they're not looking to buy technology anymore.

Bertrand Meyer: So you're seeing in architecture where your have a kind of central data base depository which can be relational and a very high speed, fast access communicator with structurally just world data and in specialized applications where the intelligence ...

Jacob Stein: The operational systems of an organization have to be generalized and generalizeable cause they have to serve a tremendous number of needs. One of the trends you see in data management is the move towards the use of replication both in wherehousing and in nomadic computing. And so one of the options is that you have a generalized installational store, operational store, and then you have specialized stores adding departments and the visions and thats certainly an appropriate role for object data bases. So particular of you've committed to an object programming language within a particular task, lets say in manufacturing, your operational data base might still be relational but the soft form might be running it off of a replicate where you've done transformations as well as data movement into the object data base so the formats were easily accessible from those object programming languages. So that's a reasonable use. But I don't think the object data bases by and large are ever designed to do high transaction proccess rate. High alt to key environments.

Bertrand Meyer: ...

Jacob Stein: I don't think any of the object data base vendors targeted all line transaction proccessing. Primarily when you look at some of the applications they target as a design transactions, they're not all of the environments. They tend to be long transactions accessing large volumes of design data. All of the key environments tend to be very short transactions that are updating very small pieces of data. Given they didn't target that and that's still the primary use of operational data bases, the transactional data, then it's not resonable to expect them ... to capture that part of the market. But we are in a world where people are doing more replication and more data marts which are many wherehouses that are specialized for particular applications of departments. In that area I think they still have a lot of potential. Because a lot of data wherehousing is about data movement and data transformation. And the movement could just as easily be from relational data base and transformed into the formats of an object data base.

Bertrand Meyer:...

Jacob Stein: The issue with all the replication base strategies is that there's always gonna be a delta of time during which the copy is not quite up to date with the operational data base. But there's a large,large number of ... support applications where that's totally appropriate. If you were analysing markets and defining strategy, generally speaking, the data that's directly yesterday is adequate. You don't need data that's correct up to the moment.

Bertrand Meyer: Once in a while you still need to reconcile.

Jacob Stein: But you need to reconcile usually on a nightly or weekly basis. Usually your generally not performing updates on the copies so really it's a refresh more than a reconcilation or you attempt to make it a refresh rather than a reconciliation.

Bertrand Meyer: There seems to be some contraction between the traditional notion of database and O-O ideas. Object technology is based on the principle of data abstraction: this table doesn't exist other than what I use it for so if I'm a carpenter the table is something I make; if I am a restaurant waiter the table is something I put dishes on; and so on. But the table by itself doesn't really exist. Now this is completely different from the view of data base people for whom the data is there, independently of the programs that may use it, and it's my responsibility to get at it.

Jacob Stein: We've always partitioned the walls into proccess oriented aspects and into state oriented aspects. That's why we're able to get data bases verses application development environments. There were certainly some very good reasons for doing that when we did it. When we look at the data base side of the equation we're concerned about preservation that stayed over long periods of time. Accessing data, we developed core languages that had a subset of the functionality of programming languages because we thought that we could optomize those more effectively than full programming languages against tremendous volumes of data. This created problems as well as benefits. Some of the benefits is that the data base person is a very data-centric and the one of the good things about that is that they tend to focus on data integrity and preservation of data ... process oriented puts a little bit less of a focus on "I corupt the data it's totally useless forever" and in the data base arena we, in a sense, did keep data in a deep freeze because we didn't trust the application developers to just have random access to it. Things like data wherehousing and replication though make it possible to give the application developers a freer hand. Because if your working in a duplicate of the data then I know you can't corupt my core repository than I don't have an issue. So now we have the ability to start the flow of the lines between data and proccess. I think that some of the interfaces that the object data base vendors have for the system programming languages with object programming languages provided a greater degree of transparency than you find in any of the data manipulation languages from traditional vendors. The ability to do automatic dereferencing thats objects out of the backing store. They bring us a step closer to the persistant programming languages although you can never ultimately forget that there's a disk drive I mean the same way that you can never that there's memory.

Bertrand Meyer: ...

Jacob Stein: The application developer should be less aware that he's working with data from disk and that task should be, that distinction should only be clear to the people doing administration of the data. Which is a vision that, to a certain degree, the object vendors helped realize. Although it's still imperfect and there's still questions about how you clurry objects. When you doing non navigational actions and you not just following one reference to another reference and you need to find a bunch of data base upon some logical criteria. Most programming environments don't have the concepts that are strong enough to support that effectively. You clearly don't want to do that with brute force looping.

Bertrand Meyer: Are you satisfied with your ODMG standard?

Jacob Stein: ... In standards the primary contribution is in support of relationships and the core language. At Sybase we're working fairly closely with ODMG to help influence of the SQL3 effort. I think that the ODMG clurry language partly because they don't have the demands of backwards compatibility has more elegance than the current draft at SQL3. So we've been working actively with the ODMG trying to have them submit papers the Ansy committy.

Bertrand Meyer: ...

Jacob Stein: Servio, Object design, I think there are 5 or 6 members.

Bertrand Meyer: And there's none of the relational vendors?.

Jacob Stein: Sybase belongs as a reviewer member. We need to be shipping an object data base product supporting either C++ or Small Talk in order to be a full member. So none of relational members are full members and I think we're the only reviewer member.

Bertrand Meyer: And what about the SQL3 ... are all the data base members represented there?

Jacob Stein: The've attended some joint meetings and tend to follow the activity within the Ancy. It's a bit difficult for them to be full time participants. The've increased their level of activity. I think they're going to be submitting several papers to the Ancy group with the focus to modify SQL3.

Bertrand Meyer: ...

Jacob Stein: I don't know whether we'll see mergers or not. I think it's likely that in any competitive developing market not all the participants will grow as quickly and ultimately not all of them will be equals in 5 years. There's likely not sufficient space in the market for a half dozen vendors of equal size. So I think the likleyhood is that 2 or 3 of them will grow fast enough to survive as independent vendors the other 2 or 3 their technology, their companies will be aquired. I don't see mergers as very likely though.

Bertrand Meyer: ...

Jacob Stein: Frankly I don't have a very strong opinion about UNISQL partly because it's so difficult to find out any information. So to me it's just a big fuzzy cloud and we'll see how they do.

Bertrand Meyer: ... trying to bridge the gap.

Jacob Stein: There are a couple of vendors out there that have extended relational technology and the challenge for them is to add into these systems all of what the main stream vendors already have until the scaleability transaction proccessing tools and the like. That's a big challenge. What`s funny is when I work for a 5 million dollar a year company I was amazed we would get compared to billion dollar a year companies and now that I work for a billion dollar a year company I find it equally interesting to be asked to compare ourselves to a 5 million dollar a year company. They have interesting technology. They have a long road. Time will tell. If I knew the answer to who succeeds and who doesn't I would probably be a venture capitalist and not a technologist. I think it's also true now that extended relational in and of itself isn't adequate. I think you need to combine extended data bases with a good services infrastructure. Not everything's an abstract data type somethings are a service. If you take a look at someone who already has a text management system, lets say, now intergrate that in with an extended relational environment. That takes every operator, every index, every access message and register them independently or register each one of those in my extended relational environment. I'll bet you that text management system wasn't designed to allow arbitrary access to its primitive operators. As such you couldn't intergrate that text management system into one of these extended relational products. I'd argue that a text management system is a service not just a collection of agencies. And as such you need an infrastructure more atuned to an ORV or OA to intergrate those services and you need a crurry proccessing environment that cannot just handle new indexes and new types but can also handle crurries that can stand multiple data base services such as a text service. ... you start up some relational domain by working on half the problem. There's a lot to watch out for moving forward that it's all a lot of work ahead of them before they're gonna be able to be main stream players.

Bertrand Meyer: So who's working on the other half of the problem?

Jacob Stein: Right now I think we have a typical situation where people who have an operating system perspective are thinking about only services. People with a data base perspective are only thinking about extended data base products. There appears to be a few pioneers that realize that you need to have both and they need to work in an intergrated fashion. So you can't have a serperate service infrastructure that isn't intergrated to your data base infrastructure and you can't have a data base infrastructure that isn't intergrated to your service infrastructure. I think people will start to realize that over the next couple of years and you'll see better intergration.

Bertrand Meyer: Does it mean that what's usually known as the major players don't participate very much? Another suprising aspect of the situation is that hear very much when you talk about the data bases you don't hear very much about IBM and the like. ...

Jacob Stein: The relational vendors that had moved more slowly that tends to be what happened to the established vendors now. Moving more slowly does not mean you move to late.

Bertrand Meyer: Atleast they have made some noises. I was also thinking of the whole computer industry.

Jacob Stein: IBM has made a lot of noise relatively speaking. Their commitment to Small Talk, The dock in Som. It's been taking a while for them to intergrate all the pieces together there's no arguing about that.

Bertrand Meyer: ...

Jacob Stein: That can't be the case when you have to wait two years for you relational vendor to intergrate in a new index type. So if I have spacial data the data base vendors as a group need to be able to,more easily, incorporate access methods to spacial datas in their products. So one of the things that they're doing is using object technology in a sense within these products by creating abstractions within their infrastructure for indexing mechanisms and for types and for operators. That way you can introduce new types and use new index mechanisms for optimize queries that involves the new types of indexing mechanisms. And in some sense you view that as using object technology within the product line although it might be overtly visible for the outside world. That's clearly a step in the right direction. Also I think we're going to see a lot of changes in the area of core languages so that they're more flexable also. I think the world will soon realize that they need to intergrate data base services in with the other services. If you look at the OS vendors they do tend to have a very application oriented ... In the same way like at Sybase ... architecture took off these C protocols which weren't appropriate for data streaming, there were no data streaming facilities in most docu C mechanisms. We added data streaming to our PC so in a sense had a data oriented version of our PC that's worked very well for us.

(Lost some dialogue -- new side of tape)

Jacob Stein: There's still a lot of promise and a lot to be delivered.

Bertrand Meyer: ...

Jacob Stein: The LMG basically reflects the attitudes of the Unix vendors. They don't seem to realize that they're in a fight for their survival. So you have an inneroperability spec that is still so vague that your core for architechture would only work if I could develop services for the ... infrastructure and farily easily brought them to multiple core implamintations.

Bertrand Meyer: ...

Jacob Stein: Now if it's a case that I have to do a lot of very hard custom porting work to move from one ORB to another with this service than you greatly diminish the desirability of anyone entering the service business for ORBs.