Monday, 2014-06-30

*** openstack has joined #magnetodb13:49
jeromatrondukhlov: regarding your email - the query with an index would require ALLOW FILTERING without the complete key, just as a matter of syntax.  so I believe it would do a scan within the partition at this point.  however, I'm more interested in whether it would do the whole scatter-gather thing.  I would look at a trace for the query to see if it just hits the one node.14:14
jeromatronalso, what performance are you seeing with your data model?14:25
dukhlovjeromatron: I measured only request processing time. Querying data from a 10-record dataset takes ~100ms, querying from a 5000-record dataset takes ~2000s on my laptop with a single-node Cassandra cluster14:38
dukhlovthe DynamoDB data model allows up to 10 GB of data stored per partition key14:40
jeromatronhow wide are the rows?  and you're trying to emulate the DynamoDB limits, right - so people can seamlessly move over?14:40
dukhlovso it could be a great amount of data, and filtering is not an option14:40
jeromatronperhaps a different data modelling route might be good.  Have you spoken with Patrick McFadin about the modelling that you're doing?  he works for DataStax on the community side and has done a whole lot of data models with customers.  he might have some tips.14:41
dukhlovno, I haven't14:42
jeromatronI'll forward the email to him and CC you and see if he has any suggestions.  Feel free to add to the email.14:43
dukhlovbut the problem is that DynamoDB has a lot of features. And if we change the data model, it usually makes implementing another feature impossible or too complicated14:47
dukhlovlike conditional operations14:47
dukhlovok, thank you I will talk with McFadin14:49
jeromatronah okay.  are you able to make the indexed column a clustering key?  or does that not work for you generally?15:01
jeromatronanyway, I'll send off an email.15:04
*** [o__o] has quit IRC15:51
*** [o__o] has joined #magnetodb15:53
*** [o__o] has quit IRC15:55
*** [o__o] has joined #magnetodb15:57
dukhlovjeromatron: DynamoDB supports up to 5 LSIs. We can use the clustering key to organize only one of them16:03
dukhlovhm, actually now we are using a composite clustering key to organize DynamoDB LSIs16:04
dukhlovbut in this case we have several CQL rows in the table for each DynamoDB row (one for the original row and one per LSI)16:05
dukhlovand must take care of index integrity and consistency ourselves16:06
*** [o__o] has quit IRC16:13
*** [o__o] has joined #magnetodb16:17
jeromatronhmm, are they key-based indexes with DDB?  or bitmap indexes where you can do inequalities and such?16:20
*** jeromatron has quit IRC16:23
dukhlovDynamoDB has a primary key that consists of 2 parts: a HASH key (the partition key in C*; you can specify only an equality condition on it to choose a partition) and a RANGE key (the clustering key in Cassandra; you can get a slice using it, meaning it allows inequalities such as greater-than or less-than conditions)16:28
dukhlovan LSI is an alternative RANGE key on the table16:29
dukhlovso, yes, you can do inequalities with an LSI, just as with the RANGE key or a Cassandra clustering key16:30
dukhlov http://docs.aws.amazon.com/amazondynamodb/latest/developerguide/LSI.html 16:30
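A minimal sketch of the HASH/RANGE key model described above, written in Python rather than CQL so that it runs without a cluster. The `ToyTable` class and its method names are hypothetical and only illustrate the access pattern: equality selects a partition, and the sorted range key allows a slice.

```python
from bisect import bisect_left, bisect_right


class ToyTable:
    """Toy stand-in for a table with a HASH (partition) key and a RANGE (clustering) key."""

    def __init__(self):
        # hash key -> list of (range_key, item), kept sorted by range key
        self.partitions = {}

    def put(self, hash_key, range_key, item):
        rows = self.partitions.setdefault(hash_key, [])
        keys = [k for k, _ in rows]
        pos = bisect_left(keys, range_key)
        if pos < len(rows) and keys[pos] == range_key:
            rows[pos] = (range_key, item)        # same primary key: overwrite
        else:
            rows.insert(pos, (range_key, item))  # keep the partition sorted

    def query(self, hash_key, low=None, high=None):
        """Equality on the hash key, optional inclusive bounds on the range key."""
        rows = self.partitions.get(hash_key, [])
        keys = [k for k, _ in rows]
        start = 0 if low is None else bisect_left(keys, low)
        end = len(rows) if high is None else bisect_right(keys, high)
        return [item for _, item in rows[start:end]]
```

For example, after `put("u1", 1, "a")`, `put("u1", 2, "b")` and `put("u1", 3, "c")`, the call `query("u1", low=2)` returns the slice for range keys 2 and 3 without touching any other partition.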
dukhlovCREATE TABLE <table_name> (hash_key_attr <text/decimal/blob>, index_name <text>,indexed_attr_str_value text,indexed_attr_number_value decimal,indexed_attr_blob_value blob,range_key_att <text/decimal/blob>, PRIMARY KEY (hash_key_attr, index_name, indexed_attr_value, range_key_att))16:35
dukhlovhere is our table structure16:36
dukhlovhash_key_attr is partition key16:36
dukhlovthe other 4 fields are the composite clustering key16:37
dukhlovour table contains duplicated CQL rows for each LSI16:38
dukhlovand the original row16:38
dukhlovand those rows are combined in sorted groups by index value16:40
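The layout described above (one original row plus one duplicated row per LSI, grouped and sorted by index value inside the partition) can be sketched as a toy Python model. This is an illustration under assumptions, not MagnetoDB's implementation: the `LsiTable` class, the `ORIGINAL` marker, and the single untyped index value (instead of the three typed value columns in the schema) are all hypothetical simplifications.

```python
from bisect import bisect_left

ORIGINAL = ""  # hypothetical index_name marker for the non-index copy of a row


class LsiTable:
    """Toy model: clustering key = (index_name, indexed_value, range_key)."""

    def __init__(self, indexes):
        self.indexes = indexes   # LSI name -> name of the indexed attribute
        self.partitions = {}     # hash key -> list of (clustering_key, item), sorted

    def put(self, hash_key, range_key, item):
        rows = self.partitions.setdefault(hash_key, [])
        # The application must drop stale copies itself; nothing else keeps
        # the duplicated index rows consistent with the original row.
        rows[:] = [r for r in rows if r[0][2] != range_key]
        rows.append(((ORIGINAL, "", range_key), item))
        for index_name, attr in self.indexes.items():
            if attr in item:  # items without the attribute get no index row
                rows.append(((index_name, item[attr], range_key), item))
        rows.sort(key=lambda r: r[0])

    def query_index(self, hash_key, index_name, low=None, high=None):
        """Slice one index group within a partition, inclusive bounds on the value."""
        rows = self.partitions.get(hash_key, [])
        keys = [ck for ck, _ in rows]
        prefix = (index_name,) if low is None else (index_name, low)
        start = bisect_left(keys, prefix)  # first row of the requested slice
        result = []
        for (name, value, _), item in rows[start:]:
            if name != index_name or (high is not None and value > high):
                break  # left the index group or passed the upper bound
            result.append(item)
        return result
```

Each `put` rewrites every copy of the item, which is the integrity cost mentioned earlier in the conversation: a DynamoDB-style row with N indexed attributes becomes N + 1 stored rows that have to be replaced together.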
openstackgerritLonggeek proposed a change to stackforge/magnetodb: Update requirements file matching global requ  https://review.openstack.org/103589 16:53
openstackgerritLonggeek proposed a change to stackforge/magnetodb: Update requirements file matching global requ and add the six module  https://review.openstack.org/103589 16:56
*** vnaboychenko has joined #magnetodb17:24
*** vnaboychenko has quit IRC17:33
*** jeromatron has joined #magnetodb19:02
*** vnaboychenko has joined #magnetodb19:09
*** vnaboychenko has quit IRC19:22
*** jeromatron has quit IRC19:22
*** keith_newstadt has quit IRC19:25
*** jeromatron has joined #magnetodb19:33
*** jeromatron has quit IRC19:37
*** jeromatron has joined #magnetodb21:55
*** charlesw has quit IRC22:34
*** jeromatron has quit IRC22:43
*** openstackgerrit has quit IRC23:19
*** openstackgerrit has joined #magnetodb23:21
