What should go in a next-generation MATLAB X?

David Young on 13 Mar 2024

The first change I would make would be to scrap the special treatment of Nx1 and 1xN matrices. These are given special status (as "column vectors" and "row vectors"), which must, I suppose, be helpful sometimes, but in practice it's confusing (that is, it confuses me) and makes general code much more complex than it should be.

For example, if you write c = a(b) where the values of all the variables are numeric arrays, the rule is that c will be the same shape as b except when a or b is a column vector and the other is a row vector. An exception to a general rule is, as a general rule, a bad thing. One that affects as fundamental an operation as indexing an array is a very bad thing.
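The indexing rule described above can be checked directly. A minimal sketch; the variables a, b, M are made up purely for illustration:

```matlab
a = (1:5)';            % 5x1 column vector
b = [1 2 3];           % 1x3 row vector

c = a(b);              % both are vectors: result follows a's orientation (3x1)

M = magic(4);          % 4x4 matrix
d = M(b);              % source is a matrix: result follows b's shape (1x3)

e = a([1 2; 3 4]);     % index is a matrix: result follows the index (2x2)

disp(size(c)); disp(size(d)); disp(size(e));
```

Only the vector-indexed-by-vector case departs from "c has the shape of b", which is the exception the post objects to.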

Another exception: size truncates trailing 1s except in the case of column vectors, and ndims returns 2 for column vectors. General code therefore has to handle this case specially. For an example of code that could be simpler without these complexities see exindex.
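A short sketch of the size/ndims behavior mentioned here, using throwaway arrays:

```matlab
v = zeros(3, 1);         % column vector
A = zeros(2, 3, 1, 1);   % trailing singleton dimensions

sv = size(v);            % [3 1]  (the trailing 1 is kept for a column vector)
nv = ndims(v);           % 2
sA = size(A);            % [2 3]  (trailing 1s beyond the second dim are dropped)
nA = ndims(A);           % 2
s3 = size(v, 3);         % 1      (any higher dimension reports size 1)
```

General-purpose code that wants to treat v as "one-dimensional" therefore has to test for the N-by-1 case explicitly, for instance with iscolumn.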

It makes for messy code in other ways: my arguments blocks are peppered with (1,1) to indicate scalars, when (1) would be easier to read and should be sufficient.
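For readers who have not used arguments blocks, here is a small sketch of the (1,1) scalar declaration being discussed. The function scaleBy and its parameters are hypothetical, chosen only to illustrate the syntax:

```matlab
y1 = scaleBy([1 2 3]);        % k takes its default value of 2
y2 = scaleBy([1 2 3], 3);

try
    scaleBy([1 2 3], [1 2]);  % k is not 1x1, so validation rejects it
    rejected = false;
catch
    rejected = true;
end

function y = scaleBy(x, k)
    arguments
        x double              % any size
        k (1,1) double = 2    % must be a scalar
    end
    y = k .* x;
end
```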

It's not as if row vector and column vectors are always treated the same as each other. Matrix multiplication distinguishes between them of course, as does the loop construct for. Making them a special category, when they're actually just different shapes of arrays, simply adds complexity.
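The for-loop distinction can be seen in a couple of lines. This is only an illustration with arbitrary values:

```matlab
nRow = 0;
for v = [1 2 3]      % row vector: the loop runs once per column, 3 iterations
    nRow = nRow + 1;
end

nCol = 0;
for v = [1; 2; 3]    % column vector: a single iteration, with v equal to the whole 3x1 column
    nCol = nCol + 1;
end
lastV = v;           % 3x1
```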

Can anyone make a case for keeping this peculiarity?

Jim Svensson on 19 Sep 2023

Some simple things would be nice:

counter += 1; salary *= 2 % operator assignment, or whatever it is called
y = (x < 0) ? 3 : 2*x; % ternary operator
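For comparison, this is roughly what the two wished-for lines look like in current MATLAB syntax. A sketch with arbitrary values, not a proposal for how the new operators should behave:

```matlab
counter = 0;
salary  = 1000;

counter = counter + 1;      % stands in for: counter += 1
salary  = salary * 2;       % stands in for: salary *= 2

x = -4;
if x < 0                    % stands in for: y = (x < 0) ? 3 : 2*x
    y = 3;
else
    y = 2*x;
end

% For an array, logical indexing gives an elementwise version of the same choice
xs = [-1 0 2];
ys = 2*xs;
ys(xs < 0) = 3;
```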

Simon on 6 Sep 2023

Insert 'parfor' option into splitapply( ), grouptransform( ) or create separate parallel versions of those two functions.

Right now the group-based functions run through the groups with a for-loop. It's very slow for data with a large number of groups. When the said data set was run through with a parfor-loop, it was 5 to 10 times faster.

Functional programming, by hiding looping details, brings the coding process closer to human cognition. And parfor is a really powerful beast. The combination of these two infowar-horses will let Matlab take a decisive lead ahead of the sluggish reptile.
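A minimal sketch of the manual workaround being described: looping over groups with parfor instead of calling splitapply. The data are invented, and any speed-up depends on having the Parallel Computing Toolbox and on enough work per group, so treat this as an illustration rather than a benchmark:

```matlab
g = [1 1 2 2 3]';            % grouping variable (made-up data)
v = [10 20 30 40 50]';       % values to aggregate

[G, ids] = findgroups(g);    % G holds the group number of each row
n = numel(ids);

out = zeros(n, 1);
parfor k = 1:n               % one iteration per group
    out(k) = mean(v(G == k));
end

ref = splitapply(@mean, v, G);   % the serial built-in, for comparison
```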

Clay Swackhamer on 5 Sep 2023

My wish list:

A real, beautiful dark theme
Improving the appearance of figures. Reduce padding around subplots, set default axis and tick mark color to black, adjust default linewidth and font sizes to be a bit larger. In general, try to make figures made quickly with default settings look better.
Multi-start options for all solvers in the optimization/curve fit toolbox.
Consistent arguments for plotting functions. I think some still use different capitalization schemes (like "LineWidth" vs "linewidth").
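Until the defaults change, some of the figure-appearance wishes can be approximated per session through the graphics root defaults. A sketch only; the values 1.5 and 12 are arbitrary choices:

```matlab
% Make lines thicker and axes fonts larger for figures created from now on
set(groot, 'defaultLineLineWidth', 1.5);
set(groot, 'defaultAxesFontSize', 12);

lw = get(groot, 'defaultLineLineWidth');
fs = get(groot, 'defaultAxesFontSize');

% Revert to the factory settings
set(groot, 'defaultLineLineWidth', 'remove');
set(groot, 'defaultAxesFontSize', 'remove');
```

Placing the set calls in startup.m would apply them to every session.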

Andrew Janke on 4 Sep 2023

Yeah, @Rik and/or @Paul, go for creating a new "Matlab X Part 2" question; my browsers are also having trouble dealing with how big this question has gotten. I don't see a way I can lock this question; dunno if that's a moderator-only action or I just don't have enough rep.

Rik on 4 Sep 2023

The current thread is fairly close to my arbitrary suggested limit of 50 answers. If you think it makes more sense to start a new thread, go ahead. I'm happy to start a new one, but you can also do it and add it to the list of threads (don't forget to edit the other threads as well).

In an attempt to discourage new answers (while waiting for the ability to soft-lock threads), I have started editing the older questions by putting '[DISCONTINUED]' at the start of the question.

Paul on 4 Sep 2023

@Andrew Janke, @Rik

This wonderful thread is becoming unwieldy and slow to respond to editing on both laptop and desktop for me. If others are having the same problem, perhaps this Question should be locked, at least for new Answers (is that possible?), and a new Question opened for new Answers?

Simon on 1 Sep 2023

My wish list, not about code improvement but about official tutorials.

a tutorial on using splitapply to take advantage of parallel computation.
a tutorial on assignment and indexing involving comma-separated lists and cell arrays. It should not only show what works, but also explain what syntax would go wrong, and why it goes wrong.

For example, x = ["a", "b"] is a 1x2 string array. But then x(:) becomes a column vector, then x{:} is a comma-separated list; then [x{:}] is a character vector 'ab'. Such 'delicate' usage is the biggest bottleneck for my coding process. @Stephen23 has written a tutorial on comma-separated lists. I hope Mathworks staff can take it from there and expand it, covering the use cases of table. For example, suppose T is a table. T(1,:) is a single-row table. But then T{1,:} sometimes works, if the variables' data types can be lumped together, and sometimes fails, if the variables have mixed data types. And when it works, say, when all table variables are 'string', why then is T{1,:} a string array, instead of a comma-separated list? Two similar syntaxes, x{:} and T{1,:}, have two different semantic meanings. That really causes a workflow jam in my coding.
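The chain of results described in this post can be reproduced as follows. A small sketch; the table T is made up and uses the default variable names:

```matlab
x = ["a", "b"];          % 1x2 string array

col    = x(:);           % 2x1 string array (paren indexing keeps the type)
parts  = {x{:}};         % x{:} is a comma-separated list of char vectors, captured in a cell
joined = [x{:}];         % the same list concatenated: the char vector 'ab'

T   = table(["a"; "b"], ["c"; "d"]);
sub = T(1, :);           % 1x2 table
row = T{1, :};           % 1x2 string array, not a comma-separated list
```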

35 Replies

Andrew Janke on 5 Sep 2023

@Simon - I do cast some of my table variables to categorical, and have also noticed things go slower than I expected with them. (Kinda the whole point of categoricals is that they're small and fast compared to strings, right?) I have no idea what would cause categoricals in table variables to go slow.

Simon on 5 Sep 2023

@Andrew Janke Do you cast your table variables to categorical? In my case, if a task is to process strings, it will be many times slower if the variables are cast as categorical. I don't know why. What do you think might have caused that?

Andrew Janke on 4 Sep 2023

> Sounds like you're fairly satisfied with the Matlab table implementation, except for the {} indexing.

Oh yes. I think the Matlab table array implementation is quite good. I'm not even sure the {}-indexing behavior is a problem; I use the multi-variable form of it seldom enough that I don't really have an opinion on whether its concatenation-instead-of-comma-separated-list behavior is an issue. I was just saying it is inconsistent with the {}-indexing behavior of cells and strings and thus unintuitive for a Matlab programmer new to tables; I don't know if that makes it bad.

The main thing about table arrays I think is not great is speed in some cases: addressing variables inside a table, and doing chained/multi-level indexing in to them, is not as fast as with structs, which makes tables unsuitable for some performance-sensitive contexts. (As of R2019b; I haven't benchmarked newer versions.) And I believe that the in-place-modification optimization ("IPMO") of Matlab's execution engine does not work for variables/columns inside a table array, even if the table array is in a local variable and there are no shared references to its underlying variables' data. (I believe that structs, cells, and objects in general share this no-IPMO-on-field-contents limitation, so it's not a weakness unique to table arrays.) And concatenating several tables can be kind of slow.

Paul on 4 Sep 2023

@Andrew Janke

Thanks for the detailed response. Sounds like you're fairly satisfied with the Matlab table implementation, except for the {} indexing.

Paul on 4 Sep 2023

Referring to my own comment, timetable can also be accessed via curly bracing.

Also, as of R2023a, dictionary can also be referenced with curly braces to access cell elements of dictionary entries:

vals = {1:3 , "Bicycle" , @sin};
keys = [1 2 3];
d = dictionary;
d(keys) = vals
d =
  dictionary (double ⟼ cell) with 3 entries:
    1 ⟼ {[1 2 3]}
    2 ⟼ {["Bicycle"]}
    3 ⟼ {@sin}
d{1}
ans = 1×3
     1     2     3
d{2}
ans = "Bicycle"
d{3}
ans = function_handle with value:
    @sin

Apparently curly brace indexing is returning a CSL of the elements stored in the dictionary value cells, rather than a CSL of the dictionary elements themselves, which would be a CSL of cells.

d{1:2}
ans = 1×3
     1     2     3
ans = "Bicycle"

Andrew Janke on 4 Sep 2023

> When you wrote your own table implementation, if {} returned a table, what did () indexing return?

()-indexing also returned a table. This was one of the big differences between my table array design and Matlab's design: My table array was an array of tables, where each element was a whole table (aka relation), and an array of tables could be of arbitrary size, as opposed to a table array always being a 2-D array of the rows & columns/variables inside that single table like Matlab's does. Like, if you call size(t) on a Matlab table array and it returns [2, 5], you're looking at a "single" table of 2 rows and 5 variables/columns. But a size of [2, 5] in my design is an array of 10 tables, each of which might have different numbers of rows and different numbers/names/types of columns. So you could do things like joins or projections with a single method call, and have them apply to many tables at once, with scalar expansion. My table array was more of a container with "stronger" encapsulation of its contents, and each element of a table array was a container that held a whole table/relation, kind of like how in a Matlab cell array, each element of the cell array contains a whole arbitrary-size-and-type array.

I don't think my approach of "array of tables with multi-table function application" ended up being very useful, and I'd probably just do the sizing and ()-indexing Matlab's way if I had to do it all over again. Doing operations over a plain list or array of tables, as opposed to a set of named tables, doesn't seem to happen much in practice, and you can always just slap them in a cell array if you need to do that.

I also had a different approach to dot-indexing. Instead of tbl.Foo being the column/variable Foo inside the table array, I had a special "cols" property that contained the columns for dot-indexing, so it would be tbl.cols.Foo. This meant that methods on table arrays could be called like tbl.meth(...) and you could use tab-expansion on them, and address table-level properties as tbl.Blah instead of tbl.Properties.Blah. I still don't know which of these ways I like better. Probably Matlab's, because column access is such a common operation, and Matlab's direct-column-addressing approach means you can use table arrays as drop-in replacements for structs in many places.

Simon on 4 Sep 2023

@Stephen23. "x is numeric. It has no comma-separated list syntax because it is not a container". I didn't really mean Matlab should do that kind of crazy thing. Just from a mathematical point of view, a scalar can be seen as a vector (so can be implemented as a container). Even Lisp, which constructs almost everything as a list, doesn't render a scalar as a list.

@Stephen23. "**There are some subtleties/differences due to the need to refer to rows: unlike other container types, with tables it is useful to be able to refer to rows (which refer to the content not the container itself)."

"assigning a row of different data types to a table, a challenge I have seen several times on this forum."

Absolutely agreed. At first, I had not been paying enough attention to different levels of extraction from table rows, causing self-doubt :-).

@Walter Roberson. "A few days ago I was trying to write some splitapply code that would have gone notably easier if {} indexing of tables returned comma separated values (or if there had been other syntax that did that.)"

Sharing your frustration. Recently I was trying to shift toward functional programming, and table row stands in the way like a Matryoshka doll--can't access the innermost one without stripping away the outer ones first. Reminder to myself:

T(row,:);                  % a single-row table
C = table2cell(T(row,:));  % a cell array
C{:};                      % a comma-separated list
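The three levels in this reminder can be run end to end on a toy table. A sketch; the table contents and variable names are invented:

```matlab
T = table(["maman"; "papa"], [1; 2], 'VariableNames', {'person', 'x'});
row = 1;

R = T(row, :);               % outer doll: a single-row table
C = table2cell(R);           % middle doll: 1x2 cell array, one cell per variable
[name, val] = C{:};          % innermost: the comma-separated list, captured in two outputs
```

The intermediate variable C is needed because table2cell(...){:} is not valid syntax.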

Paul on 3 Sep 2023

@Andrew Janke

Referring to this comment ...

When you wrote your own table implementation, if {} returned a table, what did () indexing return?

Are there other classes in base Matlab besides cell, table, and string that accept curly brace indexing?

I wonder if there are any toolbox classes that accept brace indexing and how that works, i.e., does {} return addressed elements or something else.

Andrew Janke on 3 Sep 2023

> A few days ago I was trying to write some splitapply code that would have gone notably easier

I hear that. Matlab's splitapply, join, and similar table functions have never really felt quite "right" to me, in terms of their interfaces. I usually end up writing my own wrapper functions on top of them that translate them to interfaces that feel more natural to me. But I'm an old SQL/table-head, maybe my tastes in code are just weird here.

Bruno Luong on 3 Sep 2023

A work around to do comma list on table content in single command

person=["maman"; "papa"; "moi"];
x=rand(3,1);
T=table(person,x)
T = 3×2 table
    person        x    
    _______    ________

    "maman"    0.028974
    "papa"      0.20146
    "moi"        0.3114
struct('dummy',num2cell(T{:,1})).dummy
ans = "maman"
ans = "papa"
ans = "moi"
struct('dummy',num2cell(T{:,2})).dummy
ans = 0.0290
ans = 0.2015
ans = 0.3114

Unfortunately I don't know how to put this little command in a function, since it will return only the first element of the comma list.
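The limitation mentioned here can be illustrated with varargout. In this sketch tocsl is a hypothetical helper name; it hands back all elements only when the caller explicitly asks for that many outputs, and in an expression context only the first one comes through:

```matlab
[p, q, r] = tocsl([1 2 3]);   % three outputs requested explicitly
first     = tocsl([1 2 3]);   % one output requested: only the first element
wrapped   = {tocsl([1 2 3])}; % used inside an expression: still only the first

function varargout = tocsl(A)
    % Return each element of A as a separate output
    varargout = num2cell(A);
end
```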

Walter Roberson on 3 Sep 2023

A few days ago I was trying to write some splitapply code that would have gone notably easier if {} indexing of tables returned comma separated values (or if there had been other syntax that did that.)

Andrew Janke on 3 Sep 2023

Anyway, FWIW, I am similarly bothered as @Simon about how "{...}" brace indexing in to a table array returns a single array (subject to concatenation of the addressed elements) in a single variable/argout instead of a comma-separated list. I think this is probably the more-commonly-used case, so on the one hand it makes sense there. ("{}"-indexing is just an operation you can override or define however you want, at least in user-defined classes.) But there's just no precedent for it in the Matlab base language or standard library. I can't think of any other datatype that accepts brace-indexing and doesn't return a comma-separated list containing "addressed elements" in return. And the inverse as an lvalue.

Back when I implemented table arrays in my own Matlab library, I overrode {}-indexing to subset tables, producing tables. (Because I thought, who would do brace-indexing across multiple columns in a table array, and want it back as a comma-separated list? Why would you even do that?) And my {}-indexing was even weirder: it accepted a string with a SQL-style "WHERE" clause predicate, so you could do like t2 = t{'Date > now-7 && NumErrors > 1'} if you want to see things that blew up recently. Which was clever and concise. But I came to regret it: in standard Matlab usage, brace-indexing is such a specific thing with certain semantic/low-level effects, that I think it's best to just not deviate from that, even if it seems like a really useful thing to do. If I were doing it again today, I would have skipped the {}-indexing override and just used a really short method name, like "q()".

Andrew Janke on 3 Sep 2023

> It already does, that is exactly what happens

Yes. This is maybe a rhetorical issue. I was doing the British-style thing of "oh, perhaps there are reasons this thing works the way it does, and we should try to understand and think about those, instead of just getting grumpy and demanding it work in a different manner" thing. Like, I'm not actually wondering if there's perhaps a reason for that; I am somewhat familiar with those reasons and it's more of a Socratic dialog thing.

Sorry if that approach is condescending; I didn't mean it to be an insult.

Stephen23 on 3 Sep 2023

@Simon: "I don't mean that Matlab should change the meanings of x{:} and T{1,:}"

Yet later you state that you want "Suppose T is a table. T{1,:} returns the first row's contents as a comma-separated list.". That would be a change of meaning of curly braces for tables.

Stephen23 on 3 Sep 2023

"If you've got string arrays in table variables, then I think maybe T{x,y} should return a string array there, and then if you want to get at the "contents" of that string array in terms of char vectors, then you should hit that string array with an additional level of {...} indexing..."

It already does, that is exactly what happens:

one = ["hello";"world"];
two = [pi;NaN];
T = table(one,two)
T = 2×2 table
      one       two  
    _______    ______

    "hello"    3.1416
    "world"       NaN
out = T{1:2,1}
out = 2×1 string array
    "hello"
    "world"
class(out)
ans = 'string'
out{:} % content of string container in a comma-separated list
ans = 'hello'
ans = 'world'

Andrew Janke on 3 Sep 2023

> In scalar case, x=x(:)=x{:}.

Are you sure this actually happens? (I assume by "=" you actually mean something like "isequal()" or "is the same as" in the broader sense; one-equal-sign "=" is the assignment operator, and two-equals "==" is the elementwise equality test.)

In the scalar case of an array x, then x and x(:) are the same thing. But the {...} dereferencing/pop-out operation produces something different. The x{...} operation "reaches in to" the contents of x subsetted by "...". I'm not aware of any case where x{...} is the same as x, unless you do some silly special-case subsref magic to make that happen. And I don't think any regular Matlab datatypes do that.

Andrew Janke on 3 Sep 2023

Also, note that "comma-separated lists" are – as far as I understand it – not a Matlab datatype, but a special value-passing form or whateveryoucallit that only happens in the context of M-code syntactical and control flow constructs. CSLs can be captured in to cell arrays and vice versa, but they are not the same thing.
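A brief sketch of that distinction: a comma-separated list only exists at the point of use (function arguments, concatenation brackets, cell braces), while a cell array is a value that can be stored. The variable names are illustrative:

```matlab
args = {[1 5 3], [], 2};      % a cell array: an ordinary value
m = max(args{:});             % args{:} expands to three arguments: max([1 5 3], [], 2)

s = struct('a', {1, 2, 3});   % 1x3 struct array
vals = [s.a];                 % s.a is a comma-separated list, concatenated here
back = {s.a};                 % the same list captured into a cell array
```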

Andrew Janke on 3 Sep 2023

> {:} acts like CIA trying to turn Jason Bourne back to what he used to be [...]

Yeah well, what if the CIA pays for your Matlab licenses? Bc in my experience that's usually how it is: Matlab is commercial software, often paid for by the "business" instead of "tech" department, and the biz folks like things to just stay more or less like how they're used to.

> Suppose T is a table. [...] T{1,:} returns [...]

I think maybe there's another level of indirection going on here. If T is a table, then T{x,y} will "pop out" the contents of that table array's columns/variables, as opposed to T(x,y) subsetting the table and then returning another table. (And imho, for Matlab, "table array" means exactly the same thing as "table", because tables are arrays, just like everything else in Matlab.)

If you've got string arrays in table variables, then I think maybe T{x,y} should return a string array there, and then if you want to get at the "contents" of that string array in terms of char vectors, then you should hit that string array with an additional level of {...} indexing. Like, if T is a table array with a variable/column "mystr" that contains strings, maybe T{:,1} should pop out that one var and return a string array, and then T{:,1}{:} should then pop out the string array's elements in to a list of charvecs, returned as a "comma-separated list" in this context.

My thesis here is that the string array type provides an additional layer of "indirection" or encapsulation that wraps charvecs in a higher-level type, and that table arrays are another level of composition on top of that, and you should expect one application of {...} indexing to only "pop out" one level's worth of encapsulation or composition.

IMHO, string arrays are kind of a special case here, because Matlab string arrays are kind of a new thing, and the older ways of Matlab string handling were all kinda sloppy hacks layered on top of kinda-too-low-level representations. (http://matlab.wtf)

Stephen23 on 3 Sep 2023

@Simon: it would be quite handy having {:} also define a comma-separated list for tables, which would make that syntax meaning consistent** for all data types. It would also make a few kinds of operations much easier for tables (e.g. assigning a row of different data types to a table, a challenge I have seen several times on this forum). As far as I can tell, the main difference would be in case of multiple columns/variables, which would need to be replaced with horizontal concatenation, i.e. t{:} -> [t{:}].

Note that for comma-separated lists x(:) != x{:}, so your "singular case" example is inconsistent with all other comma-separated lists and is also inconsistent in and of itself: why should a comma-separated list of one array have a completely different behavior to a comma-separated list with any other number of arrays? I would not expect or desire that, it would make comma-separated lists much harder to use (need to program special cases) with all of the resulting latent bugs etc.

" Even in scalar case, x = 2; x{:} returns 2."

No, it does not. Let's try that right now:

x = 2;
x{:}
Brace indexing is not supported for variables of this type.

x is numeric. It has no comma-separated list syntax because it is not a container. It makes no sense to attempt curly-brace indexing on something that is not a container.

**There are some subtleties/differences due to the need to refer to rows: unlike other container types, with tables it is useful to be able to refer to rows (which refer to the content not the container itself).

Simon on 3 Sep 2023

@Rik, thanks for the cellstr solution. I'll give it a shot.

@Paul, your feedback led me to think more deeply about {:}.

x = "abc string";
x{:}
ans = 'abc string'

In that example, {:} acts like the CIA trying to turn Jason Bourne back into what he used to be. Bad practice. I would like {:} to be like the toughest NKVD interrogator. Whatever container it touches, the container will spew his or her contents.

Suppose T is a table.

T{1,:} returns the first row's contents as a comma-separated list. Even in scalar case, x = 2; x{:} returns 2. Under this semantic principle, {:} will behave as a nice symmetric complement to (:).

T(1,:) returns a single-row table, wrapping the content inside.

T{1,:} returns naked contents.

In scalar case, x=x(:)=x{:}. A singular case that doesn't break the general rule, kind of fitting Matlab's birth purpose of serving mathematicians, isn't it?

Paul, the examples you give make my code want to shout out MeToo. When I looked into the built-in rowfun and splitapply, I found they also had the same MeToo moment. Those built-ins must have a carefully crafted local function to handle inconsistent semantic interpretation involving {:}, usually to 'flatten' or 'expand' table rows, or other constructs, to a cell array. (But don't take my words 100% because I lost my patience during the code tracing.)

Steven Lord on 1 Sep 2023

If you have feedback about a specific documentation page (something you expected to find on the page that's not present, a bug on the page, or a suggestion for an alternate way to phrase something on the page that may be clearer or more general) you can select a rating for "How useful was this information?" at the end of the page. Once you select a number of stars that will be replaced with a box asking "Why did you choose this rating?" where you can enter free-form text. I know for a fact that the documentation staff reviews this feedback.

If you have feedback about something that's missing entirely from the documentation, for that I'd recommend you contact Technical Support directly using this link. They can enter your feedback into the enhancement database for the documentation staff to review.

Bruno Luong on 1 Sep 2023

If only we could cascade brace indexing

y = [1 2 3]
num2cell(y){:} % won't work currently

Paul on 1 Sep 2023

@Simon

My understanding is that you have two basic concerns:

a) TMW should have a tutorial on Comma Separated Lists. See Comma Separated Lists. Unfortunately, that doc page is lacking as it does NOT discuss how to generate a comma separated list from a string array, when doing so is a feature as you've pointed out. However, that doc page would not discuss comma separated lists as they relate to tables, because there is no way to generate a comma separated list from a table (at least not according to the doc page I linked previously).

b) T{1,:} returns an array, not a comma separated list, and is therefore inconsistent with use of {} on classes like cell and string where {} does return a comma separated list. Do you think a comma separated list in this case would be more useful?

As an aside, I've sometimes wanted to be able to generate a comma separated list from a numeric array. Alas, such is not possible and one has to resort to workarounds:

y = 1:3
y = 1×3
     1     2     3
try
    y{:};
catch ME
    ME.message
end
ans = 'Brace indexing is not supported for variables of this type.'
struct('y',num2cell(y)).y
ans = 1
ans = 2
ans = 3

Bruno Luong on 1 Sep 2023

@Steven Lord "...backwards compatibility with functions that accept cell arrays containing char vectors"

I see that thanks.

Strictly speaking I consider the backward compatibility is not broken even without the {} behavior on string, since every code written for char still works.

What you call "compatibility" is more like wanting the same functionality working for both string and char. However there are a bunch of other things that cannot work for both classes, such as char arithmetic, extracting sub-chars, numeric conversion, etc...

I remember we were working with a robot using a TCP-IP protocol that sends char arrays/strings. One of my interns changed char to string, or used a function intended to work for one and not for the other; I can tell you that was a very frustrating experience for us when the bug occurred, because the two do not work exactly the same...

Rik on 1 Sep 2023

I really like how string vectors have extremely similar behavior to cellstr. You can pretty much rely on cellstr(data) to convert a string and your code should not require any changes. That especially helped me when the Name=Value syntax was introduced:

MyFun(Name='Value')
ans = "Name"
ans = 'Value'
function MyFun(varargin)
for n=1:nargin
    varargin{n}
end
end

With varargin{:} forwarding the lot to your parser function, this new syntax is automagically supported.
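A sketch of what such forwarding could look like with inputParser as the parser. The cell array args stands in for the varargin that MyFun(Name='Value') receives according to the output above (a string name followed by the value); the parameter name and default are placeholders:

```matlab
% What varargin holds after a call like MyFun(Name='Value')
args = {"Name", 'Value'};

p = inputParser;
addParameter(p, 'Name', 'default');   % declare the optional name-value pair
parse(p, args{:});                    % the comma-separated list forwards every element

result = p.Results.Name;              % the parsed value
```

The same parse call also handles the older MyFun('Name','Value') form, which is why no change to the parser is needed.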

Simon on 1 Sep 2023

@Steven Lord, I think backward compatibility is the reason. When I began using Matlab, there was no string. Those were the good old days. Cell array was a wonderful, powerful thing. Life was much simpler. But it was a little too simple without string. Then there was string. Like any new useful invention, it requires users to readjust. I have saved Loren Shure's wonderful blog post for a quiet rainy-day reading.

Paul on 1 Sep 2023

Frankly, I've never thought about it until I saw that use in this thread. Without thinking about it much more, I don't have any issue with curly brace indexing of a string array returning a comma separated list of char.

I did find this doc Access Characters Within Strings that at least shows how curly brace indexing works to convert one element of a string array to a char and relates that back to similarity of indexing into cell arrays of chars. In that sense, having x{:} return a CSL makes sense in that it mimics the behavior of the "old days" when x would be cell array.

x = {'a' , 'b'};
x{:}
ans = 'a'
ans = 'b'

Having Access Characters Within Strings as a subordinate topic on a doc page for Create String Arrays is quite illogical IMO.

Steven Lord on 1 Sep 2023

If I recall correctly, one of the reasons (perhaps the main reason) for curly brace indexing on a string array returning a comma-separated list of char vectors is for backwards compatibility with functions that accept cell arrays containing char vectors. See the third bullet point in the Looking to the Future section on this post from Loren Shure's blog about working with text in MATLAB.

If we'd made that operation error, I suspect our users would be grumbling something along the lines of "MathWorks, you know what I meant, just go ahead and do it instead of making me change my code to distinguish cellstr and string!" [Actually, you probably wouldn't have because internally MathWorks developers who would have had to make those same changes in our code base would have grumbled before the feature got released!]
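The compatibility argument can be made concrete with code written in the cellstr era. In this sketch (an invented example, not MathWorks code) the inner loop uses names{k} and works unchanged for both a cellstr and a string array, because brace indexing yields a char vector in either case:

```matlab
inputs = {{'ab', 'cde'}, ["ab", "cde"]};   % a cellstr and a string array

total = zeros(1, 2);
for i = 1:2
    names = inputs{i};
    for k = 1:numel(names)
        % names{k} is a char vector for both input types
        total(i) = total(i) + numel(names{k});
    end
end
```

Had names{k} been an error for string arrays, every such loop would have needed a type check or a cellstr conversion first.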

Simon on 1 Sep 2023

@Paul, from the perspective of a practical user of Matlab, I don't mind what x{:} returns. An end-user-oriented tutorial would be good enough. Comma-separated list is a wonderful construct, and I have come gradually to embrace it. Great help came from @Stephen23, who has written an excellent tutorial: comma-separated lists. I hope Matlab staff could expand on that.

I remember I read somewhere, Hacker News maybe, that some experienced programmer in other languages, who wanted to get into Matlab, complained about similar things. The tutorial I suggest would lower Matlab's entrance barrier for, say, C++ programmers.

Bruno Luong on 1 Sep 2023

@Paul What would YOU like

x = ["a" , "b"];
x{:}

returns?

PS: "throw an error" is also a valid answer.

Paul on 1 Sep 2023

@Simon

Does Access Data in Table cover all of the use cases for table indexing that you're looking for?

I, not surprisingly, couldn't find a doc page for x{:} where x is string array

x = ["a" , "b"];
x{:}
ans = 'a'
ans = 'b'

Setting aside concerns about inconsistent semantics for the moment, would T{1,:} returning a comma-separated-list of char be more useful than returning a string array?

Bruno Luong on 1 Sep 2023

@Simon "x{:} and T{1,:}, have two different semantic meaning."

Agree.

It seems to me table heavily overloads subsasgn and subsref, especially for {}, and it's done internally.

I tried long ago to overload {} in my own class, but I wasn't able to make it work.

Rik on 1 Sep 2023

It sounds like we agree. This thread is for changes that would break compatibility. What you're suggesting is keeping the technical behavior the same, but improving the documentation. The threads I linked are more suited for that. I would like to show support for your suggestion by giving you an up-vote, but I don't feel this thread is the most suitable location.

I know that Mathworks staff is monitoring these threads and do consider the comments and votes when deciding what to do with a suggestion. Posting in the correct thread helps giving your suggestion the correct exposure.

Simon on 1 Sep 2023

I don't mean that Matlab should change the meanings of x{:} and T{1,:}. Just a more comprehensive tutorial would lessen a great deal of headache. Better documentation oriented toward beginners goes quite well with 'What should be in next generation', I think. But it's all personal opinion.

Rik on 1 Sep 2023

How exactly would this break compatibility? These things sound like new features, but not really things that will prevent older versions from running the same code.

This sounds more suited to the missing-feature threads (#1 #2): features that you wish Matlab had.

Feel free to move your answer (by posting it there and deleting it here; moving answers between threads is work in progress).

Yevgeniy Gorbachev on 29 Aug 2023

Being dynamically typed makes large programs irritating to develop and makes the language slower (JIT needs some time to do its thing); I think a compiled statically typed MATLAB would be amazing (yes, I know the arguments block is a thing, but that's still checked at runtime)
In-editor vim emulation (IdeaVim is the ideal case)

Sulaymon Eshkabilov on 29 Aug 2023

I have a couple of wishlists:

# 1. Machine Learning applications should have a few features to extract/store the simulation results (numerical data) in the workspace: (1) Regression data (target vs. predicted values), R (correlation coefficient values), Mean Squared Error values (Training, Test, Validation and overall).

# 2. Chord diagram function (a 3rd party function posted on mathworks.com)

dim-ask on 28 Aug 2023

Using string arrays by default for anything that is currently cellstr by default. For example, string columns with readtable, some_table.Properties.VariableNames, etc. Apart from saving me a lot of time adjusting things myself every time, it would help novice Matlab users who may not know how bad long cellstrs are (huge overhead) and have to learn this the hard way (like I did). Even put one of those "not recommended" warnings if somebody uses readtable with cellstr as the default for character columns. It would require adjusting older code accordingly, but it would save a lot more. Maybe put a warning in general whenever somebody defines or uses a 1-d cellstr that is above some length.

1 Reply

Simon on 1 Sep 2023

I had the same headache and am still having it. I recently modified my readtable opts to set all variables as "string" in the data-extraction step. This seems to reduce error-proneness in this step and to speed up the looping algorithm. However, when I want to store the extracted tables, I change group variables to 'categorical' to reduce storage size. String data would occupy much more space on the hard drive.

I would also very much like Matlab to default some_table.Properties.VariableNames to a string array. Somehow, Matlab is inconsistent in defaulting things to cell array or string array. That kind of inconsistency is the biggest slow-down for me.
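
For readers hitting the same cellstr-versus-string friction today, here is a minimal sketch of opting in to string columns. It assumes a release where readtable accepts the 'TextType' option; the table contents and the temporary file are made up for illustration.

```matlab
% Build a tiny CSV so the example is self-contained (made-up data).
T = table({'a'; 'b'}, [1; 2], 'VariableNames', {'name', 'value'});
csvFile = [tempname '.csv'];
writetable(T, csvFile);

% Ask readtable for string columns instead of the cellstr default.
S = readtable(csvFile, 'TextType', 'string');
delete(csvFile);

% VariableNames still comes back as a cellstr, so convert it explicitly.
varNames = string(S.Properties.VariableNames);
```

The explicit string(...) conversion on the last line is the kind of per-call adjustment the posts above would like to see disappear.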

Simon on 28 Aug 2023

Better folder path utility. Python's pathlib is powerful and very intuitive to use. Matlab's dir is cumbersome. To me the problem lies in dir's use of structures and comma-separated lists, which I don't feel at home with.
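
To make the complaint concrete, this is roughly what listing files looks like today. It is only a sketch: the temporary folder and file names are invented for the example.

```matlab
% Create a throwaway folder with two empty files (illustration only).
folder = tempname;
mkdir(folder);
fclose(fopen(fullfile(folder, 'a.txt'), 'w'));
fclose(fopen(fullfile(folder, 'b.txt'), 'w'));

% dir returns a struct array; {d.name} expands a comma-separated list.
d = dir(fullfile(folder, '*.txt'));
names = sort(string({d.name}));

% Build full paths by hand from the folder and the names.
paths = string(folder) + filesep + names;

rmdir(folder, 's');
```

The struct array, the brace expansion and the manual path joining are the steps a pathlib-style object would fold into one call.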

Jason on 6 Jul 2023

I'm not deep into how code "should" be written, but more of a user who realizes theoretical papers in matlab and python. So just want to frame my comments in that light. I also want to give kudos to mathworks as I can't get away from MATLAB. The tool is extremely well done and well supported.

I would like to see the following:

- lose the brackets when assigning output arguments and make carriage returns end lines.

Example:

x, y = myFunc(temp, temp2)

- Add a way to do bulk comments

%* these are comments
   still commenting *%   

- add functionality to take highlighted code and immediately turn it into a function

- make it simpler to write code with variable arguments. For example, we have several ways right now to do name-value pairs, one of which cannot do autofill and one that can. Having the default values within the function definition is nice in Python, and somehow that approach does autofilling without any of the extra code that Matlab requires. When I say autofill, I mean you hit tab in the argument spot and it gives you a list of options.

-native arduino or generic microcontroller support

-native AI code writing support like co-pilot

-improve the symbolic toolbox. Mathematica kills you guys here.
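
On the bulk-comment item in the list above: MATLAB already has a block-comment form, though with different delimiters from the ones proposed. A minimal sketch, noting that %{ and %} must each sit alone on their own line:

```matlab
%{
these are comments
still commenting
%}
x = 1;   % code after the block runs normally
```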

Andrew Janke on 30 Sep 2022

Cool, thanks!

Rik on 30 Sep 2022

I just added this thread to the list of 'where to post' discussion threads. At 41 answers as of today it is getting pretty large already, so I think a second thread will soon be a good idea.

men8th on 30 Sep 2022

My goodness, the IDE can be annoying sometimes. What's missing...

I use the editor undocked. Please can we have the capability to display a watchlist of variables in a panel in the editor. Also, you should be able to right click on a watched variable and set a breakpoint to halt when the value changes or some user specified conditional relating to that variable is satisfied. Basically, please can we have the MS Visual Studio watchlist.
The call stack display in the editor is absolutely useless if the call stack is deep, which it often is with OOP. Can't we have this as a proper list? Having to open and re-open a tiny dropdown menu is hopeless. The horizontal list that you get with the live editor is also useless if the stack is deep. It needs to be a list which you can pin open, and where you click on it to move the stack frame. I routinely resort to using dbstack at the command line to get round this, but then clicking the output from dbstack doesn't move the stack frame so it is only half useful. Also, because the output from dbstack moves off the screen when you enter other commands and has to be regenerated to stay up-to-date, it's hard to mentally "keep your finger in the pages of the book where you want to go back to" when you are concentrating hard.
Finally, and this is a big ask I'm sure, can we have the capability to drag the instruction pointer during debugging and also modify code on the fly when debugging.

Turlough Hughes on 23 Aug 2022

When using the debugger, I would love to have a button to Step (run the next line) and display output regardless of the ; being there or not.

1 Reply

Image Analyst on 23 Aug 2022

What I do in cases like that is to highlight the line up to the semicolon and then hit F9 to execute it.

Gregory Warnes on 21 Jun 2022

Allow (prefer?) use of square brackets for indexing into arrays:

A[1:10,1:10]

Gregory Warnes on 21 Jun 2022

Documentation on how to change the default size of figures in Live Scripts.

Gregory Warnes on 12 May 2022

Extend Find/Replace regular expression support to include substitution of matched elements from 'find' into 'replace', so that one can do things like:

Find: (call\(\w+ *, \w+, *)(\w+ *)\)

Replace: \1uint16(\2))

and accomplish the transformation

call(a, b, c)
call(d, e, f)

to

call(a, b, uint16(c))
call(d, e, uint16(f))
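
Outside the Editor's Find/Replace dialog, the same transformation can be sketched with regexprep, which already supports captured tokens (written $1 and $2 rather than \1 and \2). The pattern below is a slightly tightened variant chosen for this example, not the exact expression from the post.

```matlab
src = {'call(a, b, c)'; 'call(d, e, f)'};

% Token 1: everything up to the third argument. Token 2: the third argument.
pattern = '(call\(\w+ *, *\w+ *, *)(\w+) *\)';

% Wrap the third argument in uint16(...) and restore the closing parenthesis.
out = regexprep(src, pattern, '$1uint16($2))');
```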

Gregory Warnes on 11 May 2022

Please unify/combine the Matlab coder (`ceval` and friends) and C API (`calllib` and friends) to remove the need to double-code all C calls in code that needs to be run by the interpreter and processed by coder.

For example, I currently have a device driver where every c library call looks like:

if coder.target("MATLAB")
    [status, ~, val] = calllib( ...
        'libFoo', ...
        'foo_get_correction', ...
        obj.foo.device, ...
        foo.str2ch(obj.module), ...
        'FOO_ENUM_STRING', ...
        val ...
        );
else
    status = int32(0);
    val = int16(0);
    enum_val = foo_enuminfo('foo_correction').FOO_CORR_GAIN;
    status = coder.ceval( ...
        'foo_get_correction', ...
        obj.foo.device, ...
        foo.str2ch(obj.module), ...
        enum_val, ...
        val ...
        );
end

Even better would be a tool that automatically generates a wrapper from a (well-formed) C/C++ header file, that can be customized by the user, and that is compatible with both interpreted use (coder.target("MATLAB")) and compiled/embedded use (~coder.target("MATLAB")).

James Strieter on 25 Mar 2022

One thing that I love about the way MATLAB has evolved over the 20+ years I've been using it is the way you keep adding modern features while keeping the fast matrix operations. Beautiful plotting built in helps a lot too. Like a lot of other people, I've said, "I'll use Numpy because it's free," and then 8 hours later I'm like this would take 5 minutes in MATLAB. And then I do it in MATLAB and it's done. I love that. Here are my favorite features from other languages that could be added, probably without breaking anything:

Haskell's guards and list comprehensions,
Lazy containers,
LISP keywords,
LISP style maps, in which the :keyword-with-hyphens is also a function that retrieves data from an object,
Python's convention of defining ```__str__(self)``` to mean "This is what happens when you cast to a string," ```__int__``` for "This is what happens when you cast to an int," etc. Optional methods that support every kind of cast you could want.
More modern kinds of loops. ```for i in <arbitrary_container>``` for example. Whether the loop is executed in any particular order depends on whether the container has any kind of order, etc.
An API for defining language extensions. This would allow the community to experiment with new language features, making it cheaper & easier for Mathworks to see which language features gain traction. Mathworks would always have the option to include the most popular language extensions in a future release.

Bjorn Gustavsson on 24 Mar 2022

Replace the pinv function with a function tikhonov that defaults to the Moore-Penrose generalized inverse without regularization-parameter, zeroth-order Tikhonov with a second scalar input for the regularization-parameter and a L-th-"order" Tikhonov-regularization with a third-order L-matrix.

The argument for this compatibility-break is that it would force users of pinv to think about what they've done and why, and let them consider the more general and preferable regularized solutions than the M-P inverse.
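
As a rough illustration of the difference being argued for, the sketch below compares the Moore-Penrose solution with a zeroth-order Tikhonov solution. The tikhonov function itself is hypothetical; alpha, L and the test system are assumptions for this example, and the normal-equations form is used only for brevity.

```matlab
A = [1 2; 3 4; 5 6];      % made-up overdetermined system
b = [1; 2; 3];

% No regularization: the Moore-Penrose solution, as pinv gives today.
x_mp = pinv(A) * b;

% Zeroth-order Tikhonov: L is the identity, alpha the regularization parameter.
alpha = 0.1;
L = eye(size(A, 2));
x_tik = (A' * A + alpha^2 * (L' * L)) \ (A' * b);
```

Passing a different L (for example a difference operator) in the same expression gives the higher-order variants mentioned above.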

CFDesign on 18 Feb 2022

Clean inconsistencies, or counter-intuitive behaviours.

- [dr,dc] = size(data)
- dsize = size(data)
- [dr,dc] = dsize

1) works, but the combination of 2) and 3), which is intuitively the same, does not.

Improve consistency in general.
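
For reference, a minimal sketch of the current behaviour and one workaround, which goes through a comma-separated list (the variable names are only for illustration):

```matlab
data = zeros(3, 4);
[dr, dc] = size(data);        % 1) works

dsize = size(data);           % 2) works on its own
% [dr2, dc2] = dsize;         % 3) errors: a variable cannot be unpacked this way

% Workaround: turn the vector into a comma-separated list first.
c = num2cell(dsize);
[dr2, dc2] = c{:};
```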

Tobias Held on 2 Feb 2022

A small but handy function that allows when in workspace to press a letter and automatically highlight the variable with this first letter.

1 Reply

Rik on 18 Feb 2022

This wouldn't break compatibility as far as I can see. Feel free to cross-post this here (and/or submit a feature request).

Image Analyst on 27 Jan 2022

I wish there was a way to undo Editor text changes to the max level possible. Clicking the little blue curved arrow 50 times to undo as much as possible seems excessive. I'd like an option where I could just back up all the way to the beginning immediately.

Sulaymon Eshkabilov on 27 Oct 2021

Another good and useful tool for students would be a built-in function to reset all changes made by a user in preferences and interface menu options back to the default. Students quite frequently make changes and have difficulty resetting their MATLAB menu panel and preferences.

Sulaymon Eshkabilov on 27 Oct 2021

One of the most common pitfalls for beginners is how to do memory allocation correctly, even though MATLAB automatically points out that memory allocation is necessary for [for .. end] and [while .. end] loops when the values from every iteration are being saved.

It would be great to have an additional built-in MATLAB function that detects where memory allocation is needed. If the user decides to employ it, he/she could just click OK on the proposed option, and all is done, like the filler options of the Live Script Editor.
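
For readers unfamiliar with the pitfall, a minimal sketch of the two patterns; the loop body is made up for the example.

```matlab
n = 1000;

% Growing the array on every iteration: the pattern MATLAB flags.
grown = [];
for k = 1:n
    grown(k) = k^2;
end

% Preallocated version: same result, one allocation up front.
squares = zeros(1, n);
for k = 1:n
    squares(k) = k^2;
end
```

The suggested tool would effectively insert the zeros(1, n) line automatically.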

Andrew Janke on 25 Oct 2021

I would like to see support for a more structured form of helptext, like Javadoc or Markdown, which could be used to produce richer documentation pages from the inline helptext in class and function source code.

Right now, the helptext is minimally-processed (in a loosey-goosey manner that I've never found formally specified anywhere) that supports basic references to other functions and classes, and definition of an Examples section. In doc for user-defined classes and functions, the helptext is rendered simply, mostly as-is in fixed-width font.

I'd like to be able to have an alternate helptext format that produced richer documentation output, which could be rendered as web pages with proportional font by default and support for various formatting, like section headers (maybe multi-level), fixed-width and demarcated code examples, hyperlinks, maybe even embedded images. It might also be nice to have some structuring that allowed you to specifically document the exceptions a method throws, maybe pre-and-postconditions, function arguments (for functions and methods which do not have arguments blocks that document the arguments separately), return values, and so on.

For methods and functions which have arguments blocks, I'd like to be able to add helptext on each of the arguments, in the manner in which one can put helptext on individual class properties, and have the help for all those arguments be automatically incorporated into the display of help <func> and doc <func>. That auto-generated documentation should also include representations of any declarative type & value constraints and default values that are defined for those arguments. Would be nice if arguments were expanded to include output arguments, so those could be documented as well (though I'm not sure how that would work in the case where one uses the same variable name as both an input and output argument).

I think Markdown, specifically GitHub Flavored Markdown (but maybe allowing arbitrary embedded HTML; I'm not sure), would be a nice format to do this "richer helptext" in. It's easy for most people to pick up, very readable in its source form (for people who are browsing the source code and reading the help there, and for the back-compatibility case where you want to use Matlab code written in the new format in an older version of Matlab), and supports most of the formatting controls I would like.

Maybe there should be a mechanism to use alternate formats for helptext.

One way you could do this in a flexible and even back-compatible manner would be to introduce a new %# pragma for specifying the format that helptext is in: something like %#<helpfmt:foo> where "foo" is the format of the helptext, like "markdown" for Markdown, "helptext" for the legacy Matlab helptext format, maybe "html" for arbitrary HTML, or "<whatever>" for a new structured Matlab documentation format, if you want to use that. For example:

%#<helpfmt:markdown>
%#<helpfmt:helptext>
%#<helpfmt:html>

If the pragma appears at the beginning of a block of helptext for a classdef, function, property, or so on, it would apply only to that one helptext block. If it appears at the beginning of a file, before the initial classdef or function line (or at the top of a Contents.m file), it should apply to all helptext in that file (and could be overridden by additional %#<helpfmt:...> pragmas on a per-block basis). Maybe there could even be some config file at the root of a source tree (that is, in the directory that goes on the Matlab path) to set the default helptext format for all files in a project/codebase.

It would maybe be nice if this supported some mechanism for linking to separate doco pages supplied by a user-defined Matlab library/project as separate HTML/Markdown/whatever files, that could be viewed in the Matlab doc browser, but have larger and richer content than is feasible to stick into embedded helptext comments, or doesn't make sense as the main help for a specific function or class.

You could even support user-defined custom helptext formats by allowing the "format" in %#<helpfmt:format> to be an arbitrary identifier (valid Matlab name), and provide a per-session hook to register user-defined handlers for custom formats. Like matlab.registerHelpfmtHandler('formatname', 'pkg.qualified.class.Name'), where pkg.qualified.class.Name is the name of a user-defined Matlab class that conforms to an interface (or maybe inherits from a specific abstract class) that Matlab defines for helpfmt processor/handlers. Maybe it should be an actual object instance, but I don't think that would play well with clear classes.

I've been playing around with something like this in my MlxShake project, but it's hard to implement decently without some built-in support from Matlab itself.

Walter Roberson on 18 Oct 2021

As part of a discussion https://www.mathworks.com/matlabcentral/answers/1450984-what-should-go-in-a-next-generation-matlab-x#comment_1788796 I hypothesized that:

If, hypothetically, a new assignment operator were created that allowed the user to manage

A = object_of_class_B

inside class B, something along the lines of

function target = assign(obj, target) %obj being the object of the class

then that could perhaps have some advantages.

But what should the semantics be ? What would the use-cases be?

such a thing could potentially make resource tracking easier
there might be reason to warn about assigning between unlike data types. For example if A were uint8 but class B carried int8 then you might want a warning about negative values being truncated
not sure what else...

If such an operator existed, you would need a way to distinguish the case where the target was a location that did not exist yet.

Hypothetically that could be handled by nargin < 2 or exist('target', 'var') being false.

But hypothetically perhaps there would be reasons to instead associate each name with a class such as UnassignedLocation, and then isa(target, 'UnassignedLocation')

An existing target of an assignment should definitely be made available inside such a function, so that its datatype can be examined, and resources poked around at.

There is commentary somewhere along the lines that if the target of an assignment is a class name or static method of a class, then the class cannot have influence on what the assignment means: that otherwise the statement

A = B

could change its meaning if a new class A were introduced. I think the implication of that is that there should not be an operator introduced that intercepted assignments onto a class. But possibly I have overlooked some reason why the kind of assignment operator I describe here should not be created.

Walter Roberson on 13 Oct 2021

The ability to assign a subset of fields to a struct (array) would be useful. It is common to want to change a few settings, such as in a user initialization file, or to have a function that is concerned with getting only a subset of properties from the user. There thus might be a struct of updates to be applied to an existing struct. At the moment you have to loop through the fieldnames of the update struct, setting the fields of the existing struct one by one.

The ability to concatenate or assign between structs with the same fields in different orders would be useful. We have the experience of tables to look at: tables re-order as necessary to match the first order.
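
The loop described above looks roughly like this today; the struct contents are invented for the example.

```matlab
settings = struct('color', 'blue', 'width', 2, 'visible', true);
updates  = struct('width', 5, 'color', 'red');   % a subset, in a different order

% Apply each update field to the existing struct, one by one.
names = fieldnames(updates);
for k = 1:numel(names)
    settings.(names{k}) = updates.(names{k});
end
```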

1 Reply

Andrew Janke on 18 Oct 2021

+1. struct and object "subset of fields" assignment or "merging" like this is such a common use case in the sort of code that I work with that any nontrivial code base typically ends up with a half dozen different custom helper functions for doing this, each with slightly different behavior.

Bjorn Gustavsson on 12 Oct 2021

Allow elementary mathematical operations on function-handles. So instead of writing the sum of two functions as:

f1 = @(x) x.^2;
f2 = @(y) cos(y);
f_sum = @(x,y) f1(x) + f2(y);

It would be allowed to do:

f1 = @(x) x.^2;
f2 = @(y) cos(y);
f_sum = f1 + f2;

With the same resulting f_sum. Sure some design-choices would have to be made, but I can see benefits with such capability.
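
Until something like this exists, one way to approximate it is a small combinator. This is only a sketch: addfun is a hypothetical helper, and it makes one of the design choices mentioned above by feeding both functions the same argument, unlike the two-argument f_sum in the first snippet.

```matlab
% Hypothetical helper: add two single-argument function handles.
addfun = @(f, g) @(x) f(x) + g(x);

f1 = @(x) x.^2;
f2 = @(y) cos(y);
f_sum = addfun(f1, f2);    % behaves like @(x) x.^2 + cos(x)

val = f_sum(0);            % 0^2 + cos(0)
```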

Ravi Narasimhan on 11 Oct 2021

Here are some low-falutin' features I'd like to see. My perspective: Periodic Matlab user who mostly likes the tool but has frustrations all the same.

1) Different data types having access to commonly used operators like ==, <=, etc.

2) A good general purpose data container. Cells with their smooth/curly braces and different operators are very confusing when writing code and even more so when revisiting it weeks or months later. e.g.

T = readtable('patients.dat'); % Tables are a great addition
[T.Age < 30].' % This works because I can compare numbers with <, ==, >, etc.
ans = 1×100 logical array
   0   0   0   0   0   0   0   0   1   0   0   0   1   0   0   0   0   1   0   0   0   0   0   0   1   1   0   1   0   0   0   0   1   0   0   0   0   0   0   0   0   0   0   0   0   0   0   0   0   0
T(T.Gender == 'Female',:); % but not chars or strings apparently
Operator '==' is not supported for operands of type 'cell'.

3) Better error messages. This isn't 1986 Unix or 1993 Microsoft anymore. What operators are available? doc cell doesn't list them. At minimum provide a pointer/link to someplace in TFM where such a list IS available.

4) Reduce the amount of web searching required to find answers. e.g. subsetting a table. Best I could find was to make T.Gender a categorical or use strcmp to make the comparison. Both nonobvious, the latter especially because strings are doublequotes whereas single quotes are used for chars (See 1).

T(strcmp(T.Gender,'Female'),:);

The above works but since there's an error before it, I can't run this fragment by itself in this tool.

5) Dictionaries can be useful. Renaming containers.Map would be fine by me. Just be clear in the docs what datatypes can be keys.

And so on.
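
A self-contained sketch of the two workarounds mentioned in point 4, using a small made-up table in place of patients.dat:

```matlab
% Made-up stand-in for the patients table.
T = table({'Female'; 'Male'; 'Female'}, [25; 41; 33], ...
    'VariableNames', {'Gender', 'Age'});

% cellstr column: == is not defined, so strcmp is needed.
females_strcmp = T(strcmp(T.Gender, 'Female'), :);

% After converting the column to categorical, == works directly.
T.Gender = categorical(T.Gender);
females_cat = T(T.Gender == 'Female', :);
```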

Massimiliano Zanoli on 11 Oct 2021

Very basic consistency points that currently defy my comprehension:

Default everything to the 1st dimension (i.e. columns are default, not rows). Such as 1:3 should give [1 2 3].' and not [1 2 3].
Then follow the dimensions in order (everything scales accordingly). In such way you can drop all those nd doppelgängers...
Suppress the minimum of 2 dimensions and drop this 2D matrix "shortcut" (for instance repmat(1, 2) should give [1 1].' and not [1 1 ; 1 1]).
Make the order of axes X, Y, Z not Y, X, Z as it is now for *some* functions, but not for others. In my humble opinion MATLAB should follow maths, not CRTs...
do ... while ???
Correspondence between MATLAB's online help and the "help" for each function. Formatted help for custom functions.
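
For reference, a short sketch of the current behaviour behind some of these points, plus the usual way to emulate do ... while today (the loop condition is made up):

```matlab
r = 1:3;                 % a row vector today
m = repmat(1, 2);        % 2-by-2 today, not 2-by-1
sz_r = size(r);          % [1 3]
sz_m = size(m);          % [2 2]

% do ... while emulation: run the body once, then test the condition.
n = 0;
while true
    n = n + 1;           % loop body
    if ~(n < 3)          % the "while" condition, checked at the end
        break
    end
end
```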

Less urgent but kind of:

utf-8 as standard.
Easier and possibly native handling of large constant datasets in parallel working (parallel.pool.constant().... really?).
Ability to assign different GPUs to different workers.
Type check for variables? So much time could be spared when the compiler can check and warn about what you are feeding a function... but there are pros and cons.

All the best!

/Max

Chetan Bhavsar on 8 Oct 2021

An option to Show Inport name to left side instead of below.

and Outport name to right side instead of below.

To avoid this weird look in case we have more than 20,000 inputs.

Chad Greene on 6 Oct 2021

If I could design Matlab from scratch I'd

get rid of semicolons to suppress output, and
make element-wise operations the default, rather than having to specify .*, ./, and .^ for the operation that most people want to do most of the time.
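
A two-line reminder of the distinction in question, with made-up matrices:

```matlab
A = [1 2; 3 4];
B = [5 6; 7 8];

matrix_product  = A * B;    % linear-algebra product: [19 22; 43 50]
element_product = A .* B;   % element-wise product:   [5 12; 21 32]
```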

James Tursa on 6 Oct 2021

Symmetric variables and Hermitian variables. MATLAB could implement bit flags in the mxArray header to indicate this and they could propagate through operations and function calls when appropriate. This could make symmetric tests easier/faster and background functions could take advantage of this. Also provide mex functions access to these flags.
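
Today the property has to be tested at run time by inspecting the matrix entries, for example with issymmetric and ishermitian. The sketch below shows those tests; the proposed header flags would make the answer available without such a check. The matrices are made up.

```matlab
M = magic(4);
S = M + M';                     % symmetric by construction
C = [1, 2+3i; 4-1i, 5];
H = C + C';                     % Hermitian by construction

tf_sym  = issymmetric(S);       % true
tf_herm = ishermitian(H);       % true
tf_not  = issymmetric(M);       % false: magic(4) is not symmetric
```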

Chad Greene on 6 Oct 2021

I'd like to define ranges using square brackets for inclusive and rounded brackets for exclusive. So instead of

if x>=20 & x<30 
    disp 'x is in the twenties'
end

I'd introduce another comparison operator, say #, to look like this:

if x # [20 30)
    disp 'x is in the twenties'
end

With this new syntax perhaps we could eliminate the all-too-common usage of elseif forever. Because in my opinion, elseif tends to produce error-prone and unreadable code like this:

if x<0
    disp 'x is negative' 
elseif x==pi
    disp 'x is pi' 
elseif x>=20 & x<30 
    disp 'x is in the twenties'
elseif x>=30 & x<40 
    disp 'x is in the thirties'
else
    disp 'x might be a hundred'
end

The code above is the cleanest, simplest version I can come up with to illustrate the difficulty of following the logic of a series of elseif statements, but in practice it tends to be much more difficult to parse, because it's usually cluttered with longer variable names or more complicated logic.

With the bracket syntax I'm suggesting, switch could be adapted to accept ranges like this:

switch x 
    case <0
        disp 'x is negative'
    case pi
        disp 'x is pi'
    case [20 30)
        disp 'x is in the twenties'
    case [30 40)
        disp 'x is in the thirties'
    otherwise 
        disp 'x might be a hundred'
end

Isn't that so much nicer?
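
For comparison, something close to this can be written in current MATLAB with the switch true idiom. It is a sketch that assumes x is a scalar; msg is a name introduced for the example.

```matlab
x = 25;
switch true
    case x < 0
        msg = 'x is negative';
    case x == pi
        msg = 'x is pi';
    case x >= 20 && x < 30
        msg = 'x is in the twenties';
    case x >= 30 && x < 40
        msg = 'x is in the thirties';
    otherwise
        msg = 'x might be a hundred';
end
```

It keeps the switch layout but still spells out both bounds of each range, which is the part the bracket syntax would remove.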

1 Reply

Chad Greene on 6 Oct 2021

Taking this one step further, multiple switch inputs:

switch x,y
    case >0,<0
        'The point x,y is in the lower right quadrant.'
    case >0,>0
        'The point x,y is in the upper right quadrant.'
    case <0,>0
        'The point x,y is in the upper left quadrant.'
    case <0,<0
        'The point x,y is in the lower left quadrant.'
    case 0,0
        'The point x,y is at the origin.'
    otherwise 
        'The point x,y cannot be found on a cartesian plot.'
end

Andrew Janke on 30 Sep 2021

Remove the length function.

Its behavior of "size along the longest dimension, picked at run time" is a little weird, most junior programmers don't expect it, and it leads to subtle bugs that can silently produce incorrect results instead of erroring out. In my 15 years of Matlab programming experience, I've seen so many people call length, and I've never seen one who actually wanted what length does instead of numel or size.

Let everyone just use numel or size instead; those work "safely".
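
A small sketch of the surprise, using two made-up arrays with the same number of rows:

```matlab
wide = zeros(3, 10);
tall = zeros(3, 2);

len_wide = length(wide);    % 10: the column count is the longest dimension
len_tall = length(tall);    % 3: now the row count is

cols_wide = size(wide, 2);  % 10
cols_tall = size(tall, 2);  % 2
n_tall    = numel(tall);    % 6
```

Code that uses length to mean "number of columns" silently switches meaning when the data has fewer columns than rows.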

gwoo on 30 Sep 2021

I would also like:

auto-complete options on inputs to custom functions
specified type of arguments such that if an argument is supposed to be a filename or path, then it would allow you to autocomplete a path the way imread() and dir() do, but for custom functions
keyword arguments (is that already a thing?) like in python, instead of all arguments being "equal" and having to parse out
functional programming features such as in-line loops, if statements, direct indexing into function outputs (without an intermediate variable explicitly created).

Tobias Held on 30 Sep 2021

Dark theme
Standard font with distinguishable lI1, 0O etc. (eg. FiraCode, Input)

Munin on 29 Sep 2021

An LSP for other IDEs, better documentation of the Python engine, easier install of MEfP using some kind of shell script or dep manager, and a modern IDE UI supporting dark theme.

Also, all components like Coder require support in MATLAB's licensing scheme so that they are usable in CI etc.

gwoo on 29 Sep 2021

I don't know the technical name for it, but being able to call methods, properties, or indexing without having to make a new variable first. Kind of like in Python, where you can call a function that will output an array or whatever and, instead of saving it to a variable first and then indexing, you can just index right off the end of the function call. I know you can do this for strings and structs, but not for cells or arrays. Also, being able to perform a series of functions on an array, the way you can now with strings.

For example:

[5, 1, 2](2) = 1
horzcat([3;2;1], [5;6;7])(3,2) = 7
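
A common stopgap in current MATLAB is a tiny indexing helper. The name at is hypothetical, introduced only for this sketch:

```matlab
% Hypothetical helper: index into the result of any expression.
at = @(a, varargin) a(varargin{:});

v1 = at([5, 1, 2], 2);                          % second element
v2 = at(horzcat([3; 2; 1], [5; 6; 7]), 3, 2);   % row 3, column 2
```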

Image Analyst on 23 Sep 2021

I'd like an easier way to enter 2-D matrices interactively. The current way with inputdlg() or input() is not WYSIWYG and is very clunky and non-intuitive (do I put brackets, parentheses, commas, semicolons - no clue!). We need something like

% Pop up a modal dialog box with a 4 by 5 grid (worksheet) where users can enter values:
m = inputmatrix('Enter your values', 4, 5);

Aik-Siong Koh on 23 Sep 2021

I would suggest MATLAB learn how to implement Pure Object Oriented Programming from Smalltalk.

Pure OOP embodies the following fundamentals:

Everything is an object all the time.
Every operation is through message passing.

Pure OOP enables the following capabilities:

Environment is running and alive all the time.
Run everywhere, Inspect everywhere, Debug everywhere, Edit everywhere.
Entire environment object is saved to disk for fast reload.

Andrew Janke on 21 Sep 2021

Personally, I think that the Answers format is particularly well suited to this sort of discussion, because instead of a linear threading model like a regular forum, it allows people to post various suggestions as top-level Answers and to have them be voted on to indicate community interests, and to let each of those suggestions have their own discussion thread hanging off it.

Walter Roberson on 20 Sep 2021

Benjamin : you flagged this as Not Appropriate for MATLAB Answers . However, it is a classic discussion that fits in well, similar to existing questions such as https://www.mathworks.com/matlabcentral/answers/1325-what-is-missing-from-matlab from early 2011.

Walter Roberson on 19 Sep 2021

This would not break backwards compatibility, but something to consider:

A lot of the time, people try to

for x = first:increment:last

with non-integer increment. And then they want to

f(x) = value;

but of course x is non-integer so that fails.

There are standard ways of rewriting this: the common

    counter = 1;
    for x = first:increment:last
      f(counter) = value;
      counter = counter + 1;
    end

or (less likely by far, but cleaner since counter is more sensible)

    counter = 0;
    for x = first:increment:last
      counter = counter + 1;
      f(counter) = value;
    end

or the formal and flexible

    xvals = first:increment:last;
    num_x = numel(xvals);
    f = zeros(1, num_x);
    for xidx = 1 : num_x
       x = xvals(xidx);
       f(xidx) = value;
    end

But... keeping those counters is a bit of a nuisance, and people get them wrong.

So I would suggest something I have seen in a couple of programming languages: that there be an accessible automatic counter. We could imagine, for example,

    for x = 0:.01:2*pi
      f(#x) = sin(x.^2 - pi/7);
    end

where the #x translates as "the number of x values we have processed so far".

Indexing a variety of arrays with the same # would be considered valid, so you could write

    for x = 0:.01:2*pi
      f(#x) = sin(x.^2 - phase(#x));
    end

But now we have a question that might lead to some backwards incompatibility: suppose we have

    for x = 0:.01:2*pi
      y = 0;
      for x = 1 : .5 : 5
        y = y + z.^(x-1)./gamma(x+1);
      end
      f(#x) = sin(x.^2 - y);
    end

and the question is: in that f(#x) that is after the nested for x, should the #x refer to

the last index associated with the inner x?
the index after the last one associated with the inner x?
the index associated with the outer x?

Consistency with existing nested for loops would say it should be the first of those, that at any point, this hypothetical #x should refer to the last for index for variable x that was encountered in the flow of execution -- just like the way that the sin(x.^2 - y) is going to use the last x value from the for x = 1 : .5 : 5 .

I would kind of like such an operator to be associated with the innermost enclosing loop so that in this example the f(#x) would be counting relative to the for x = 0:.01:2*pi loop, but I do admit that it would be confusing to have the #x refer to that loop at the same time that the x itself would be what was left over from the for x = 1: 0.5 : 5 loop. Also, in a context such as

    f = zeros(1,5000);
    for x = 0:.01:2*pi
      if x.^2 - sin(x) > 1; break; end
      f(#x) = acos(x);
    end
    f(#x+1:end) = [];

then it would make sense for the counter to survive the loop itself, which argues for the status quo of "last value assigned" rather than "according to scope". I think the factors are in tension here.

Now, if we are going to have automatic counters with for loops it might make sense to have automatic counters associated with while loops as well:

    x = 0;
    while x <= 2*pi & x.^2 - sin(x) < 1
       f(#???) = acos(x);
       x = x + 0.01;
    end

But while loops have no associated variable. So I might suggest

    x = 0;
    while x <= 2*pi & x.^2 - sin(x) < 1
       f(#) = acos(x);
       x = x + 0.01;
    end

where # by itself is the counter for the innermost enclosing for or while loop. Which would then permit

    for x = 0:.01:2*pi
      f(#) = sin(x.^2 - phase(#));
    end

which is not ambiguous. Now what about nested loops?

    for x = 0:.01:2*pi
      y = 0;
      for x = 1 : .5 : 5
        y = y + z.^(x-1)./gamma(x+1);
      end
      f(#) = sin(x.^2 - y);
    end

The innermost enclosing for or while loop would be the outer for x loop... the one the user probably intended in such a context.

With the discussion above about what #x means after the end of a for x loop, this proposed behavior of # by itself would lead to the possibility that at that point, assigning to f(#) would be assigning according to the loop counter for the outer for x, but that assigning f(#x) would be assigning according to the loop counter for the inner for x . That is not ideal for readability, and is likely to lead to confusion.

It seems to me that in some cases, people would want a #x at that point to refer to the outer loop, but people would also sometimes want a #x to refer to the inner for x . It would also not surprise me at all if people wanted both ways at the same time. Of course, if they wanted clarity and readability, they probably should not have used nested for loops with the same variable name !!!
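
To make the current behavior concrete, here is a small sketch of what MATLAB does today when two nested for loops share a variable name. The variables seen and outerCount are illustrative names of mine; the explicit counter is what a bare # would presumably replace.

```matlab
seen = zeros(1, 3);
outerCount = 0;              % explicit counter for the outer loop
for x = 1:3
    outerCount = outerCount + 1;
    for x = 10:10:20         % same name on purpose; overwrites x
    end
    seen(outerCount) = x;    % x holds the inner loop's last value here
end
disp(seen)                   % every entry is 20
disp(outerCount)             % 3: the outer loop still runs once per outer value
```

The outer loop keeps iterating over its own values even though the inner loop clobbers x, so the loop counter and the variable's value are already two separate things.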

Andrew Janke on 18 Sep 2021

Parallel array iteration!

Let's say I've got some arrays in variables x, y, and z, with the same number of columns.

I'd like to be able to say this:

for (x_i, y_i, z_i) = (x, y, z)
    % ... do stuff ...
end

Instead of this:

for i = 1:size(x,2)
    [x_i, y_i, z_i] = deal(x(:,i), y(:,i), z(:,i));
    % ... do stuff ...
end
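
Until something like that exists, one workaround that avoids the index variable is to pack the columns into a cell array and let for iterate over the cell array's columns. This is only a sketch with made-up data; num2cell copies the data, so treat it as a convenience and not as a performance trick.

```matlab
x = [1 2 3; 4 5 6];
y = [10 20 30];
z = [7 8 9];

% One row of cells per array, one column of cells per iteration.
cols = [num2cell(x, 1); num2cell(y, 1); num2cell(z, 1)];

totals = zeros(1, 0);
for c = cols                        % c is a 3-by-1 cell on each pass
    [x_i, y_i, z_i] = c{:};         % x_i is a 2-by-1 column; y_i and z_i are scalars
    totals(end+1) = sum(x_i) + y_i + z_i;
end
disp(totals)                        % 22 35 48
```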

Andrew Janke on 18 Sep 2021

Convenience thing:

The fieldnames function returns a string row vector instead of a cellstr column vector, so you can loop over a struct's fields with for fld = fieldnames(s) instead of for fld = string(fieldnames(s)'), which is uglier.
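
A quick sketch of why the conversion is needed in current MATLAB, using a made-up struct s: for iterates over columns, and fieldnames currently returns an N-by-1 cellstr, which has only one column.

```matlab
s = struct('alpha', 1, 'beta', 2);

% The N-by-1 cellstr has a single column, so this body runs once
% with the whole cell array in fld.
passes = 0;
for fld = fieldnames(s)
    passes = passes + 1;
end

% Converting to a string row vector gives one field name per pass.
total = 0;
for fld = string(fieldnames(s)')
    total = total + s.(fld);        % dynamic field reference with a string scalar
end
```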

Walter Roberson on 17 Sep 2021

Currently the model of MATLAB is that it always evaluates from left to right [*] finding the left-most unprocessed sub-expression and evaluating it, and then finding and evaluating the right hand side operand, and then performing the operation. The right operand is not processed until the left is evaluated, but unless the left operand results in an error, or the operation is && or || the right will always be evaluated.

[*] exception: there are some funky things with chains of ^ and .^ operators; they are not strictly left to right.

This behavior prevents there from being function forms of if/else operations -- there is no equivalent to C's ?: operation. In C, the unselected operation is not evaluated at all.

The hack work-arounds require embedding the work to be done inside an anonymous function and writing a function like

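function varargout = ifelse(expr, basepart, elsepart)
   % Return basepart if expr is true, otherwise elsepart. A branch given as
   % a function handle is called only when it is the selected branch.
   if expr
       if isa(basepart, 'function_handle')
          % Request as many outputs as the caller asked for (at least one).
          [varargout{1:max(nargout,1)}] = basepart();
       else
           varargout{1} = basepart;
       end
   elseif isa(elsepart, 'function_handle')
      [varargout{1:max(nargout,1)}] = elsepart();
   else
       varargout{1} = elsepart;
   end
end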

and using that gets ugly... and probably messes up multiple output processing.

Piecewise(x ~= 0, 0, 1./x)

can't be done and would have to look like

Piecewise(x ~= 0, 0, @(x)1./x)

I would like to see a cleaner way of handling this -- one in which the function being called does not need to know that a delayed evaluation is being done.
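
To show what the function-handle workaround buys and what it costs, here is a minimal sketch. lazyPick is a hypothetical single-output helper of my own, not an existing MATLAB function. Save it as a script; local functions in scripts need R2016b or later.

```matlab
v = [];
% Written eagerly, v(1) would be evaluated before the call and raise an
% indexing error. Wrapped in a handle, it is never evaluated here.
first = lazyPick(isempty(v), NaN, @() v(1));

w = [7 8 9];
firstW = lazyPick(isempty(w), NaN, @() w(1));   % else branch is called: 7

function out = lazyPick(cond, thenPart, elsePart)
    % Select a branch first, then evaluate it only if it is a function handle.
    if cond
        chosen = thenPart;
    else
        chosen = elsePart;
    end
    if isa(chosen, 'function_handle')
        out = chosen();
    else
        out = chosen;
    end
end
```

The caller has to remember the @() wrapper, and a helper like this cannot tell a delayed expression from a function handle that was meant to be returned as a value, which is part of why it feels like a hack.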

In the Maple programming language, there are two related mechanisms available. First, there is a simple syntax to delay evaluation. This is indicated by using ' ' around the expression. For example,

Piecewise(x <> 0, 0, '1/x')

In Maple, this is not a quoted string: Maple uses double-quotes for strings. Instead it is a delayed evaluation. Each time the relevant expression is evaluated, one level of unevaluation is removed; when it is eventually evaluated in a context where there are no remaining protective uneval() levels, then the expression is evaluated.

Secondly, Maple allows procedures (that is, functions) to declare a parameter as being of type "uneval", which has the effect of adding a layer of uneval around what is passed in. For example,

Piecewise := proc(x, basepart::uneval, elsepart::uneval) #stuff; end proc;

would permit users to code

Piecewise(x <> 0, 0, 1/x)

and the 1/x will not be evaluated before being passed in to the procedure.

Some programming languages deal with these kinds of issues by using "lazy evaluation". Something like

Piecewise(x <> 0, 0, 1/x)

would not evaluate any of the parameters until such time as the code inside Piecewise asked for their value -- so if the code logic did not ask for the value of a particular parameter, it would never be evaluated.

If I understand correctly, tall arrays (created with tall()) already do some delayed evaluation, building up expressions and then internally finding ways to reduce the memory access during evaluation.
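
As a small illustration of that deferred style, here is a sketch using tall on in-memory data. It assumes a release with tall arrays (R2016b or later); the variable names are only for illustration.

```matlab
t = tall((1:10)');       % tall array backed by in-memory data
m = mean(t);             % records the operation; nothing is computed yet
deferred = istall(m);    % true: m is still an unevaluated tall array
mv = gather(m);          % triggers the evaluation and returns an ordinary double
```

Here the evaluation is deferred for the whole expression and forced explicitly by gather, which is different from a called function deciding which of its arguments to evaluate.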

Matt J on 17 Sep 2021

My wish list:

(1) Colon operator produces column vectors, not row vectors.

(2) Optimization Toolbox solvers should have only one algorithm per solver, i.e., instead of,

x1=lsqnonlin(fun,x0,lb,ub, optimoptions(@lsqnonlin,'Algorithm','levenberg-marquardt'))
x2=fminunc(fun,x0, optimoptions(@fminunc,'Algorithm','trust-region'))

we would just have

x1=lsqnonlinLevMarq(fun,x0,lb,ub)
x2=fminuncTrustReg(fun,x0)
etc...

(3) The Image Processing and the Computer Vision Toolboxes would be designed around the coordinate conventions of ndgrid() instead of meshgrid().

(4) One-dimensional array types, i.e., with ndims(X)=1.
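
To make items (1), (3) and (4) concrete, this is a short sketch of how current MATLAB behaves; the variable names are only for illustration.

```matlab
r = 1:3;
sizeOfRange = size(r);            % [1 3]: the colon operator builds a row vector

c = (1:3)';
dimsOfColumn = ndims(c);          % 2: even a column vector is a 2-D array

% meshgrid puts the first input along columns, ndgrid along rows,
% so the two sets of grids are transposes of each other in 2-D.
[Xm, Ym] = meshgrid(1:3, 1:2);    % both 2-by-3
[Xn, Yn] = ndgrid(1:3, 1:2);      % both 3-by-2
sameUpToTranspose = isequal(Xm, Xn') && isequal(Ym, Yn');   % true
```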

Jan on 16 Sep 2021

A complete list of changes for each command.

The documentation already says "introduced in Rxy", but a record of changes to inputs and outputs would be very useful too. Examples: When did unique introduce the 'legacy' flag? When did strncmp change the behaviour for empty strings and n=0?

1 Reply

Andrew Janke on 16 Sep 2021

This would be useful.

No reason to wait until MATLAB X to start doing it though; MathWorks could add a per-function/class Changelog to the doco any time, I think!

Jim Svensson on 15 Sep 2021

Most important

Start indexing from 0
Redo package system
Improve the class system
Improve language a bit (like value += delta)

Andrew Janke on 15 Sep 2021

Oh, here's one!

Comments can begin with "#" in addition to "%".

This would enable Octave compatibility. But I think that might be to MathWorks's benefit: it would enable you to easily take existing Octave code and migrate your workloads to Matlab, which is the direction that MathWorks would like people to move.

Also enables use of "shebang" lines on Unix, so you could easily create executable commands as Matlab scripts.

Andrew Janke on 11 Sep 2021

A possibly radical one:

Semicolons are no longer needed to suppress display of a statement's result. Instead, output is suppressed by default, and if you do want it displayed, you append a "!" (or something else) to the end of the statement. Semicolons are now just statement separators, and you can omit them in most places with no effect.

Maybe this should apply only to function and classdef files, and statement result display is on by default in script files, and you still suppress its display by appending a ";" there.

Andrew Janke on 11 Sep 2021

Oh, thanks! Reading through that now.

Tucker Downs on 11 Sep 2021

Yes! I think in all established products it's occasionally necessary to do a major pruning of older functionality for the good of the product / ecosystem. In companies I've worked for we've done this and made plenty of announcements: "Your legacy code might not work in version!!!! but we have guides on how to change it / we will support old matlab for the next X (many many) years."

For the most part it's always been well received.

I'll add

max(2,[]) should not return []

++ incrementing

maps as a more prominent base data type

expose more internal APIs for making subclasses for plot objects, like custom arrows
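
For anyone who has not run into the first two items, here is a brief sketch of the current behavior; the variable names are only for illustration.

```matlab
r = max(2, []);              % the scalar expands against the empty, so r is empty
gotEmpty = isempty(r);       % true

m = max([2, []]);            % concatenating first drops the empty, so m is 2

counter = 0;
counter = counter + 1;       % the long form that ++ would abbreviate
```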

Walter Roberson on 11 Sep 2021

Andrew Janke on 11 Sep 2021

In MATLAB X, I would like to see:

An object display customization API like Python's __str__ and __repr__. (`disp` isn't suitable.) (See The Dispstr API)
In mixed-mode arithmetic (combining floats and ints), ints widen to floats instead of narrowing to ints.
Integer-looking literals (like 1234) produce ints instead of doubles.
Both single-quoted and double-quoted string literals produce string arrays; to get char arrays you need to explicitly call char(...).
Every function uses string arrays instead of char vectors or cellstrs in its return values, when not determined by the type of one of the inputs.
Figure handle properties use string arrays instead of char vectors.
In string literals, backslash escapes are interpreted by the string literal itself, and not by the *printf() functions.
import statements have file scope, not function scope.
Class properties with (1,1) string validators default to string(missing) instead of the empty string "".
There's a date-only localdate type to complement the date + time datetime type.
now() and today() return datetime and localdate values, instead of double datenums.
For that matter, pretty much every date or time returned by a function is a datetime or localdate instead of a double datenum.
Maybe classes and functions in the same package are visible by default, using unqualified names, instead of requiring package qualification or an import statement. (Though this is mostly handled if import gets file scope.)
The "`if false or true`" parsing quirk (where the stuff after "false" is considered the first statement inside the if block) is fixed, and the whole "false or true" is considered part of the if condition.
File IO is done OOP style, with fopen returning a file object instead of a numeric handle.
UTF-8 becomes the default encoding for all external text IO on all platforms.
A revamped helptext system for embedding somewhat-formatted, somewhat-structured API reference documentation in source code. The existing helptext format is too simple and loosey-goosey.
Maybe chars should become Unicode code points instead of UTF-16 code units, and strings and chars should be stored in Python-style "flexible-width string" format. Would save memory, and make it easier to work with emoji or exotic scripts.
The GUI Layout Toolbox's functionality is pulled in to base Matlab, including support for relative positioning and sizing of widgets (like how Java Swing layouts work), and relative positioning layouts become the default (instead of 'normalized' or absolute-units positioning like it is now).
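
For reference, a short sketch of the current behavior that a few of the items above would change; the variable names are only for illustration.

```matlab
a = int8(5) + 2.7;                 % mixed arithmetic narrows: a is int8(8)
classOfA = class(a);               % 'int8'
b = int8(100) + 100;               % saturates at intmax('int8'), which is 127

classOfLiteral = class(1234);      % 'double', even though it looks like an integer

classSingle = class('abc');        % 'char'
classDouble = class("abc");        % 'string'

n1 = strlength("a\nb");            % 4: the literal keeps backslash and n as two characters
n2 = strlength(sprintf("a\nb"));   % 3: sprintf is what interprets the escape
```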

Things I do not want to see:

Multithreading.
