【问题标题】:Boost::spirit::qi parser not consuming entire stringBoost::spirit::qi 解析器不消耗整个字符串
【发布时间】:2014-08-26 20:55:27
【问题描述】:

我正在为一个简单的计算器创建语法,但我无法确定一个特定测试用例不起作用的原因。这是我的解析器的一个功能示例:

#include <iostream>
#include <vector>
#include <string>
#include <boost/spirit/include/qi.hpp>
#include <boost/spirit/include/qi_char.hpp>
#include <boost/spirit/include/qi_parse.hpp>
#include <boost/spirit/include/phoenix_bind.hpp>
using namespace boost::spirit;
using namespace boost::phoenix;
using std::endl;
using std::cout;
using std::string;
using std::vector;

void fPushOp(const string& op){
  cout << "PushOp: " << op << endl;
}

void fPushInt(string& my_str){
  cout << "PushInt: " << my_str << endl;
}

template<class Iterator>
struct Calculator : public qi::grammar<Iterator> {

    qi::rule<Iterator>  
      expression, logical_or_expression, logical_and_expression, negate_expression, series_expression,
      single_expression, inclusive_or_expression, exclusive_or_expression, and_expression, equality_expression, 
      relational_expression, shift_expression, additive_expression, multiplicative_expression, 
      term, complement_factor, factor, number, integer, variable, variable_combo, word, result;

    Calculator() : Calculator::base_type(result)
    {
          number = 
              lexeme[
                qi::as_string[
                    ("0x" >> +qi::char_("0-9a-fA-F"))     
                  | ("0b" >> +qi::char_("0-1"))
                  | ("0" >>  +qi::char_("0-7"))
                  | +qi::char_("0-9")
                ] [bind(&fPushInt, qi::_1)]
              ] 
             ;

          complement_factor = number
              | ('~' >> number)[bind(&fPushOp, "OP_COMPLEMENT")]
              | ('!' >> number)[bind(&fPushOp, "OP_NEGATE")];
              ;
          term = complement_factor
            >> *( (".." >> complement_factor)[bind(&fPushOp, "OP_LEGER")]
                | ('\\' >> complement_factor)[bind(&fPushOp, "OP_MASK")]
                ); 
          multiplicative_expression = term
            >> *( ('/' >> term)[bind(&fPushOp, "OP_DIV")]
                | ('%' >> term)[bind(&fPushOp, "OP_MOD")]
                | ('*' >> term)[bind(&fPushOp, "OP_MUL")]
                );
          additive_expression = multiplicative_expression
            >> *( ('+' >> multiplicative_expression)[bind(&fPushOp, "OP_ADD")]
                | ('-' >> multiplicative_expression)[bind(&fPushOp, "OP_SUB")]
                );
          shift_expression = additive_expression
            >> *( (">>" >> additive_expression)[bind(&fPushOp, "OP_SRL")]
                | ("<<" >> additive_expression)[bind(&fPushOp, "OP_SLL")]
                );
          relational_expression = shift_expression
            >> *( ('<' >> shift_expression)[bind(&fPushOp, "OP_LT")]
                | ('>' >> shift_expression)[bind(&fPushOp, "OP_GT")]
                | ("<=" >> shift_expression)[bind(&fPushOp, "OP_LET")]
                | (">=" >> shift_expression)[bind(&fPushOp, "OP_GET")]
                );
          equality_expression = relational_expression 
            >> *( ("==" >> relational_expression)[bind(&fPushOp, "OP_EQ")]
                | ("!=" >> relational_expression)[bind(&fPushOp, "OP_NEQ")] 
                );
          and_expression = equality_expression 
            >> *(('&' >> equality_expression)[bind(&fPushOp, "OP_AND")]); 
          exclusive_or_expression = and_expression 
            >> *(('^' >> and_expression)[bind(&fPushOp, "OP_XOR")]); 
          inclusive_or_expression = exclusive_or_expression 
            >> *(('|' >> exclusive_or_expression)[bind(&fPushOp, "OP_OR")]); 
          single_expression = inclusive_or_expression;
          series_expression = inclusive_or_expression 
            >> *((',' >> inclusive_or_expression)[bind(&fPushOp, "OP_SERIES")]);
          logical_and_expression = series_expression
            >> *(("&&" >> series_expression)[bind(&fPushOp, "OP_LOGICAL_AND")]); 
          logical_or_expression = logical_and_expression 
            >> *(("||" >> logical_and_expression)[bind(&fPushOp, "OP_LOGICAL_OR")]);
          expression = logical_or_expression;

          result = expression;
    }
};

int main(){
  Calculator<string::const_iterator> calc;
  const string expr("!3 && 0,1");
  string::const_iterator it = expr.begin();
  parse(it, expr.end(), calc, qi::space);
  cout << "Remaining: " << (string(it,expr.end())) << endl;

  return 0;
}

预期的输出如下:

PushInt: 3
PushOp: OP_NEGATE
PushInt: 0
PushInt: 1
PushOp: OP_SERIES
PushOp: OP_LOGICAL_AND
Remaining: 

expr!3 &amp;&amp; 0,1 时的当前输出似乎表明&amp;&amp; 0,1 没有被消耗:

PushInt: 3
PushOp: OP_NEGATE
Remaining:  && 0,1

如果expr!3&amp;&amp;0,1,那么它工作得很好。在调用qi::parse 时使用qi::space 跳过程序,我看不出这两个字符串的不同之处。谁能指出我的问题?

【问题讨论】:

    标签: c++ boost boost-spirit-qi


    【解决方案1】:

    您的规则没有声明船长:

    qi::rule<Iterator>  
    

    因此,它们是隐含的lexemes。有关与船长相关的 lexeme[] 的背景信息,请参阅 Boost spirit skipper issues

    正确应用船长

    • 你需要在语法和规则定义中声明skipper

      template<class Iterator, typename Skipper = qi::space_type>
      struct Calculator : public qi::grammar<Iterator, Skipper> {
      
          qi::rule<Iterator, Skipper>  
            expression, logical_or_expression, logical_and_expression, negate_expression, series_expression,
            single_expression, inclusive_or_expression, exclusive_or_expression, and_expression, equality_expression, 
            relational_expression, shift_expression, additive_expression, multiplicative_expression, 
            term, complement_factor, factor, result;
      
          qi::rule<Iterator>  
            number, integer, variable, variable_combo, word;
      
    • 在传递skipper类型的实例时需要使用phrase_parse

      phrase_parse(it, expr.end(), calc, qi::space);
      

    固定代码

    补充说明:

    • 清理了包含(最好包含完整的phoenix.hpp,因为如果您缺少细微的位,您被莫名其妙的错误所困扰。当然,如果您知道哪些位,请随时通过选择性地包含子标题来减少编译时间)
    • 强烈建议反对using namespace,除非绝对必要。在这种情况下,您很容易在bind 的众多品牌之一之间引起混淆。而且,不,只是说using boost::phoenix::ref 是不够的,因为

      using boost::phoenix::ref;
      std::string s;
      bind(foo, ref(s))(); 
      

      由于 ADL,最终使用 std::ref,而不是 boost::phoenix::ref

    #include <iostream>
    #include <string>
    #include <boost/spirit/include/qi.hpp>
    #include <boost/spirit/include/phoenix.hpp>
    namespace qi = boost::spirit::qi;
    namespace phx = boost::phoenix;
    
    void fPushOp(const std::string& op){
        std::cout << "PushOp: " << op << std::endl;
    }
    
    void fPushInt(std::string& my_str){
        std::cout << "PushInt: " << my_str << std::endl;
    }
    
    template<class Iterator, typename Skipper = qi::space_type>
    struct Calculator : public qi::grammar<Iterator, Skipper> {
    
        qi::rule<Iterator, Skipper>  
          expression, logical_or_expression, logical_and_expression,
            negate_expression, series_expression, single_expression,
            inclusive_or_expression, exclusive_or_expression, and_expression,
            equality_expression, relational_expression, shift_expression,
            additive_expression, multiplicative_expression, term,
            complement_factor, factor, result;
    
        qi::rule<Iterator>  
            number, integer, variable, variable_combo, word;
    
        Calculator() : Calculator::base_type(result)
        {
            number = 
                qi::lexeme[
                  qi::as_string[
                      ("0x" >> +qi::char_("0-9a-fA-F"))     
                    | ("0b" >> +qi::char_("0-1"))
                    | ("0" >>  +qi::char_("0-7"))
                    | +qi::char_("0-9")
                  ] [phx::bind(&fPushInt, qi::_1)]
                ] 
               ;
    
            complement_factor = number
                | ('~' >> number)[phx::bind(&fPushOp, "OP_COMPLEMENT")]
                | ('!' >> number)[phx::bind(&fPushOp, "OP_NEGATE")];
                ;
            term = complement_factor
              >> *( (".." >> complement_factor)[phx::bind(&fPushOp, "OP_LEGER")]
                  | ('\\' >> complement_factor)[phx::bind(&fPushOp, "OP_MASK")]
                  ); 
            multiplicative_expression = term
              >> *( ('/' >> term)[phx::bind(&fPushOp, "OP_DIV")]
                  | ('%' >> term)[phx::bind(&fPushOp, "OP_MOD")]
                  | ('*' >> term)[phx::bind(&fPushOp, "OP_MUL")]
                  );
            additive_expression = multiplicative_expression
              >> *( ('+' >> multiplicative_expression)[phx::bind(&fPushOp, "OP_ADD")]
                  | ('-' >> multiplicative_expression)[phx::bind(&fPushOp, "OP_SUB")]
                  );
            shift_expression = additive_expression
              >> *( (">>" >> additive_expression)[phx::bind(&fPushOp, "OP_SRL")]
                  | ("<<" >> additive_expression)[phx::bind(&fPushOp, "OP_SLL")]
                  );
            relational_expression = shift_expression
              >> *( ('<' >> shift_expression)[phx::bind(&fPushOp, "OP_LT")]
                  | ('>' >> shift_expression)[phx::bind(&fPushOp, "OP_GT")]
                  | ("<=" >> shift_expression)[phx::bind(&fPushOp, "OP_LET")]
                  | (">=" >> shift_expression)[phx::bind(&fPushOp, "OP_GET")]
                  );
            equality_expression = relational_expression 
              >> *( ("==" >> relational_expression)[phx::bind(&fPushOp, "OP_EQ")]
                  | ("!=" >> relational_expression)[phx::bind(&fPushOp, "OP_NEQ")] 
                  );
            and_expression = equality_expression 
              >> *(('&' >> equality_expression)[phx::bind(&fPushOp, "OP_AND")]); 
            exclusive_or_expression = and_expression 
              >> *(('^' >> and_expression)[phx::bind(&fPushOp, "OP_XOR")]); 
            inclusive_or_expression = exclusive_or_expression 
              >> *(('|' >> exclusive_or_expression)[phx::bind(&fPushOp, "OP_OR")]); 
            single_expression = inclusive_or_expression;
            series_expression = inclusive_or_expression 
              >> *((',' >> inclusive_or_expression)[phx::bind(&fPushOp, "OP_SERIES")]);
            logical_and_expression = series_expression
              >> *(("&&" >> series_expression)[phx::bind(&fPushOp, "OP_LOGICAL_AND")]); 
            logical_or_expression = logical_and_expression 
              >> *(("||" >> logical_and_expression)[phx::bind(&fPushOp, "OP_LOGICAL_OR")]);
            expression = logical_or_expression;
    
            result = expression;
        }
    };
    
    int main(){
      Calculator<std::string::const_iterator> calc;
    
      const std::string expr("!3 && 0,1");
      std::string::const_iterator it = expr.begin();
    
      phrase_parse(it, expr.end(), calc, qi::space);
    
      std::cout << "Remaining: " << std::string(it,expr.end()) << std::endl;
    
      return 0;
    }
    

    【讨论】:

    • 我认为parse(it, expr.end(), calc, qi::space); 中的qi::space 做同样的事情?
    • 它只是告诉解析器 API 要传递什么 Skipper instance,但语法忽略它!
    • 我已经解释了修复所需的步骤,并发布了a working sample
    • 我似乎无法编译您的工作示例? boost/spirit/home/qi/nonterminal/rule.hpp:303: error: no match for call to '(const boost::function&lt;bool(__gnu_cxx::__normal_iterator&lt;const char*, std::basic_string&lt;char, std::char_traits&lt;char&gt;, std::allocator&lt;char&gt; &gt; &gt;&amp;, const __gnu_cxx::__normal_iterator&lt;const char*, std::basic_string&lt;char, std::char_traits&lt;char&gt;, std::allocator&lt;char&gt; &gt; &gt;&amp;, ...
    • 我只能假设您没有全部复制/粘贴 :) 考虑再次复制,因为我改进了一些东西。另请参阅答案中的备注。
    猜你喜欢
    • 1970-01-01
    • 2017-08-10
    • 1970-01-01
    • 1970-01-01
    • 1970-01-01
    • 2023-03-12
    • 1970-01-01
    • 1970-01-01
    • 1970-01-01
    相关资源
    最近更新 更多